Speech/Music Discrimination Using Multi-dimensional MMCD

Choi, Mu-Yeol;Song, Hwa-Jeon;Park, Seul-Han;Kim, Hyung-Soon;

Proceedings of the KSPS conference (대한음성학회:학술대회논문집)

2006.11a
/
Pages.142-145
/
2006

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Speech/Music Discrimination Using Multi-dimensional MMCD

다차원 MMCD를 이용한 음성/음악 판별

Choi, Mu-Yeol (Department of Electronics Engineering, Pusan National University) ;
Song, Hwa-Jeon (Department of Electronics Engineering, Pusan National University) ;
Park, Seul-Han (Department of Electronics Engineering, Pusan National University) ;
Kim, Hyung-Soon (Department of Electronics Engineering, Pusan National University)

최무열 (부산대학교 전자공학과) ;
송화전 (부산대학교 전자공학과) ;
박슬한 (부산대학교 전자공학과) ;
김형순 (부산대학교 전자공학과)

Published : 2006.11.17

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Discrimination between speech and music is important in many multimedia applications. Previously we proposed a new parameter for speech/music discrimination, the mean of minimum cepstral distances (MMCD), and it outperformed the conventional parameters. One weakness of it is that its performance depends on range of candidate frames to compute the minimum cepstral distance, which requires the optimal selection of the range experimentally. In this paper, to alleviate the problem, we propose a multi-dimensional MMCD parameter which consists of multiple MMCDs with different ranges of candidate frames. Experimental results show that the multi-dimensional MMCD parameter yields an error rate reduction of 22.5% compared with the optimally chosen one-dimensional MMCD parameter.

Proceedings of the KSPS conference (대한음성학회:학술대회논문집)

Speech/Music Discrimination Using Multi-dimensional MMCD

다차원 MMCD를 이용한 음성/음악 판별

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)