Speech/Music Discrimination Using Multi-dimensional MMCD

다차원 MMCD를 이용한 음성/음악 판별

  • Choi, Mu-Yeol (Department of Electronics Engineering, Pusan National University)
  • Song, Hwa-Jeon (Department of Electronics Engineering, Pusan National University)
  • Park, Seul-Han (Department of Electronics Engineering, Pusan National University)
  • Kim, Hyung-Soon (Department of Electronics Engineering, Pusan National University)
  • Published : 2006.11.17

Abstract

Discriminating between speech and music is important in many multimedia applications. We previously proposed a new parameter for speech/music discrimination, the mean of minimum cepstral distances (MMCD), which outperformed conventional parameters. One weakness of MMCD is that its performance depends on the range of candidate frames used to compute the minimum cepstral distance, so the optimal range must be selected experimentally. To alleviate this problem, this paper proposes a multi-dimensional MMCD parameter that consists of multiple MMCDs computed with different ranges of candidate frames. Experimental results show that the multi-dimensional MMCD parameter reduces the error rate by 22.5% compared with the optimally chosen one-dimensional MMCD parameter.
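
To make the idea concrete, the following is a minimal Python sketch of how such a feature might be computed. The abstract does not give the exact definition, so several points here are assumptions: the cepstral distance is taken to be Euclidean, the candidate frames for frame t are taken to be the frames from t + lag_min to t + lag_max, and the function names (cepstral_distance, mmcd, multi_dim_mmcd) are hypothetical. It illustrates the general approach and is not the authors' implementation.

```python
import math


def cepstral_distance(a, b):
    # Assumption: Euclidean distance between two cepstral vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mmcd(cepstra, lag_min, lag_max):
    """Mean of minimum cepstral distances over one segment.

    cepstra: list of per-frame cepstral vectors (lists of floats).
    Assumption: the candidates for frame t are frames t+lag_min .. t+lag_max,
    clipped to the end of the segment.
    """
    if not 1 <= lag_min <= lag_max:
        raise ValueError("expected 1 <= lag_min <= lag_max")
    num_frames = len(cepstra)
    minima = []
    # Only frames that have at least one candidate inside the segment are used.
    for t in range(num_frames - lag_min):
        last = min(t + lag_max, num_frames - 1)
        minima.append(
            min(
                cepstral_distance(cepstra[t], cepstra[k])
                for k in range(t + lag_min, last + 1)
            )
        )
    if not minima:
        raise ValueError("segment is too short for the requested lag range")
    return sum(minima) / len(minima)


def multi_dim_mmcd(cepstra, lag_ranges):
    # One MMCD value per candidate range, stacked into a feature vector.
    return [mmcd(cepstra, lo, hi) for lo, hi in lag_ranges]


# Toy segment whose one-dimensional "cepstrum" repeats every 4 frames.
frames = [[0.0], [1.0], [2.0], [3.0]] * 5
features = multi_dim_mmcd(frames, [(1, 3), (4, 4)])
```

In this toy segment, the range (4, 4) always contains an identical frame and the range (1, 3) never does, so the two dimensions differ sharply. This mirrors the dependence on the candidate range noted in the abstract, and stacking several ranges into one vector avoids committing to a single range in advance.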

Keywords