Speech/Music Discrimination Using Mel-Cepstrum Modulation Energy

Kim, Bong-Wan;Choi, Dea-Lim;Lee, Yong-Ju;

대한음성학회지:말소리 (MALSORI)

제64호
/
Pages.89-103
/
2007
/
1226-1173(pISSN)

대한음성학회 (The Korean Society Of Phonetic Sciences And Speech Technology)

멜 켑스트럼 모듈레이션 에너지를 이용한 음성/음악 판별

Speech/Music Discrimination Using Mel-Cepstrum Modulation Energy

김봉완 (음성정보기술산업지원센터) ;
최대림 (음성정보기술산업지원센터) ;
이용주 (원광대 전기 전자 및 정보공학부, 음성정보기술산업지원센터)

발행 : 2007.12.30

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

In this paper, we introduce mel-cepstrum modulation energy (MCME) for a feature to discriminate speech and music data. MCME is a mel-cepstrum domain extension of modulation energy (ME). MCME is extracted on the time trajectory of Mel-frequency cepstral coefficients, while ME is based on the spectrum. As cepstral coefficients are mutually uncorrelated, we expect the MCME to perform better than the ME. To find out the best modulation frequency for MCME, we perform experiments with 4 Hz to 20 Hz modulation frequency. To show effectiveness of the proposed feature, MCME, we compare the discrimination accuracy with the results obtained from the ME and the cepstral flux.

대한음성학회지:말소리 (MALSORI)

멜 켑스트럼 모듈레이션 에너지를 이용한 음성/음악 판별

Speech/Music Discrimination Using Mel-Cepstrum Modulation Energy

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)