Improvement of Speech Reconstructed from MFCC Using GMM

Choi, Won-Young;Choi, Mu-Yeol;Kim, Hyung-Soon;

대한음성학회지:말소리 (MALSORI)

제53호
/
Pages.129-141
/
2005
/
1226-1173(pISSN)

대한음성학회 (The Korean Society Of Phonetic Sciences And Speech Technology)

GMM을 이용한 MFCC로부터 복원된 음성의 개선

Improvement of Speech Reconstructed from MFCC Using GMM

최원영 (부산대학교 전자공학과 음성통신연구실) ;
최무열 (부산대학교 전자공학과 음성통신연구실) ;
김형순 (부산대학교 전자공학과 음성통신연구실)

발행 : 2005.03.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

The goal of this research is to improve the quality of reconstructed speech in the Distributed Speech Recognition (DSR) system. For the extended DSR, we estimate the variable Maximum Voiced Frequency (MVF) from Mel-Frequency Cepstral Coefficient (MFCC) based on Gaussian Mixture Model (GMM), to implement realistic harmonic plus noise model for the excitation signal. For the standard DSR, we also make the voiced/unvoiced decision from MFCC based on GMM because the pitch information is not available in that case. The perceptual test reveals that speech reconstructed by the proposed method is preferred to the one by the conventional methods.

대한음성학회지:말소리 (MALSORI)

GMM을 이용한 MFCC로부터 복원된 음성의 개선

Improvement of Speech Reconstructed from MFCC Using GMM

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)