MALSORI (대한음성학회지:말소리)
- Volume 68
- /
- Pages.33-47
- /
- 2008
- /
- 1226-1173(pISSN)
Acoustic and Pronunciation Model Adaptation Based on Context dependency for Korean-English Speech Recognition
한국인의 영어 인식을 위한 문맥 종속성 기반 음향모델/발음모델 적응
- Oh, Yoo-Rhee (GIST) ;
- Kim, Hong-Kook (GIST) ;
- Lee, Yeon-Woo ;
- Lee, Seong-Ro
- 오유리 (광주과학기술원 정보통신공학과 휴먼컴퓨팅 연구실) ;
- 김홍국 (광주과학기술원 정보통신공학과 휴먼컴퓨팅 연구실) ;
- 이연우 (목포대학교 공과대학 정보공학부 정보통신공학) ;
- 이성로 (목포대학교 공과대학 정보공학부 정보전자공학)
- Published : 2008.12.30
Abstract
In this paper, we propose a hybrid acoustic and pronunciation model adaptation method based on context dependency for Korean-English speech recognition. The proposed method is performed as follows. First, in order to derive pronunciation variant rules, an n-best phoneme sequence is obtained by phone recognition. Second, we decompose each rule into a context independent (CI) or a context dependent (CD) one. To this end, it is assumed that a different phoneme structure between Korean and English makes CI pronunciation variabilities while coarticulation effects are related to CD pronunciation variabilities. Finally, we perform an acoustic model adaptation and a pronunciation model adaptation for CI and CD pronunciation variabilities, respectively. It is shown from the Korean-English speech recognition experiments that the average word error rate (WER) is decreased by 36.0% when compared to the baseline that does not include any adaptation. In addition, the proposed method has a lower average WER than either the acoustic model adaptation or the pronunciation model adaptation.
Keywords