Acoustic and Pronunciation Model Adaptation Based on Context dependency for Korean-English Speech Recognition

한국인의 영어 인식을 위한 문맥 종속성 기반 음향모델/발음모델 적응

  • 오유리 (광주과학기술원 정보통신공학과 휴먼컴퓨팅 연구실) ;
  • 김홍국 (광주과학기술원 정보통신공학과 휴먼컴퓨팅 연구실) ;
  • 이연우 (목포대학교 공과대학 정보공학부 정보통신공학) ;
  • 이성로 (목포대학교 공과대학 정보공학부 정보전자공학)
  • Published : 2008.12.30

Abstract

In this paper, we propose a hybrid acoustic and pronunciation model adaptation method based on context dependency for Korean-English speech recognition. The proposed method is performed as follows. First, in order to derive pronunciation variant rules, an n-best phoneme sequence is obtained by phone recognition. Second, we decompose each rule into a context independent (CI) or a context dependent (CD) one. To this end, it is assumed that a different phoneme structure between Korean and English makes CI pronunciation variabilities while coarticulation effects are related to CD pronunciation variabilities. Finally, we perform an acoustic model adaptation and a pronunciation model adaptation for CI and CD pronunciation variabilities, respectively. It is shown from the Korean-English speech recognition experiments that the average word error rate (WER) is decreased by 36.0% when compared to the baseline that does not include any adaptation. In addition, the proposed method has a lower average WER than either the acoustic model adaptation or the pronunciation model adaptation.

Keywords