Proceedings of the KSPS conference (대한음성학회:학술대회논문집)
- 2003.05a
- /
- Pages.83-86
- /
- 2003
Language Model Adaptation for Conversational Speech Recognition
대화체 연속음성 인식을 위한 언어모델 적응
Abstract
This paper presents our style-based language model adaptation for Korean conversational speech recognition. Korean conversational speech is observed various characteristics of content and style such as filled pauses, word omission, and contraction as compared with the written text corpora. For style-based language model adaptation, we report two approaches. Our approaches focus on improving the estimation of domain-dependent n-gram models by relevance weighting out-of-domain text data, where style is represented by n-gram based tf*idf similarity. In addition to relevance weighting, we use disfluencies as predictor to the neighboring words. The best result reduces 6.5% word error rate absolutely and shows that n-gram based relevance weighting reflects style difference greatly and disfluencies are good predictor.
Keywords