Spontaneous Speech Language Modeling using N-gram based Similarity

Park Young-Hee;Chung Minhwa;

대한음성학회지:말소리 (MALSORI)

제46호
/
Pages.117-126
/
2003
/
1226-1173(pISSN)

대한음성학회 (The Korean Society Of Phonetic Sciences And Speech Technology)

N-gram 기반의 유사도를 이용한 대화체 연속 음성 언어 모델링

Spontaneous Speech Language Modeling using N-gram based Similarity

박영희 (서강대) ;
정민화 (서강대)

발행 : 2003.06.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

This paper presents our language model adaptation for Korean spontaneous speech recognition. Korean spontaneous speech is observed various characteristics of content and style such as filled pauses, word omission, and contraction as compared with the written text corpus. Our approaches focus on improving the estimation of domain-dependent n-gram models by relevance weighting out-of-domain text data, where style is represented by n-gram based tf/sup */idf similarity. In addition to relevance weighting, we use disfluencies as Predictor to the neighboring words. The best result reduces 9.7% word error rate relatively and shows that n-gram based relevance weighting reflects style difference greatly and disfluencies are good predictor also.

대한음성학회지:말소리 (MALSORI)

N-gram 기반의 유사도를 이용한 대화체 연속 음성 언어 모델링

Spontaneous Speech Language Modeling using N-gram based Similarity

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)