Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments

Yoon, Jang-Hyuk;Chung, Young-Joo;

The Journal of the Acoustical Society of Korea

제29권2E호
/
Pages.100-106
/
2010
/
1225-4428(pISSN)

한국음향학회 (The Acoustical Society of Korea)

잡음 환경하에서의 다 모델 기반인식기와 다 스타일 학습방법과의 성능비교

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments

윤장혁 ;
정용주

Yoon, Jang-Hyuk (Department of Electronics Engineering, Keimyung University) ;
Chung, Young-Joo (Department of Electronics Engineering, Keimyung University)

투고 : 2010.05.07
심사 : 2010.06.07
발행 : 2010.06.30

PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Multiple-model speech recognizer has been shown to be quite successful in noisy speech recognition. However, its performance has usually been tested using the general speech front-ends which do not incorporate any noise adaptive algorithms. For the accurate evaluation of the effectiveness of the multiple-model frame in noisy speech recognition, we used the state-of-the-art front-ends and compared its performance with the well-known multi-style training method. In addition, we improved the multiple-model speech recognizer by employing N-best reference HMMs for interpolation and using multiple SNR levels for training each of the reference HMM.

키워드

참고문헌

M. J. F. Gales, "Model based techniques for noise-robust speech recognition", Ph.D. Dissertation, University of Cambridge, 1995.
P. J. Moreno, "Speech recognition in noisy environments", Ph.D. Dissertation, Carnegie Mellon University, 1996.
S. F. Ball. "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. 27, pp.113-120, 1979. https://doi.org/10.1109/TASSP.1979.1163209
H. Xu, Z.-H. Tan, P. Dalsgaard and B. Lindberg, "Robust Speech Recognition on Noise and SNR Classification-a Multiple-Model Framework", in Proc. Interspeech, 2005.
ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm Compression algorithm, ETSI Standard ES 202 108., 2000.
ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced Front-end feature extraction algorithm; Compression algorithm, ETSI Standard ES 202 050. 2002.
D. Macho, L. Mauuary, B. Noe, Y. Cheng, D. Eahey, D. Jouvet, H. Kelleher, D. Pearce, F. Saadoun, "Evaluation of a noise-robust DSR front-end on Aurora databases", in Proc. ICSLP, pp. 17-20, 2002.
B. H. Juang and L. R. Rabiner, "A Probabilistic Distance Measure for Hidden Markov Models", AT&T Technology Journal, pp. 391-408, 1984.
D. Pearce and H. Hirsch, The Aurora experimental framework for the performance evaluation of speech recognition systems under conditions", in Proc. ICSLP, pp.29-32, 2000.

The Journal of the Acoustical Society of Korea

잡음 환경하에서의 다 모델 기반인식기와 다 스타일 학습방법과의 성능비교

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)