Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments

  • Yoon, Jang-Hyuk (Department of Electronics Engineering, Keimyung University)
  • Chung, Young-Joo (Department of Electronics Engineering, Keimyung University)
  • Received: 2010.05.07
  • Reviewed: 2010.06.07
  • Published: 2010.06.30

Abstract

The multiple-model speech recognizer has been shown to be quite successful in noisy speech recognition. However, its performance has usually been tested with general-purpose speech front-ends that do not incorporate any noise-adaptive algorithms. To evaluate the effectiveness of the multiple-model framework in noisy speech recognition more accurately, we used state-of-the-art front-ends and compared the recognizer's performance with that of the well-known multi-style training method. In addition, we improved the multiple-model speech recognizer by employing N-best reference HMMs for interpolation and by using multiple SNR levels to train each reference HMM.
