DOI QR코드

DOI QR Code

The Performance Improvement of Speech Recognition System based on Stochastic Distance Measure

  • Jeon, B.S. (School of Electrical & Computer Engineering, Chungbuk National University) ;
  • Lee, D.J. (School of Electrical & Computer Engineering, Chungbuk National University) ;
  • Song, C.K. (School of Electrical & Computer Engineering, Chungbuk National University) ;
  • Lee, S.H. (School of Electrical and Computer Engineering, Pusan National University) ;
  • Ryu, J.W. (School of Electrical & Computer Engineering, Chungbuk National University)
  • 발행 : 2004.09.01

초록

In this paper, we propose a robust speech recognition system under noisy environments. Since the presence of noise severely degrades the performance of speech recognition system, it is important to design the robust speech recognition method against noise. The proposed method adopts a new distance measure technique based on stochastic probability instead of conventional method using minimum error. For evaluating the performance of the proposed method, we compared it with conventional distance measure for the 10-isolated Korean digits with car noise. Here, the proposed method showed better recognition rate than conventional distance measure for the various car noisy environments.

키워드

참고문헌

  1. L. R. Rabiner and B. H. Juang, Fundamentals of speech recognition, Prentice-Hall, 1993
  2. B. H. Juang, 'Speech recognition in adverse environments', Computer Speech and Language, Vol. 5, pp. 275-294, 1991 https://doi.org/10.1016/0885-2308(91)90011-E
  3. H. W. Ruehl, S. Dobler, J. Weith and P. Meyer, 'Speech Recognition in the Noisy Car Environment', Speech Commun., Vol. 10, No.1, pp. 11-22, Feb, 1991 https://doi.org/10.1016/0167-6393(91)90024-N
  4. D. Mansour and B. J. Juang, 'The Short-time modified coherence representation and its application for noisy speech recognition', IEEE Trans. ASSP., Vol. 37, No.6, pp. 795-804, 1989 https://doi.org/10.1109/ASSP.1989.28053
  5. H. Hermansky, N. Morgan and H. G. Hirsh, 'Recognition of speech in additve and convolutional noise based on RASTA spectral processing', Proc. ICASSP, pp., 1993
  6. T. H. Applebaun and B. A. Hanson, 'Regression feature for recognition of speech on quiet and in noise', Proc., ICASSP, pp. 985-988, 1991
  7. D. Mansour and B. J. Juang, 'A family of distortion measures based upon projection operation for robust speech recognition', IEEE, Trans., ASSP, Vol. 37, No. 11, pp. 1659-1671, 1989
  8. S. F. Boll, 'Suppression of acoustic noise in speech using spectral subtraction', IEEE, Trans., ASSP, Vol. 37, No.2, pp. 113-120, 1979
  9. P. Lockwood and J. Boudy, 'Experiments with a nonlinear spectral subtracter (nss), hidden markov models and the projection, for robust speech recognition in cars', Speech Commn., Vol. 11, pp. 215-228, 1992 https://doi.org/10.1016/0167-6393(92)90016-Z
  10. O. Ghitza, 'Auditory nerve representation as a front-end for speech recognition in a noisy environment', Computer, Speech and Language, Vol. 1, pp. 109-130, 1986 https://doi.org/10.1016/S0885-2308(86)80018-3
  11. Stephane Mallat, A wavelet tour of signal processing, Academic press, 1999
  12. Gowdy, J. N. and Tufekcim, Z. 'Mel-scaled discrete wavelet coefficients for speech recognition', Proc., ICASSP, Vol. 3, pp. 1351-1354, 2000
  13. Yu Hao and Xiaoyan Zhu, 'A new feature in speech recognition based on wavelet transform', Proc., ICASSP, Vol. 3, pp. 1526-1529, 2000
  14. Kidae Kim, Dae Hee Youn, Chul Hee Lee, 'Evaluation of wavelet filters for speech recognition', Systems, Mans, and Cybernetics, Vol. 4, pp. 2891-2894, 2000