Acoustic Channel Compensation at Mel-frequency Spectrum Domain

  • Jeong, So-Young (Brain Science Research Center(BSRC) and also Division Of Electrical Engineering, Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology) ;
  • Oh, Sang-Hoon (Department of Information Communication Engineering, Mokwon University) ;
  • Lee, Soo-Young (BSRC and also Department of BioSystems, Korea Advanced Institute of Science and Technology)
  • Published : 2003.03.01

Abstract

The effects of linear acoustic channels have been analyzed and compensated at mel-frequency feature domain. Unlike popular RASTA filtering our approach incorporates separate filters for each mel-frequency band, which results in better recognition performance for heavy-reverberated speeches.

Keywords

References

  1. H. Hermansky, 'Should recognizers have ears?,' Speech Communication, 25, 3-27, 1998 https://doi.org/10.1016/S0167-6393(98)00027-2
  2. X. Huang, A. Acero and H.-W. Hon, Spoken language processing, Prentice Hall PTR, New Jersey, 2001
  3. H. Y. Jung and S. Y. Lee, 'On the temporal decorrelation of feature parameters for noise-robust speech recognition,' IEEE Trans. Speech and Audio Processing, 8 (4), 407-416, 2000 https://doi.org/10.1109/89.848222
  4. J. B. Allen and D. A. Berkley, 'Image method for efficiently simulating small-room acoustics,' Journal of Acoustic Society of America, 65(4), 943-950, 1979 https://doi.org/10.1121/1.382599
  5. D.-S. Kim and S.-Y. Lee and R. M. Kil, 'Auditory processing of speech signals for robust speech recognition in real-world noisy environments,' IEEE Trans. Speech and Audio Processing. 7 (1). 55-69, 1999 https://doi.org/10.1109/89.736331