Adaptive Wavelet Based Speech Enhancement with Robust VAD in Non-stationary Noise Environment

  • Sungwook Chang (School of Electrical and Computer Engineering, Hanyang University) ;
  • Sungil Jung (School of Electrical and Computer Engineering, Hanyang University) ;
  • Younghun Kwon (Department of Physics, Hanyang University) ;
  • Yang, Sung-il (School of Electrical and Computer Engineering, Hanyang University)
  • Published : 2003.12.01

Abstract

We present an adaptive wavelet packet based speech enhancement method with robust voice activity detection (VAD) in non-stationary noise environment. The proposed method can be divided into two main procedures. The first procedure is a VAD with adaptive wavelet packet transform. And the other is a speech enhancement procedure based on the proposed VAD method. The proposed VAD method shows remarkable performance even in low SNRs and non-stationary noise environment. And subjective evaluation shows that the performance of the proposed speech enhancement method with wavelet bases is better than that with Fourier basis.

Keywords

References

  1. D. L. Donoho, 'Denolslnq by soft thresholding,' IEEE Trans. on Information Theory, 41 (3), 613-627, 1995 https://doi.org/10.1109/18.382009
  2. I. M. Johnstone and B. W. Silverman, 'Wavelet thresh-old estimators for data with correlated noise,' J. Roy. Statist. Soc. B, 59, 319-351, 1997 https://doi.org/10.1111/1467-9868.00071
  3. Sungwook Chang, Sung-il Jung, Younghun Kwon, and Sung-il Yang, 'Speech enhancement using wavelet packet transform,' ICSLP 2002, 1809-1812, 2002
  4. Sungwook Chang, Younghun Kwon, and Sung-il Yang, 'Speech enhancement for non-stationary noise envi-ronment by adaptive wavelet packet,' ICASSP 2002, 1, 561-564, 2002
  5. H. G. Hirsch and C. Ehrlicher, 'Noise estimation techniques for robust speech recognition,' ICASSP 95, 153-156, 1995
  6. M. Berouti, R. Schwartz, and J. Makhoul, 'Enhancement of speech corrupted by acoustic noise,' ICASSP-79, 208-211, 1979