DOI QR코드

DOI QR Code

Interference Suppression Using Principal Subspace Modification in Multichannel Wiener Filter and Its Application to Speech Recognition

  • Kim, Gi-Bak (Department of Electrical Engineering, University of Texas at Dallas)
  • Received : 2010.01.20
  • Accepted : 2010.09.29
  • Published : 2010.12.31

Abstract

It has been shown that the principal subspace-based multichannel Wiener filter (MWF) provides better performance than the conventional MWF for suppressing interference in the case of a single target source. It can efficiently estimate the target speech component in the principal subspace which estimates the acoustic transfer function up to a scaling factor. However, as the input signal-to-interference ratio (SIR) becomes lower, larger errors are incurred in the estimation of the acoustic transfer function by the principal subspace method, degrading the performance in interference suppression. In order to alleviate this problem, a principal subspace modification method was proposed in previous work. The principal subspace modification reduces the estimation error of the acoustic transfer function vector at low SIRs. In this work, a frequency-band dependent interpolation technique is further employed for the principal subspace modification. The speech recognition test is also conducted using the Sphinx-4 system and demonstrates the practical usefulness of the proposed method as a front processing for the speech recognizer in a distant-talking and interferer-present environment.

Keywords

References

  1. M. Brandstein and D. Ward, Eds., Microphone Arrays, Springer- Verlag, 2001.
  2. D. Florencio and H. Malvar, "Multichannel Filtering for Optimum Noise Reduction in Microphone Arrays," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., Salt Lake City, UT, USA, May 2001, pp. 197-200.
  3. S. Doclo and M. Moonen, "GSVD-Based Optimal Filtering for Single and Multimicrophone Speech Enhancement," IEEE Trans. Signal Process., vol. 50, no. 9, Sept. 2002, pp. 2230-2244. https://doi.org/10.1109/TSP.2002.801937
  4. A. Spriet, M. Moonen, and J. Wouters, "Robustness Analysis of Multichannel Wiener Filtering and Generalized Sidelobe Cancellation for Multimicrophone Noise Reduction in Hearing Aid Applications," IEEE Trans. Speech Audio Process., vol. 13, no. 4, July 2005, pp. 487-503.
  5. S. Doclo and M. Moonen, "Combined Frequency-Domain Dereverberation and Noise Reduction Technique for Multimicrophone Speech Enhancement," Int. Workshop Acoustic Echo Noise Control, Darmstadt, Germany, Sept. 2001, pp. 31-34.
  6. W. Herbordt, Sound Capture for Human/Machine Interfaces, Springer-Verlag, 2005.
  7. G. Kim and N.I. Cho, "Principal Subspace Modification for Multi-channel Wiener Filter in Multi-microphone Noise Reduction," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., 2008, pp. 4909-4912.
  8. G.H. Golub and C.F. Van Loan, Matrix Computations, Johns Hopkins University Press, 3rd ed., 1996.
  9. H. Van Trees, Detection, Estimation and Modulation Theory, Part IV: Optimum Array Processing, New York: Wiley, 2002.
  10. R.G. Leonard, "A Database for Speaker-Independent Digit Recognition," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., 1984, pp. 111-114.
  11. "RWCP Sound Scene Database in Real Acoustical Environments," Real World Computing Partnership, (c)1998-2001.
  12. The CMU Sphinx Group Open Source Speech Recognition Engines. Available: http://cmusphinx.sourceforge.net