Weighted filter bank analysis and model adaptation for improving the recognition performance of partially corrupted speech

부분 손상된 음성의 인식성능 향상을 위한 가중 필터뱅크 분석 및 모델 적응

  • Published : 2002.12.01

Abstract

We propose a weighted filter bank analysis and model adaptation (WFBA-MA) scheme to improve the utilization of uncorrupted or less severely corrupted frequency regions for robust speech recognition. A weighted met frequency cepstral coefficient is obtained by weighting log filter bank energies with reliability coefficients and hidden Markov models are also modified to reflect the local reliabilities. Experimental results on TIDIGITS database corrupted by band-limited noises and car noise indicated that the proposed WFBA-MA scheme utilizes the uncorrupted speech information well, significantly improving recognition performance in comparison to multi-band speech recognition systems.

Keywords