SPLICE 방법에 기반한 잡음 환경에서의 음성 인식 성능 향상

Performance Improvement ofSpeech Recognition Based on SPLICEin Noisy Environments

  • 김종현 (부산대학교 전자공학과 음성통신연구실) ;
  • 송화진 (부산대학교 전자공학과 음성통신연구실) ;
  • 이종석 (튜브미디어) ;
  • 김형순 (부산대학교 전자공학과 음성통신연구실)
  • 발행 : 2005.03.01

초록

The performance of speech recognition system is degraded by mismatch between training and test environments. Recently, Stereo-based Piecewise LInear Compensation for Environments (SPLICE) was introduced to overcome environmental mismatch using stereo data. In this paper, we propose several methods to improve the conventional SPLICE and evaluate them in the Aurora2 task. We generalize SPLICE to compensate for covariance matrix as well as mean vector in the feature space, and thereby yielding the error rate reduction of 48.93%. We also employ the weighted sum of correction vectors using posterior probabilities of all Gaussians, and the error rate reduction of 48.62% is achieved. With the combination of the above two methods, the error rate is reduced by 49.61% from the Aurora2 baseline system.

키워드