Performance Improvement ofSpeech Recognition Based on SPLICEin Noisy Environments

Kim, Jong-Hyeon;Song, Hwa-Jeon;Lee, Jong-Seok;Kim, Hyung-Soon;

MALSORI (대한음성학회지:말소리)

Issue 53
/
Pages.103-118
/
2005
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Performance Improvement ofSpeech Recognition Based on SPLICEin Noisy Environments

SPLICE 방법에 기반한 잡음 환경에서의 음성 인식 성능 향상

김종현 (부산대학교 전자공학과 음성통신연구실) ;
송화진 (부산대학교 전자공학과 음성통신연구실) ;
이종석 (튜브미디어) ;
김형순 (부산대학교 전자공학과 음성통신연구실)

Published : 2005.03.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

The performance of speech recognition system is degraded by mismatch between training and test environments. Recently, Stereo-based Piecewise LInear Compensation for Environments (SPLICE) was introduced to overcome environmental mismatch using stereo data. In this paper, we propose several methods to improve the conventional SPLICE and evaluate them in the Aurora2 task. We generalize SPLICE to compensate for covariance matrix as well as mean vector in the feature space, and thereby yielding the error rate reduction of 48.93%. We also employ the weighted sum of correction vectors using posterior probabilities of all Gaussians, and the error rate reduction of 48.62% is achieved. With the combination of the above two methods, the error rate is reduced by 49.61% from the Aurora2 baseline system.

MALSORI (대한음성학회지:말소리)

Performance Improvement ofSpeech Recognition Based on SPLICEin Noisy Environments

SPLICE 방법에 기반한 잡음 환경에서의 음성 인식 성능 향상

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)