대각공분산 GMM에 최적인 선형변환을 이용한 강인한 화자식별

Robust Speaker Identification Using Linear Transformation Optimized for Diagonal Covariance GMM

  • 김민석 (서울시립대학교 컴퓨터과학부) ;
  • 양일호 (서울시립대학교 컴퓨터과학부) ;
  • 유하진 (서울시립대학교 컴퓨터과학부)
  • 발행 : 2008.03.30

초록

We have been building a text-independent speaker recognition system that is robust to unknown channel and noise environments. In this paper, we propose a linear transformation to obtain robust features. The transformation is optimized to maximize the distances between the Gaussian mixtures. We use rotation of the axes, to cope with the problem of scaling the transformation matrix. The proposed transformation is similar to PCA or LDA, but can achieve better result in some special cases where PCA and LDA can not work properly. We use YOHO database to evaluate the proposed method and compare the result with PCA and LDA. The results show that the proposed method outperforms all the baseline, PCA and LDA.

키워드