화자 식별을 위한 GMM의 혼합 성분의 개수 추정

Estimation of Mixture Numbers of GMM for Speaker Identification

  • 이윤정 (숭실대학교 정보통신 전자공학부) ;
  • 이기용 (숭실대학교 정보통신 전자공학부)
  • 발행 : 2004.06.01

초록

In general, Gaussian mixture model(GMM) is used to estimate the speaker model for speaker identification. The parameter estimates of the GMM are obtained by using the expectation-maximization (EM) algorithm for the maximum likelihood(ML) estimation. However, if the number of mixtures isn't defined well in the GMM, those parameters are obtained inappropriately. The problem to find the number of components is significant to estimate the optimal parameter in mixture model. In this paper, to estimate the optimal number of mixtures, we propose the method that starts from the sufficient mixtures, after, the number is reduced by investigating the mutual information between mixtures for GMM. In result, we can estimate the optimal number of mixtures. The effectiveness of the proposed method is shown by the experiment using artificial data. Also, we performed the speaker identification applying the proposed method comparing with other approaches.

키워드