Robust Speaker Identification Using Linear Transformation Optimized for Diagonal Covariance GMM

Kim, Min-Seok;Yang, Il-Ho;Yu, Ha-Jin;

Journal of the Korean Society of Phonetic Sciences and Speech Technology: Malsori (MALSORI)

No. 65 / Pages 67-80 / 2008 / 1226-1173 (pISSN)

The Korean Society of Phonetic Sciences and Speech Technology

대각공분산 GMM에 최적인 선형변환을 이용한 강인한 화자식별

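Kim, Min-Seok (School of Computer Science, University of Seoul);
Yang, Il-Ho (School of Computer Science, University of Seoul);
Yu, Ha-Jin (School of Computer Science, University of Seoul)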

Published: 2008.03.30

Abstract

We have been building a text-independent speaker recognition system that is robust to unknown channel and noise environments. In this paper, we propose a linear transformation to obtain robust features. The transformation is optimized to maximize the distances between the Gaussian mixtures. We use a rotation of the axes to cope with the problem of scaling the transformation matrix. The proposed transformation is similar to PCA or LDA, but can achieve better results in some special cases where PCA and LDA cannot work properly. We evaluate the proposed method on the YOHO database and compare the results with PCA and LDA. The results show that the proposed method outperforms the baseline, PCA, and LDA.
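
The idea of rotating the feature axes so that diagonal-covariance Gaussians become better separated can be illustrated with a small toy sketch. This is not the authors' algorithm: the abstract does not give their objective function or optimizer, so the separation score, the grid search over angles, the 2-D synthetic data, and the names `rotation`, `diag_separation`, and `best_rotation` are all assumptions made for illustration. Because a rotation is orthogonal, it cannot inflate the score simply by rescaling the features, which is one plausible reading of why the abstract restricts the transformation to rotations.

```python
import numpy as np

def rotation(theta):
    """2-D rotation matrix (orthogonal, so it never rescales the features)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def diag_separation(groups, R):
    """Assumed score: pairwise squared mean distance, normalized per axis by
    the diagonal (per-dimension) variances measured after applying R."""
    stats = []
    for x in groups:
        y = x @ R.T                      # transform every feature vector
        stats.append((y.mean(axis=0), y.var(axis=0)))
    score = 0.0
    for i in range(len(stats)):
        for j in range(i + 1, len(stats)):
            (mi, vi), (mj, vj) = stats[i], stats[j]
            score += float(np.sum((mi - mj) ** 2 / (vi + vj)))
    return score

def best_rotation(groups, n_angles=180):
    """Coarse grid search over rotation angles in [0, pi)."""
    thetas = np.linspace(0.0, np.pi, n_angles, endpoint=False)
    scores = [diag_separation(groups, rotation(t)) for t in thetas]
    k = int(np.argmax(scores))
    return thetas[k], scores[k]

# Synthetic data: two strongly correlated clusters whose means differ along
# the low-variance direction, a case that axis-aligned Gaussians model poorly.
rng = np.random.default_rng(0)
cov = np.array([[1.0, 0.9], [0.9, 1.0]])
a = rng.multivariate_normal([0.0, 0.0], cov, size=2000)
b = rng.multivariate_normal([1.0, -1.0], cov, size=2000)

baseline = diag_separation([a, b], np.eye(2))   # score on the original axes
theta, score = best_rotation([a, b])
print(f"baseline={baseline:.2f}  best={score:.2f}  theta={np.degrees(theta):.0f} deg")
```

In this toy setup the rotated axes align with the principal directions of the shared covariance, so the diagonal-variance model captures the cluster shapes much better than it does on the original axes. A real system would work in a higher-dimensional feature space and would need a different parameterization and optimizer.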
