Speaker Identification in Small Training Data Environment using MLLR Adaptation Method

Kim, Se-hyun; Oh, Yung-Hwan

Proceedings of the KSPS Conference (대한음성학회:학술대회논문집)

The Korean Society of Phonetic Sciences and Speech Technology (대한음성학회)

MLLR 화자적응 기법을 이용한 적은 학습자료 환경의 화자식별

Kim, Se-hyun (김세현) (Voice Interface Lab., Division of Computer Science, Dept. of Electrical Engineering and Computer Science, KAIST);
Oh, Yung-Hwan (오영환) (Voice Interface Lab., Division of Computer Science, Dept. of Electrical Engineering and Computer Science, KAIST)

Published: 2005.11.17

Abstract

Speaker identification is the process of automatically identifying who is speaking on the basis of information obtained from speech waves. In the training phase, each speaker's model is trained on that speaker's speech data. GMMs (Gaussian Mixture Models), which have been applied successfully to speaker modeling in text-independent speaker identification, are not effective when training data are insufficient. This paper proposes a speaker modeling method based on MLLR (Maximum Likelihood Linear Regression), a technique used for speaker adaptation in speech recognition. Instead of training a speaker-dependent (SD) model, we build an SD-like model using MLLR adaptation. The proposed system outperforms GMMs in a small-training-data environment.
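
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' system: it assumes a shared speaker-independent GMM with diagonal covariances, uses a simplified diagonal MLLR transform (one scale and one bias per feature dimension, where standard MLLR estimates a full matrix), and runs a single adaptation pass. All function names and the dictionary layout (`weights`, `means`, `vars`) are hypothetical choices made for this example.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return -0.5 * sum(math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
                      for xi, m, v in zip(x, mean, var))

def component_log_joint(x, gmm):
    """log(w_m * N(x | mu_m, var_m)) for every mixture component m."""
    return [math.log(w) + log_gauss_diag(x, mu, var)
            for w, mu, var in zip(gmm["weights"], gmm["means"], gmm["vars"])]

def logsumexp(values):
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

def avg_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood of the frames under the GMM."""
    return sum(logsumexp(component_log_joint(x, gmm)) for x in frames) / len(frames)

def adapt_means_diag_mllr(frames, gmm):
    """One pass of simplified MLLR: new_mean[i] = scale[i] * mean[i] + bias[i].

    Assumption: the transform is diagonal and shared by all components, so
    each dimension reduces to a 2x2 weighted least-squares problem.
    """
    dim = len(gmm["means"][0])
    # Per-dimension accumulators: sum(g*mu*mu), sum(g*mu), sum(g),
    # sum(g*o*mu), sum(g*o), where g = posterior / variance.
    acc = [[0.0] * 5 for _ in range(dim)]
    for x in frames:
        log_joint = component_log_joint(x, gmm)
        norm = logsumexp(log_joint)
        for m, lj in enumerate(log_joint):
            gamma = math.exp(lj - norm)  # component posterior for this frame
            for i in range(dim):
                mu = gmm["means"][m][i]
                g = gamma / gmm["vars"][m][i]
                a = acc[i]
                a[0] += g * mu * mu
                a[1] += g * mu
                a[2] += g
                a[3] += g * x[i] * mu
                a[4] += g * x[i]
    scale, bias = [], []
    for s_mm, s_m, s_1, s_om, s_o in acc:
        det = s_mm * s_1 - s_m * s_m
        if abs(det) < 1e-10:
            # Degenerate system: fall back to a bias-only shift.
            scale.append(1.0)
            bias.append((s_o - s_m) / s_1)
        else:
            # Cramer's rule on the 2x2 normal equations.
            scale.append((s_om * s_1 - s_o * s_m) / det)
            bias.append((s_mm * s_o - s_m * s_om) / det)
    new_means = [[scale[i] * mu[i] + bias[i] for i in range(dim)]
                 for mu in gmm["means"]]
    # Weights and variances are left untouched; only the means move.
    return {"weights": gmm["weights"], "means": new_means, "vars": gmm["vars"]}

def identify(frames, speaker_models):
    """Return the enrolled speaker whose model scores the frames highest."""
    return max(speaker_models,
               key=lambda name: avg_log_likelihood(frames, speaker_models[name]))
```

In this sketch, enrollment would call `adapt_means_diag_mllr` once per speaker on that speaker's few training frames, and `identify` would then pick the adapted model with the highest average log-likelihood on the test utterance. Because only two parameters per dimension are estimated, the adaptation stays well-conditioned with far less data than retraining every mean, weight, and variance of a GMM.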
