MALSORI (대한음성학회지:말소리)
- Issue 58 / Pages 83-99 / 2006 / 1226-1173 (pISSN)
Adaptation and Clustering Method for Speaker Identification with Small Training Data
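Improving the Performance and Speed of a Speaker Identification System Using Speaker Adaptation and Clustering (Korean title: 화자적응과 군집화를 이용한 화자식별 시스템의 성능 및 속도 향상)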
Abstract
One key factor that hinders the widespread deployment of speaker identification technologies is the need for long enrollment utterances to guarantee a low error rate during identification. To gain user acceptance, adaptation algorithms that can enroll speakers from short utterances are essential. To this end, this paper applies maximum likelihood linear regression (MLLR) speaker adaptation to speaker enrollment and compares its performance against two other speaker modeling techniques: Gaussian mixture models (GMMs) and hidden Markov models (HMMs). In addition, to speed up identification, we apply a speaker clustering method that uses principal component analysis (PCA) and a weighted Euclidean distance as the distance measure. Experimental results show that the MLLR-adapted modeling method is the most effective for short enrollment utterances, and that GMMs perform better when long utterances are available.
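The clustering step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not give these details, so everything specific here is an assumption: each speaker is represented by a fixed-length vector (for example, stacked model means), PCA is computed by SVD, and the weights of the Euclidean distance are the inverse eigenvalues of the retained components. The function names and the toy data are hypothetical.

```python
import numpy as np

def pca_fit(X, n_components):
    """Return (mean, components, eigenvalues) for the rows of X."""
    mean = X.mean(axis=0)
    Xc = X - mean
    # Rows of vt are the principal directions of the centered data.
    _, s, vt = np.linalg.svd(Xc, full_matrices=False)
    eigvals = (s ** 2) / (X.shape[0] - 1)
    return mean, vt[:n_components], eigvals[:n_components]

def pca_project(X, mean, components):
    """Project rows of X onto the retained principal directions."""
    return (X - mean) @ components.T

def weighted_euclidean(a, b, weights):
    """Weighted Euclidean distance along the last axis."""
    diff = a - b
    return np.sqrt(np.sum(weights * diff * diff, axis=-1))

def nearest_cluster(z, centroids, weights):
    """Index of the centroid closest to z under the weighted distance."""
    return int(np.argmin(weighted_euclidean(centroids, z, weights)))

# Toy data (assumption): 10 speakers, 4-dimensional vectors, two groups.
rng = np.random.default_rng(0)
group_a = rng.normal(0.0, 0.1, (5, 4))
group_b = rng.normal(0.0, 0.1, (5, 4)) + np.array([3.0, 3.0, 0.0, 0.0])
X = np.vstack([group_a, group_b])

mean, components, eigvals = pca_fit(X, n_components=2)
Z = pca_project(X, mean, components)

# Assumed weighting: inverse variance of each retained component.
weights = 1.0 / (eigvals + 1e-12)

# Cluster centroids in PCA space; here the grouping is known in advance.
centroids = np.stack([Z[:5].mean(axis=0), Z[5:].mean(axis=0)])
labels = [nearest_cluster(z, centroids, weights) for z in Z]
```

In an identification system of this kind, a test utterance would first be assigned to its nearest cluster, and only the speaker models in that cluster would then be scored, which is where the speed-up comes from.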