Implementation of the Auditory Sense for the Smart Robot: Speaker/Speech Recognition

Speech and Speaker Recognition Algorithm for Application to Robot Systems

  • Published : 2007.05.10

Abstract

We introduce a speech/speaker recognition algorithm for isolated words. In speaker verification, a Gaussian Mixture Model (GMM) is generally used to model the feature vectors of reference speech signals. On the other hand, a Dynamic Time Warping (DTW) based template matching technique was proposed for isolated word recognition several years ago. We combine these two different concepts in a single method and implement it in a real-time speaker/speech recognition system. With the proposed method, a small number of reference utterances (5 or 6 training repetitions) is sufficient to build a reference model that achieves 90% recognition performance.
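As an illustration of the DTW template matching mentioned above, the following is a minimal sketch of the classic DTW cost between two feature sequences, together with a nearest-template classifier. It is not the combined GMM/DTW method of this paper, whose details the abstract does not give. The function names `dtw_distance` and `recognize`, the Euclidean local distance, and the unconstrained step pattern are all assumptions made for the example.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Assumption: the local distance is Euclidean and the allowed steps are
    the three standard ones (insertion, deletion, match).
    """
    n, m = len(a), len(b)
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b stays
                cost[i][j - 1],      # b advances, a stays
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def recognize(utterance, templates):
    """Return the label of the reference template with the lowest DTW cost.

    `templates` is a hypothetical dict mapping a word label to one
    reference feature sequence.
    """
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))
```

In this sketch each word has a single reference template; a system trained on 5 or 6 utterances per word would need some way to merge or select among them, which the abstract leaves unspecified.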

Keywords