Phonetic Tied-Mixture Syllable Model for Efficient Decoding in Korean ASR

Kim Bong-Wan;Lee Yong-Jn;

MALSORI (대한음성학회지:말소리)

Issue 50
/
Pages.139-150
/
2004
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Phonetic Tied-Mixture Syllable Model for Efficient Decoding in Korean ASR

효율적 한국어 음성 인식을 위한 PTM 음절 모델

Kim Bong-Wan (SiTEC) ;
Lee Yong-Jn

김봉완 ;
이용주 (원광대)

Published : 2004.06.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

A Phonetic Tied-Mixture (PTM) model has been proposed as a way of efficient decoding in large vocabulary continuous speech recognition systems (LVCSR). It has been reported that PTM model shows better performance in decoding than triphones by sharing a set of mixture components among states of the same topological location[5]. In this paper we propose a Phonetic Tied-Mixture Syllable (PTMS) model which extends PTM technique up to syllables. The proposed PTMS model shows 13% enhancement in decoding speed than PTM. In spite of difference in context dependent modeling (PTM : cross-word context dependent modeling, PTMS : word-internal left-phone dependent modeling), the proposed model shows just less than 1% degradation in word accuracy than PTM with the same beam width. With a different beam width, it shows better word accuracy than in PTM at the same or higher speed.

MALSORI (대한음성학회지:말소리)

Phonetic Tied-Mixture Syllable Model for Efficient Decoding in Korean ASR

효율적 한국어 음성 인식을 위한 PTM 음절 모델

Abstract

Keywords