MALSORI (대한음성학회지:말소리)
- Issue 44
- /
- Pages.145-156
- /
- 2002
- /
- 1226-1173(pISSN)
A Reduction of Speech Database in Corpus-based Speech Synthesis System
코퍼스기반 음성합성기의 데이터베이스 감축방안
- Jang Kyung-Ae (KT) ;
- Chung Min-Hwa ;
- Kim Jae-In (KT) ;
- Koo Myoung-Wan (KT)
- Published : 2002.12.01
Abstract
This paper describes the reduction of DB without degradation of speech quality in Corpus-based Speech synthesizer of the Korean language. In this paper, it is proposed that the frequency of every unit in reduced DB reflect the frequency of units in the Korean language. So, the target population of every unit is set to be proportional to its frequency in Korean large corpus (780k sentences, 45Mega phones). Secondly, the frequent instances during synthesis should be also maintained in reduced DB. To the last, it is proposed that frequency of every instance be reflected in clustering criteria and used as another important criterion for selection of representative instances. The evaluation result with proposed methods reveals better quality than that using conventional methods.