Proceedings of the KSPS conference (대한음성학회:학술대회논문집)
- 2003.05a
- Pages 23-26
- 2003
Common Speech Database Collection for Telecommunications
통신망환경 한국어 공통음성 DB 구축
Abstract
This paper presents the collection of a common speech database for telecommunication applications. Over a three-year project, we will construct very large-scale speech and text databases for speech recognition, speech synthesis, and speaker identification. The design of the common speech database takes into account various communication environments and the distribution of speakers by sex, age, and region. It consists of Korean continuous digits, isolated words, and sentences that reflect Korean phonetic coverage. In addition, it covers various speaking styles, such as read speech, dialogue speech, and semi-spontaneous speech. By providing shared resources, the common speech databases prevent duplicated data collection efforts across the Korean speech industry. They are expected to encourage the domestic speech industry and stimulate the domestic speech technology market.
Keywords