Speech Coarticulation Database of Korean and English

한국음향학회지 (The Journal of the Acoustical Society of Korea)

Vol. 18, No. 3 / pp. 17-26 / 1999 / pISSN 1225-4428 / eISSN 2287-3775

한국음향학회 (The Acoustical Society of Korea)

한·영 동시조음 데이터베이스의 구축 (Construction of a Korean-English Coarticulation Database)

Jong-mi Kim (Department of English Language and Literature, Kangwon National University);
Stephen A. Dyer (EECE, Kansas State University);
Dwight D. Day (EECE, Kansas State University)

Published: 1999.04.01

Abstract

We present the first speech coarticulation database of Korean, English, and Konglish, named "SORIDA", which is designed to cover the maximum number of representations of coarticulation in these languages [1]. SORIDA is a compact database designed to contain the maximum number of triphones in the minimum number of prompts. It contains all consonantal triphones and vowel allophones in 682 word-length Korean prompts and 717 English prompt words, spoken five times by speakers balanced for gender, dialect, and age. The Korean prompts are synthesized lexical items that maximize coarticulation variation without regard to stress, while the English prompts are natural words that fully reflect the effects of stress on coarticulation variation. The prompts are designed differently because English phonology has stress while Korean does not. An intermediate language, Konglish, has also been modeled by two Korean speakers reading the 717 English prompt words. Recording was done in a controlled laboratory environment with an AKG Model C-100 microphone and a Fostex D-5 digital audio tape (DAT) recorder. The total recording time was four hours. The SORIDA CD-ROM is available as one disc at a 22.05 kHz sampling rate with a 16-bit sample size. The SORIDA digital audio tapes are available as four 124-minute tapes at a 48 kHz sampling rate. SORIDA's list of phonetically rich words is also available in English and Korean.
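
The goal of packing the maximum number of triphones into the minimum number of prompts can be illustrated with a small sketch. The abstract does not say how the SORIDA prompts were actually chosen, so the greedy set-cover heuristic below is an assumption used only for illustration. The function names, the `#` word-boundary symbol, and the toy lexicon with its ARPAbet-like phone labels are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

Triphone = Tuple[str, ...]


def triphones(phones: List[str]) -> Set[Triphone]:
    """Return every (left, center, right) phone triple in one word.

    '#' pads both ends so word-initial and word-final contexts count too
    (an illustrative convention, not taken from the paper).
    """
    padded = ["#"] + phones + ["#"]
    return {tuple(padded[i:i + 3]) for i in range(len(padded) - 2)}


def greedy_prompt_selection(lexicon: Dict[str, List[str]]) -> List[str]:
    """Pick words greedily until every triphone in the lexicon is covered.

    This is the standard greedy set-cover heuristic: it gives a small
    prompt list, not a guaranteed minimum one.
    """
    coverage = {word: triphones(phones) for word, phones in lexicon.items()}
    uncovered: Set[Triphone] = set()
    for covered_by_word in coverage.values():
        uncovered |= covered_by_word

    selected: List[str] = []
    while uncovered:
        # Choose the word adding the most new triphones; iterating over
        # sorted keys makes ties resolve alphabetically.
        best = max(sorted(coverage), key=lambda w: len(coverage[w] & uncovered))
        selected.append(best)
        uncovered -= coverage[best]
    return selected


# Hypothetical toy lexicon: word -> phone sequence.
toy_lexicon = {
    "cat": ["k", "ae", "t"],
    "at": ["ae", "t"],
    "cats": ["k", "ae", "t", "s"],
    "scat": ["s", "k", "ae", "t"],
}

selected_prompts = greedy_prompt_selection(toy_lexicon)
print(selected_prompts)
```

On this toy lexicon the word "cat" is never selected, because each of its triphones already occurs in "cats" or "scat". That is the sense in which a compact prompt list can cover the same coarticulation contexts with fewer words.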

References

Kim, Jong-mi; S. A. Dyer; D. D. Day. Construction of a speech translation database. First LREC conf. proc.
Young, S. J.; J. J. Odell; P. C. Woodland. Tree-based state tying for high accuracy acoustic modeling. Human Language Technology Workshop proc.
European Language Resources Association. Speech Resources Catalogue. Web site.
Linguistic Data Consortium. Speech corpora. Web site.
정국; 구희산; 이찬도; 김종미; 한선희. 음성 인식/합성을 위한 국어의 음성-음운론적 특성연구 (A study of the phonetic and phonological characteristics of Korean for speech recognition/synthesis). The Journal of the Acoustical Society of Korea (한국음향학회지), v.13 no.6.
Zue, Victor W.; Stephanie Seneff. Transcription and Alignment of the TIMIT Database. Paper presented at the Second Symposium on Advanced Man-Machine Interface through Spoken Language.
TIMIT: Acoustic-Phonetic Continuous Speech Corpus. CD-ROM, distributed by LDC, UPENN.