Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

Park Hyeyoung;Kim Hyungsoon;

MALSORI (대한음성학회지:말소리)

Issue 43
/
Pages.137-150
/
2002
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

자동 음성 분할을 위한 음향 모델링 및 에너지 기반 후처리

박혜영 (부산대) ;
김형순 (부산대)

Published : 2002.06.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Speech segmentation at phoneme level is important for corpus-based text-to-speech synthesis. In this paper, we examine acoustic modeling methods to improve the performance of automatic speech segmentation system based on Hidden Markov Model (HMM). We compare monophone and triphone models, and evaluate several model training approaches. In addition, we employ an energy-based postprocessing scheme to make correction of frequent boundary location errors between silence and speech sounds. Experimental results show that our system provides 71.3% and 84.2% correct boundary locations given tolerance of 10 ms and 20 ms, respectively.

MALSORI (대한음성학회지:말소리)

Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

자동 음성 분할을 위한 음향 모델링 및 에너지 기반 후처리

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)