• Title/Summary/Keyword: Vocal Tract Modeling

Search Result 4, Processing Time 0.017 seconds

Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes(USLAT) (비고정 구간 길이 음향 튜브를 이용한 성도 모델링)

  • Kim, Dong-Jun
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.59 no.6
    • /
    • pp.1126-1130
    • /
    • 2010
  • Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. The vocal tract is modeled as a chain of cylinders of varying cross-sectional area in linear prediction acoustic tube modeling. In this modeling the most common implementation assumes equal length of tube sections. Therefore, to model complex vocal tract shapes, a large number of tube sections are needed. This paper proposes a new vocal tract model with unfixed sectionlengths, which uses the reduced lattice filter for modeling the vocal tract. This model transforms the lattice filter to reduced structure and the Burg algorithm to modified version. When the conventional and the proposed models are implemented with the same order of linear prediction analysis, the proposed model can produce more accurate results than the conventional one. To implement a system within similar accuracy level, it may be possible to reduce the stages of the lattice filter structure. The proposed model produces the more similar vocal tract shape than the conventional one.

A 3D Vocal Tract Modeling and Vowel Discrimination of Korean Monophthongs [이, 에, 아, 오, 우, 으] (한국어 단모음 [이, 에, 아, 오, 우, 으]에 대한 성도 3차원 모델링 및 모음 판별)

  • Seong, Cheol-Jae;Park, Jong-won;Kim, Gui-Ryong
    • Proceedings of the KSPS conference
    • /
    • 2005.11a
    • /
    • pp.185-188
    • /
    • 2005
  • We presents a new method for the measurement and analysis of the volume of the vocal tract using 3D magnetic resonance image. The relative ratios of volume A, B, and C, which are divided by the 2constriction points formed on the horizontal and vertical plane in vocal tract, take a decisive role indiscriminating Korean monophthong. Together with Fl-F2 and the minimum cross sectional area in the vocal tract, the relative ratios of the regional volumes were proved to be significant parameter in statistic viewpoint.

  • PDF

Hunminjeongeum Phonetics (I): Phonetic and Phoniatric Consideration for Explanation of Designs of Middle Vowel Letters (훈민정음 음성학(I): 중성자(홀소리) 제자해에 대한 음성언어의학적 고찰)

  • Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.33 no.2
    • /
    • pp.77-82
    • /
    • 2022
  • Hunminjeongeum was made by the Great King Sejong, and composed of 17 consonant and 11 vowel letters. All the 28 letters were made according to the shape of vocal organ or space at the point of articulation for each letters. This review article focused on phonetic and phoniatric consideration for explanation of the designs of the middle vowel letters, especially three main vowel letters [ • (天, heaven), ㅡ (地, earth), ㅣ (人, human)] using video-fluoroscopic evaluation as well as computed tomography scanning, etc. During articulating / • / sound, a ball-like space at frontal portion of the oral cavity was found, tongue was contracted, and sound was deep (舌縮而聲深). During /ㅡ/ sound, a flat air space between oral tongue and hard palate was created. Tongue was slightly contacted neither deep nor shallow (舌小縮而聲不深不淺). During /ㅣ/ sound, tongue was not contacted and Sound is light (舌不縮而聲淺). Tongue was moved forward making longitudinal oro-pharyngeal air space. So, I'd like to suggest that we had better change the explanation drawing from a philosophical modeling to a more scientific modeling from real vocal tract space modeling during articulating middle vowels of Hunminjeongeum.

VOICE SOURCE ESTIMATION USING SEQUENTIAL SVD AND EXTRACTION OF COMPOSITE SOURCE PARAMETERS USING EM ALGORITHM

  • Hong, Sung-Hoon;Choi, Hong-Sub;Ann, Sou-Guil
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.893-898
    • /
    • 1994
  • In this paper, the influence of voice source estimation and modeling on speech synthesis and coding is examined and then their new estimation and modeling techniques are proposed and verified by computer simulation. It is known that the existing speech synthesizer produced the speech which is dull and inanimated. These problems are arised from the fact that existing estimation and modeling techniques can not give more accurate voice parameters. Therefore, in this paper we propose a new voice source estimation algorithm and modeling techniques which can not give more accurate voice parameters. Therefore, in this paper we propose a new voice source estimation algorithm and modeling techniques which can represent a variety of source characteristics. First, we divide speech samples in one pitch region into four parts having different characteristics. Second, the vocal-tract parameters and voice source waveforms are estimated in each regions differently using sequential SVD. Third, we propose composite source model as a new voice source model which is represented by weighted sum of pre-defined basis functions. And finally, the weights and time-shift parameters of the proposed composite source model are estimeted uning EM(estimate maximize) algorithm. Experimental results indicate that the proposed estimation and modeling methods can estimate more accurate voice source waveforms and represent various source characteristics.

  • PDF