• 제목/요약/키워드: Vocal Tract Modeling

검색결과 4건 처리시간 0.019초

비고정 구간 길이 음향 튜브를 이용한 성도 모델링 (Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes(USLAT))

  • 김동준
    • 전기학회논문지
    • /
    • 제59권6호
    • /
    • pp.1126-1130
    • /
    • 2010
  • Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. The vocal tract is modeled as a chain of cylinders of varying cross-sectional area in linear prediction acoustic tube modeling. In this modeling the most common implementation assumes equal length of tube sections. Therefore, to model complex vocal tract shapes, a large number of tube sections are needed. This paper proposes a new vocal tract model with unfixed sectionlengths, which uses the reduced lattice filter for modeling the vocal tract. This model transforms the lattice filter to reduced structure and the Burg algorithm to modified version. When the conventional and the proposed models are implemented with the same order of linear prediction analysis, the proposed model can produce more accurate results than the conventional one. To implement a system within similar accuracy level, it may be possible to reduce the stages of the lattice filter structure. The proposed model produces the more similar vocal tract shape than the conventional one.

한국어 단모음 [이, 에, 아, 오, 우, 으]에 대한 성도 3차원 모델링 및 모음 판별 (A 3D Vocal Tract Modeling and Vowel Discrimination of Korean Monophthongs [이, 에, 아, 오, 우, 으])

  • 성철재;박종원;김귀룡
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.185-188
    • /
    • 2005
  • We presents a new method for the measurement and analysis of the volume of the vocal tract using 3D magnetic resonance image. The relative ratios of volume A, B, and C, which are divided by the 2constriction points formed on the horizontal and vertical plane in vocal tract, take a decisive role indiscriminating Korean monophthong. Together with Fl-F2 and the minimum cross sectional area in the vocal tract, the relative ratios of the regional volumes were proved to be significant parameter in statistic viewpoint.

  • PDF

훈민정음 음성학(I): 중성자(홀소리) 제자해에 대한 음성언어의학적 고찰 (Hunminjeongeum Phonetics (I): Phonetic and Phoniatric Consideration for Explanation of Designs of Middle Vowel Letters)

  • 최홍식
    • 대한후두음성언어의학회지
    • /
    • 제33권2호
    • /
    • pp.77-82
    • /
    • 2022
  • Hunminjeongeum was made by the Great King Sejong, and composed of 17 consonant and 11 vowel letters. All the 28 letters were made according to the shape of vocal organ or space at the point of articulation for each letters. This review article focused on phonetic and phoniatric consideration for explanation of the designs of the middle vowel letters, especially three main vowel letters [ • (天, heaven), ㅡ (地, earth), ㅣ (人, human)] using video-fluoroscopic evaluation as well as computed tomography scanning, etc. During articulating / • / sound, a ball-like space at frontal portion of the oral cavity was found, tongue was contracted, and sound was deep (舌縮而聲深). During /ㅡ/ sound, a flat air space between oral tongue and hard palate was created. Tongue was slightly contacted neither deep nor shallow (舌小縮而聲不深不淺). During /ㅣ/ sound, tongue was not contacted and Sound is light (舌不縮而聲淺). Tongue was moved forward making longitudinal oro-pharyngeal air space. So, I'd like to suggest that we had better change the explanation drawing from a philosophical modeling to a more scientific modeling from real vocal tract space modeling during articulating middle vowels of Hunminjeongeum.

VOICE SOURCE ESTIMATION USING SEQUENTIAL SVD AND EXTRACTION OF COMPOSITE SOURCE PARAMETERS USING EM ALGORITHM

  • Hong, Sung-Hoon;Choi, Hong-Sub;Ann, Sou-Guil
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA
    • /
    • pp.893-898
    • /
    • 1994
  • In this paper, the influence of voice source estimation and modeling on speech synthesis and coding is examined and then their new estimation and modeling techniques are proposed and verified by computer simulation. It is known that the existing speech synthesizer produced the speech which is dull and inanimated. These problems are arised from the fact that existing estimation and modeling techniques can not give more accurate voice parameters. Therefore, in this paper we propose a new voice source estimation algorithm and modeling techniques which can not give more accurate voice parameters. Therefore, in this paper we propose a new voice source estimation algorithm and modeling techniques which can represent a variety of source characteristics. First, we divide speech samples in one pitch region into four parts having different characteristics. Second, the vocal-tract parameters and voice source waveforms are estimated in each regions differently using sequential SVD. Third, we propose composite source model as a new voice source model which is represented by weighted sum of pre-defined basis functions. And finally, the weights and time-shift parameters of the proposed composite source model are estimeted uning EM(estimate maximize) algorithm. Experimental results indicate that the proposed estimation and modeling methods can estimate more accurate voice source waveforms and represent various source characteristics.

  • PDF