• 제목/요약/키워드: vocal tract

Search Result 172, Processing Time 0.021 seconds

기능성 3차원적 후두 전산화 촬영을 이용한 후두질환의 진단

  • 박영학;김형태;송창은;최혁기;조승호
    • Proceedings of the KSLP Conference
    • /
    • 2003.11a
    • /
    • pp.162-163
    • /
    • 2003
  • 후두의 기능으로는 하기도의 보호, 호흡, 발성, 흉강의 고정 등이 있다. 이 중 발성은 성대의 진동에 의한 성대음이 입술까지의 성도(vocal tract) 및 비강에서의 조음과 공명 과정을 거치면서 이루어진다. 후두 질환을 진단하는 방법으로 간접 후두경, 단순 X-선 검사, 굴곡성 후두경(flexible fiberscope), 후두원시경(telescope), 전산화 단층 촬영, 자기공명영상 등이 사용되어 왔다. (중략)

  • PDF

Estimation of Articulatory Characteristics of Vowels Using 'ArtSim' (Artsim'을 이용한 모음의 조음점 추정에 관한 연구)

  • Kim Dae-Ryun;Cho Cheol-Woo
    • MALSORI
    • /
    • no.35_36
    • /
    • pp.121-129
    • /
    • 1998
  • In this paper, articulatory simulator 'Artsim' is used as a tool for the experiments to examine the articulatory characteristics of 6 different vowels. Each vowels are defined by some articulatory points from their vocal tract area functions and shapes of tongues. Each points are varied systematically to synthesize vowels and the synthesized sound is evaluated by human listners. Finally distributions of each vowels within vowel space is obtained. From the experimental results it is verified that our articulatory simulator can be used effectively to investigate the articulatory characteristics of speech.

  • PDF

편도적출술후 음성변화에 관한 음성학적 및 영상학적 연구

  • 이종환;구수권;이상화;왕수건
    • Proceedings of the KSLP Conference
    • /
    • 1998.11a
    • /
    • pp.184-184
    • /
    • 1998
  • 배경 : 이비인후과 영역에서 많이 시행되어지고 있는 편도적출술은 공명실의 구조중 인강의 구조에 직접적인 영향을 줄 수 있는 수술로서 술후 음성의 변화를 호소하는 경우를 볼 수 있다. 지금까지 음성변화에 관한 음성학적 연구는 많으나 성도(vocal tract)의 변화에 대한 영상학적 연구는 아직 미비하다. 이에 저자들은 편도적출술후의 음성변화에 관한 음성학적 및 영상학적 연구를 시행하였다. (중략)

  • PDF

On a Pitch Alteration Method using Scaling the Harmonics Compensated with the Phase for Speech Synthesis (위상 보상된 고조파 스케일링에 의한 음성합성용 피치변경법)

  • Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.6
    • /
    • pp.91-97
    • /
    • 1994
  • In speech processing, the waveform codings are concerned with simply preserving the waveform of signal through a redundancy reduction process. In the case of speech synthesis, the waveform codings with high quality are mainly used to the synthesis by analysis. Because the parameters of this coding are not classified as both excitation and vocal tract, it is difficult to apply the waveform coding to the synthesis by rule. Thus, in order to apply the waveform coding to synthesis by rule, it is necessary to alter the pitches. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by dividing the speech signals into the vocal tract and excitation parameters. This method is a time-frequency domain method preserving the phase component of the waveform in time domain and the magnitude component in frequency domain. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing. In case of using the algorithm, we can obtain spectrum distortion with $2.94\%$. That is, the spectrum distortion is decreased more $5.06\%$ than that of the pitch alteration method in time domain.

  • PDF

Voice Personality Transformation Using an Optimum Classification and Transformation (최적 분류 변환을 이용한 음성 개성 변환)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.5
    • /
    • pp.400-409
    • /
    • 2004
  • In this paper. a voice personality transformation method is proposed. which makes one person's voice sound like another person's voice. To transform the voice personality. vocal tract transfer function is used as a transformation parameter. Comparing with previous methods. the proposed method makes transformed speech closer to target speaker's voice in both subjective and objective points of view. Conversion between vocal tract transfer functions is implemented by classification of entire vector space followed by linear transformation for each cluster. LPC cepstrum is used as a feature parameter. A joint classification and transformation method is proposed, where optimum clusters and transformation matrices are simultaneously estimated in the sense of a minimum mean square error criterion. To evaluate the performance of the proposed method. transformation rules are generated from 150 sentences uttered by three male and on female speakers. These rules are then applied to another 150 sentences uttered by the same speakers. and objective evaluation and subjective listening tests are performed.

A Study on Correcting Korean Pronunciation Error of Foreign Learners by Using Supporting Vector Machine Algorithm

  • Jang, Kyungnam;You, Kwang-Bock;Park, Hyungwoo
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.3
    • /
    • pp.316-324
    • /
    • 2020
  • It has experienced how difficult People with foreign language learning, it is to pronounce a new language different from the native language. The goal of various foreigners who want to learn Korean is to speak Korean as well as their native language to communicate smoothly. However, each native language's vocal habits also appear in Korean pronunciation, which prevents accurate information transmission. In this paper, the pronunciation of Chinese learners was compared with that of Korean. For comparison, the fundamental frequency and its variation of the speech signal were examined and the spectrogram was analyzed. The Formant frequencies known as the resonant frequency of the vocal tract were calculated. Based on these characteristics parameters, the classifier of the Supporting Vector Machine was found to classify the pronunciation of Koreans and the pronunciation of Chinese learners. In particular, the linguistic proposition was scientifically proved by examining the Korean pronunciation of /ㄹ/ that the Chinese people were not good at pronouncing.

A Study on the Formant Analysis of Korean Monophthongs and their Resonance Effect in Vocal Tract (한글 단모음의 포만트 분석과 성도내의 공명효과에 관한 연구)

  • Sin, Hyeon-Jae;Yun, Seok-Wang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.6 no.2
    • /
    • pp.30-37
    • /
    • 1987
  • Twelve Korean monophthongs were studied by formant analysis, fundamental frequencies and their harmonics were considered as the parameters of analysis. The analyzed data were twelve Korean monophthongs which were pronounced with the five fundamental frequencies by the five male vocal musicians. The study shows that the first and the second formants are characterized by the resonance of the cavities of pharymx and mouth, respectively. The lip rounding effect detreases the second formant frequency. The phonemes of $[a]/[\alpha ], [e]/[\varepsilon] and [\partial]/[\Lambda]$were not distinguished well in this formant analysis.

  • PDF

Effects of Lax Vox voice therapy in a patient with spasmodic dysphonia: A case report (연축성 발성장애 환자의 Lax Vox 음성치료 효과)

  • Lim, Hye Jin;Choi, Seong Hee;Kim, Jeong Kyu;Choi, Chul-Hee
    • Phonetics and Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.57-63
    • /
    • 2016
  • Recently, the Lax Vox voice therapy has been used as one of the SOVTE(Semi-Occluded Vocal Tracts Exercise). The purpose of this study was to explore the effect of Lax Vox voice therapy for a patient with Spasmodic dysphonia on voice improvement. One female spasmodic dysphonia patient(age=27) who had been diagnosed by a laryngologist received Lax Vox voice therapy. The Lax Vox protocol was configured as 5 steps (1 warm-up and 4 steps : bubbling without / with phonation/ gliding with phonation/ generalization) in this study. A total of 11 sessions were performed by a certified speech language pathologist. The present study evaluated the acoustic, aerodynamic, auditory perceptual, and patient's self-rating between pre-, mid-, and post- voice therapy. All objective and subjective parameters were improved after voice therapy; Reduced frequency variation, increased maximum phonation time, enlarged voice range, improved 'G' and 'S' in GRBAS & USDRS, and reduced VHI were observed. Especially, decreased $f_0$ and remarkably reduced voice tremor were also demonstrated following Lax Vox voice therapy. Accordingly, Lax Vox voice therapy technique can be useful for improving voice and quality of life in patients with spasmodic dysphonia.

Voice quality of normal elderly people after a 3oz water-swallow test: An acoustic analysis (3온스 물 삼킴검사 이후 정상 노년층의 음질 변화: 음향학적 분석)

  • Lee, Sol Hee;Choi, Hong-Shik;Choi, Seong-Hee;Kim, HyangHee
    • Phonetics and Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.69-76
    • /
    • 2018
  • The elderly are at increased risk of developing dysphagia due to aging and illnesses. The aim of the current study was to analyze, via an acoustic study, the change in the voice quality of normal elderly people after a 3oz water-swallow test. Subjects included a group of 60 normal elderly people (age: $mean{\pm}SD=76.9{\pm}6.66$) and 60 healthy young adults (age: $mean{\pm}SD=25.1{\pm}2.36$). Every participant produced a five-second /a/ phonation pre- and post-swallowing, and the fractioned two-second sections were analyzed using the MDVP (multi dimensional voice program) analysis. The elderly group demonstrated a post-swallowing increase in the following related acoustic parameters: fundamental frequency, fundamental frequency variation, amplitude-variation, and noise in both two-second sections. However, the younger group showed an increase only in frequency related acoustic parameters (i.e., STD ) in the first two-second section. The significant changes in values in the post-swallowing parameters might indicate temporary irregularities in pitch and amplitude along with higher amounts of noise in the voice. The results could be attributed to water residues in the vocal fold and vocal tract, as well as a deterioration of the motor and sensory functions caused by anatomical and physiological changes that result from aging.

Effect of Music Training on Categorical Perception of Speech and Music

  • L., Yashaswini;Maruthy, Sandeep
    • Journal of Audiology & Otology
    • /
    • v.24 no.3
    • /
    • pp.140-148
    • /
    • 2020
  • Background and Objectives: The aim of this study is to evaluate the effect of music training on the characteristics of auditory perception of speech and music. The perception of speech and music stimuli was assessed across their respective stimulus continuum and the resultant plots were compared between musicians and non-musicians. Subjects and Methods: Thirty musicians with formal music training and twenty-seven non-musicians participated in the study (age: 20 to 30 years). They were assessed for identification of consonant-vowel syllables (/da/ to /ga/), vowels (/u/ to /a/), vocal music note (/ri/ to /ga/), and instrumental music note (/ri/ to /ga/) across their respective stimulus continuum. The continua contained 15 tokens with equal step size between any adjacent tokens. The resultant identification scores were plotted against each token and were analyzed for presence of categorical boundary. If the categorical boundary was found, the plots were analyzed by six parameters of categorical perception; for the point of 50% crossover, lower edge of categorical boundary, upper edge of categorical boundary, phoneme boundary width, slope, and intercepts. Results: Overall, the results showed that both speech and music are perceived differently in musicians and non-musicians. In musicians, both speech and music are categorically perceived, while in non-musicians, only speech is perceived categorically. Conclusions: The findings of the present study indicate that music is perceived categorically by musicians, even if the stimulus is devoid of vocal tract features. The findings support that the categorical perception is strongly influenced by training and results are discussed in light of notions of motor theory of speech perception.