• Title/Summary/Keyword: phonation

Fluency Scoring of English Speaking Tests for Nonnative Speakers Using a Native English Phone Recognizer

  • Jang, Byeong-Yong;Kwon, Oh-Wook
    • Phonetics and Speech Sciences / v.7 no.2 / pp.149-156 / 2015
  • We propose a new method for automatic fluency scoring of English speaking tests spoken by nonnative speakers in a free-talking style. The proposed method differs from previous methods in that it does not require transcribed texts of the spoken utterances. First, an input utterance is segmented into a phone sequence by a phone recognizer trained on native speech databases. For each utterance, a feature vector with 6 features is extracted by processing the segmentation results of the phone recognizer. Then, a fluency score is computed by applying support vector regression (SVR) to the feature vector. The parameters of the SVR are learned from the rater scores for the utterances. In computer experiments with 3 tests taken by 48 Korean adults, we show that speech rate, phonation time ratio, and smoothed unfilled pause rate are the best features for fluency scoring. The correlation between the rater score and the SVR score is 0.84, which is higher than the correlation of 0.78 among raters. Although this correlation is slightly lower than the correlation of 0.90 obtained when the transcribed texts are given, it implies that the proposed method can be used as a preprocessing tool for fluency evaluation of speaking tests.
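
The abstract above names speech rate, phonation time ratio, and unfilled pause rate as features derived from the phone segmentation. The following is a minimal sketch of how such features might be computed, not the authors' implementation. The `Segment` type, the `"sil"` silence label, the 0.2 s pause threshold, and the choice of phones per second as the rate unit are all assumptions made for illustration; the smoothing step and the SVR stage are not shown because the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    label: str    # phone label; "sil" marks silence (hypothetical label set)
    start: float  # start time in seconds
    end: float    # end time in seconds


def fluency_features(segments, min_pause=0.2):
    """Compute three illustrative fluency features from a phone segmentation.

    min_pause is an assumed threshold (seconds) below which a silence
    is not counted as an unfilled pause.
    """
    if not segments:
        raise ValueError("segments must not be empty")
    total = segments[-1].end - segments[0].start
    if total <= 0:
        raise ValueError("utterance duration must be positive")

    phones = [s for s in segments if s.label != "sil"]
    pauses = [s for s in segments
              if s.label == "sil" and (s.end - s.start) >= min_pause]
    phonation_time = sum(s.end - s.start for s in phones)

    return {
        "speech_rate": len(phones) / total,            # phones per second
        "phonation_time_ratio": phonation_time / total,
        "unfilled_pause_rate": len(pauses) / total,    # pauses per second
    }


# Made-up segmentation of a one-second utterance.
example = [
    Segment("h", 0.0, 0.1),
    Segment("e", 0.1, 0.3),
    Segment("sil", 0.3, 0.8),
    Segment("l", 0.8, 0.9),
    Segment("o", 0.9, 1.0),
]
print(fluency_features(example))
```

In a full pipeline, feature vectors of this kind would be passed to a regression model trained on rater scores; the sketch stops at feature extraction.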

Acoustic Characteristics of 'Short Rushes of Speech' using Alternate Motion Rates in Patients with Parkinson's Disease (파킨슨병 환자의 교대운동속도 과제에서 관찰된 '말 뭉침'의 음향학적 특성)

  • Kim, Sun Woo;Yoon, Ji Hye;Lee, Seung Jin
    • Phonetics and Speech Sciences / v.7 no.2 / pp.55-62 / 2015
  • It is widely accepted that Parkinson's disease (PD) is the most common cause of hypokinetic dysarthria, and its characteristic 'short rushes of speech' become more evident as the motor disorder becomes more severe. Speech alternate motion rates (AMRs) are particularly useful for observing not only rate abnormalities but also deviant speech. However, apart from perceptual characteristics, relatively little is known about 'short rushes of speech' in the AMRs of patients with PD. The purpose of this study was to examine which acoustic features of 'short rushes of speech' in AMRs are robust indicators of Parkinsonian speech. Syllabic repetitions (/pə/, /tə/, /kə/) in AMR tasks from 9 patients with PD were analyzed acoustically by inspecting spectrograms in the Computerized Speech Lab. Acoustically, we found three characteristics of 'short rushes of speech': 1) vocalized consonants without closure duration (VC), 76.3%; 2) no consonant segmentation (NC), 18.6%; 3) no vowel formant frequency (NV), 5.1%. These results suggest that 'short rushes of speech' may be associated with a failure to reach and maintain phonatory targets. To best achieve therapeutic goals and make treatment most efficacious, it is important to incorporate training methods based on both phonation and articulation.

The Study of Breath Competence Depending on Utterance Condition by Healthy Speakers: a Preliminary Study (발화조건에 따른 정상 성인의 호흡 능력 차이 비교: 예비연구)

  • Lee, In-Ae;Lee, Hye-Eun;Hwang, Young-Jin
    • Phonetics and Speech Sciences / v.4 no.2 / pp.115-120 / 2012
  • This study compared breath competence in three utterance conditions: reading a passage aloud, spontaneous speech, and singing. We tested 15 normal females (mean age $24 \pm 4.4$ years) and measured breath competence with an objective aerodynamic instrument, the PAS (Phonatory Aerodynamic System, model 6600, KAY Electronics, Inc). Breathing cycles of inspiration and expiration were characterized by breath group number, breath group duration, and the ratio of inspiration to expiration. Breath group number and breath group duration showed no significant difference across conditions; the only difference found was in the ratio of inspiration to expiration. Singing produced the most varied ratio of inspiration to expiration, followed by reading a text aloud and then spontaneous speech. The average frequency and maximum intensity levels also varied with utterance condition. These results suggest that breath competence and phonation competence are closely interrelated.

A Cepstral Analysis of Breathy Voice with Vocal Fold Paralysis (성대마비로 인한 기식 음성에 대한 Cepstral 분석)

  • Kang, Young-Ae;Seong, Cheol-Jae
    • Phonetics and Speech Sciences / v.4 no.2 / pp.89-94 / 2012
  • The aim of this study is to investigate the usefulness of the parameters CPP (cepstral peak prominence) and LTAS (long term average spectrum) band energy for the analysis of breathy voice with vocal fold paralysis. Thirty-four female subjects who had vocal fold paralysis after thyroidectomy participated in this study. According to perceptual judgements by three speech pathologists and one phonetician, subjects were divided into two groups: a breathy voice group (n = 21) and a non-breathy voice group (n = 13). A maximum sustained phonation task was recorded for acoustic analysis. CPP-related (i.e. mean F0, mean CPP, and mean CPPs) and LTAS-related (i.e. minimum, maximum, and mean) parameters were used. An independent-samples t-test was conducted. Regarding CPP, there are significant differences in mean CPP and mean CPPs between groups. The values of mean CPP and CPPs in the non-breathy voice group are higher than those in the breathy voice group. CPP could therefore be regarded as a useful parameter for breathy voice analysis in the clinic. As for LTAS, energy from 0 to 2 kHz is significantly different between groups. The minimum value of the non-breathy group is lower than that of the breathy group, whereas the maximum value of the non-breathy group is higher. The frequency band below 2 kHz seems to be related to breathy voice.
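
The group comparison described above relies on an independent-samples t-test. As a rough illustration of that arithmetic, the sketch below computes the pooled-variance (Student) t statistic with the Python standard library. Whether the study used the pooled or the Welch variant is not stated, so the pooled form is an assumption, and the CPP values in the example are made up, not taken from the paper.

```python
import math
from statistics import mean, variance


def pooled_t_test(a, b):
    """Return (t, degrees_of_freedom) for a pooled-variance two-sample t-test."""
    n1, n2 = len(a), len(b)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two observations")
    # Pooled estimate of the common variance (sample variances, n - 1 denominator).
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean(a) - mean(b)) / standard_error, n1 + n2 - 2


# Hypothetical mean CPP values in dB for two small groups (illustration only).
non_breathy = [11.0, 12.0, 13.0, 14.0, 15.0]
breathy = [9.0, 10.0, 11.0, 12.0, 13.0]

t_stat, dof = pooled_t_test(non_breathy, breathy)
print(f"t = {t_stat:.3f}, df = {dof}")
```

Turning the statistic into a p-value requires the t distribution, which the standard library does not provide; a statistics package would normally handle that step.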

Acoustic parameters that differentiate /o/ from /u/ in Seoul Korean (서울말 /ㅗ/와 /ㅜ/를 구별하는 음향변수)

  • Byun, Hi-Gyung
    • Phonetics and Speech Sciences / v.10 no.2 / pp.15-24 / 2018
  • Earlier studies reported that the /o/ and /u/ phonemes of Seoul Korean were merging in the F1/F2 space. However, perception tests have shown that identification accuracy remained high even in cases where the two vowels overlapped. This study explores whether there is another acoustic parameter that differentiates /o/ from /u/, besides the F1/F2 contrast. Seventy-five native speakers of Seoul Korean, born between 1953 and 1999, participated in a production test. The data collected were analyzed in terms of F1 and F2, H1-H2, and F0. The results show that the /o/ and /u/ of female speakers almost overlap in the F1/F2 space for all ages, while H1-H2 values are significantly different between the two vowels regardless of age. On the other hand, the /o/ and /u/ of male speakers are largely well separated in the F1/F2 space, while the H1-H2 values of the two vowels are very close at all ages. The F0 effect is relatively small for both male and female speakers, even though there is a statistically significant difference. The results of this study provide evidence that female speakers use phonation differences to distinguish /o/ from /u/, and that the F1/F2 contrast has been replaced by H1-H2 values.
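
H1-H2, the measure used above, is the level difference in dB between the first and second harmonics of the voice source spectrum. The sketch below shows one simple way to compute an uncorrected H1-H2 when F0 is already known, by projecting the signal onto sinusoids at F0 and 2·F0. It is an illustration under stated assumptions, not the procedure used in the study: the function names are hypothetical, the signal is synthetic, and no formant correction is applied.

```python
import math


def harmonic_amplitude(signal, sample_rate, freq):
    """Estimate the amplitude of the sinusoidal component at `freq`.

    Uses a single-frequency DFT projection; it is exact when the signal
    spans a whole number of periods of `freq`.
    """
    n = len(signal)
    step = 2 * math.pi * freq / sample_rate
    re = sum(x * math.cos(step * i) for i, x in enumerate(signal))
    im = sum(x * math.sin(step * i) for i, x in enumerate(signal))
    return 2 * math.hypot(re, im) / n


def h1_minus_h2(signal, sample_rate, f0):
    """Uncorrected H1-H2 in dB, given a known fundamental frequency f0."""
    h1 = harmonic_amplitude(signal, sample_rate, f0)
    h2 = harmonic_amplitude(signal, sample_rate, 2 * f0)
    return 20 * math.log10(h1 / h2)


# Synthetic test signal: first harmonic amplitude 1.0, second harmonic 0.5.
sample_rate = 16000
f0 = 200.0
signal = [
    math.sin(2 * math.pi * f0 * i / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 2 * f0 * i / sample_rate)
    for i in range(1600)  # 0.1 s, a whole number of periods at 200 Hz
]
print(f"H1-H2 = {h1_minus_h2(signal, sample_rate, f0):.2f} dB")  # about 6.02 dB
```

With recorded speech, F0 would first have to be estimated, and the analysis window would rarely hold a whole number of periods, so windowing and peak picking around each harmonic are usually needed.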

The Effectiveness of Furlow's Double Opposing Z-plasty for Treatment of Velopharyngeal Insufficiency (비인강폐쇄기능부전의 치료에 있어서 Furlow 이중 Z-성형술의 효과)

  • Kim, Soo-Ho;Kim, Eu-Gene;Park, Hyong-Wook;Cheon, Kang-Yong;Hwang, Soon-Jung
    • Korean Journal of Cleft Lip And Palate / v.15 no.2 / pp.97-108 / 2012
  • Velopharyngeal insufficiency (VPI) is improper closure of the velopharynx during phonation and swallowing due to various causes, and it is seen especially in cleft palate patients. Several surgical techniques and speech therapy can be considered in the treatment of VPI. Surgical techniques such as Furlow's double opposing Z-plasty, pharyngeal flap, and push-back palatoplasty have been widely used when speech therapy is not sufficiently effective. However, there is considerable variability in evaluation methods and success criteria, which makes it difficult to compare surgical techniques. This article reviews recent articles comparing surgical techniques for the treatment of VPI. Although there is no significant difference in speech assessment by speech pathologists, Furlow's double opposing Z-plasty is a useful technique, especially for diminishing hypernasality and nasal emission.

The Aerodynamic & Respiratory Muscle Pressure Aspects of Patients with Adductor Spasmodic Dysphonia (내전형 경련성발성장애의 호흡압력과 공기역학적 특징)

  • Nam, Do-Hyun;Choi, Seong-Hee;Choi, Jae-Nam;Choi, Hong-Shik
    • Speech Sciences / v.12 no.4 / pp.203-213 / 2005
  • This study was conducted to investigate the respiratory and aerodynamic function of adductor spasmodic dysphonia (ADSD) patients. Participants were (1) 18 female SD patients who had not received Botulinum toxin injection, (2) 14 female SD patients who had received Botulinum toxin injection, and (3) 14 age- and sex-matched normal female controls. A spirometer and a phonatory function analyzer were used to measure respiratory muscle pressure (MIP: maximum inspiratory pressure; MEP: maximum expiratory pressure), MPT (maximum phonation time), and aerodynamic parameters (F0: fundamental frequency; intensity; MFR: mean flow rate; Psub: subglottal pressure). The results were as follows: (1) the normal group was significantly higher in MIP, MEP, and MPT than the two SD groups (p < .05); (2) MPT was significantly lower in the SD group without Botulinum toxin injection than in the SD group with Botulinum toxin treatment (p < .05); (3) the aerodynamic parameters F0, intensity, MFR, and Psub were not significantly different among the three groups (p > .05). The short MPT in ADSD may arise because patients use lower respiratory pressure than the normal group as a strategy to decrease their tremulous voice quality. Moreover, respiratory muscle pressure was lower than in the normal group regardless of Botulinum toxin injection treatment.

A Comparison of $SaO_2$ & $P_ACO_2$ Changes of Pre & Post Vocal Training in Classical Singers (발성훈련 전후의 혈중 산소포화도($SaO_2$)와 폐포 내 이산화탄소분압($P_ACO_2$)의 비교연구)

  • Nam, Do-Hyun
    • Speech Sciences / v.14 no.3 / pp.127-137 / 2007
  • The aim of this study is to examine the influence of vocal training on internal respiration in order to develop an efficient method of singing phonation. Five trained male singers (age: $25.0 \pm 1.4$ years, career: $6.8 \pm 1.1$ years) and five trained female singers (age: $22.0 \pm 1.0$ years, career: $5.8 \pm 1.2$ years) participated in this study. $SaO_2$ (oxyhemoglobin saturation) was measured by Oxy-Pulse meter, while $P_ACO_2$ (alveolar $CO_2$ partial pressure) was measured by Quick et $CO_2$ before and after 2-minute, 4-minute and 6-minute vocal training. The results showed that $SaO_2$ was within the normal range after vocal training, but $P_ACO_2$ fell below the normal range (36-40 mmHg) after vocal training, which led to hypocapnia. This caused the singers to experience some headache and dizziness.

A Comparison of the Voice Differences of Patients with Idiopathic Parkinson's Disease and a Normal-Aging Group (파킨슨병 환자와 정상 노인의 음성비교)

  • Kang, Young-Ae;Kim, Yong-Duk;Ban, Jae-Chun;Seong, Cheol-Jae
    • Phonetics and Speech Sciences / v.1 no.1 / pp.99-107 / 2009
  • In view of the hypothesis that the effects of Parkinson's disease on voice production can be detected before pharmacological intervention, the voice differences between patients with idiopathic Parkinson's disease and a healthy aging group were analyzed, with the long-term objective of establishing early disease-progression biomarkers for clinical purposes. Fifteen patients with idiopathic Parkinson's disease (prior to pharmacological intervention) and a healthy control group of 15 were selected, and every voice was recorded three times using Praat (ver. 5022) with a headset mic. Relevant parameters (acoustic measures of /a/ phonation, F0-related parameters, MPT-related parameters, articulatory ratio, VOT) were then analyzed by MANOVA. Significant differences were found in the F0-related (low F0, high F0, F0 range) and MPT-related parameters. There were also significant differences in acoustic measurements (intensity, shimmer, HNR, jitter), AMR (/tʌ/, /kʌ/) and VOT (/ta/). The findings indicated that the voice production of patients with idiopathic Parkinson's disease has normal pitch but poor quality. In particular, judging from the slow articulatory ratios and VOT values, tongue tip functioning was lower in patients than in the healthy group.
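
Jitter and shimmer, two of the acoustic measures listed above, quantify cycle-to-cycle variation in period and in amplitude. A minimal sketch of the common "local" definition follows: the mean absolute difference between consecutive cycles divided by the mean value. The function name and the example numbers are hypothetical, and the abstract does not say which jitter or shimmer variant the study reported.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, relative to the mean.

    Applied to glottal period durations this gives local jitter;
    applied to per-cycle peak amplitudes it gives local shimmer.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


# Made-up cycle measurements from a sustained /a/ (illustration only).
periods_s = [0.0100, 0.0102, 0.0099, 0.0101, 0.0100]   # period durations in seconds
amplitudes = [0.80, 0.78, 0.82, 0.79, 0.81]            # per-cycle peak amplitudes

print(f"local jitter:  {100 * local_perturbation(periods_s):.2f} %")
print(f"local shimmer: {100 * local_perturbation(amplitudes):.2f} %")
```

The hard part in practice is extracting reliable cycle boundaries from the recording; once periods and amplitudes are available, the perturbation measures themselves reduce to this arithmetic.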

Comparison of Fundamental Frequency Control Between Thyroarytenoid Muscle and Cricothyroid Muscle: In Vivo Canine Model (생체 발성 모형에서 갑상피열근과 윤상갑상근의 기본주파수 조절 기능의 비교)

  • ;Gerald S. Berke
    • Proceedings of the KOR-BRONCHOESO Conference / 1993.05a / pp.70-70 / 1993
  • Fundamental frequency is controlled by contraction of both the thyroarytenoid (TA) and cricothyroid (CT) muscles. While the activity of the CT is well known, little is known regarding the effect of the TA muscle on vocal fold vibration. To study this, a previously developed in vivo canine laryngeal model was modified. Isolated TA muscle activation was obtained by stimulating sectioned terminal TA branches through small thyroid cartilage windows. The results indicated that TA muscle activation is a major determinant in vocal register shift from falsetto to modal phonation. F0 increased with increasing TA activation in the modal register. On the other hand, F0 decreased with TA activation when the evoked voice belonged to the falsetto register. Subglottic pressure increased gradually and open quotient (OQ) decreased gradually with TA activation.
