• 제목/요약/키워드: High Vowel

검색결과 144건 처리시간 0.029초

Electroglottography를 사용한 한국어 폐쇄자음의 특성 및 임상적 적용 (Characteristics of Korean Stop Consonants by Using Electroglottography and Its Clinical Application)

  • 채윤정;김현기;홍기환
    • 음성과학
    • /
    • 제4권2호
    • /
    • pp.157-177
    • /
    • 1998
  • An electroglottography (EGG) was used to investigate the function of the vocal folds during their vibration. In this study, four Korean native speakers and 10 vocal polyp patients were selected. To investigate the dynamic change of EGG waveforms for the three-way distinction of Korean stops, a DSP-Sona graph model 5500, a Rino- Laryngeal stroboscope, a CSL model 4300B and a Laryngograph were used. An EGG Model 4338 was used to exam the vocal polyp of patients' voices during high, low, comfortable pitch production. The purpose of this study is to investigate the characteristics of Korean stop consonants in relation to pitch and to observe laryngeal movement during vocal fold vibration and speech production. The basic data accumulated during this research can be applied in clinical treatment. The results are as follows: on the Korean stop consonants, the aspirated stop is the highest in the GOT and PC1. On the angle of vowel contour, the angle of lenis is smaller than the angle of heavily aspirated and glottalized stops. The fundamental frequency is lowest at the lenis stop, In vocal polyp patients', the low pitch range is smaller than in normal speakers'. The pitch break and the vocal fry were observed. The jitter and OQ value are higher in vocal polyp patients than in those of normal speakers'.

  • PDF

구개열 언어의 비음화에 관한 공기역학 및 음향학적 연구 (An Aerodynamic and Acoustic Study of Nasalization in Cleft Palate Speakers.)

  • 이종한;신효근
    • 음성과학
    • /
    • 제5권1호
    • /
    • pp.105-119
    • /
    • 1999
  • Cleft palate patients have general speech problems with resonance disorders and articulation disorders. The aim of this study is to find the aerodynamic and acoustic characteristics of the nasalization in cleft palate speakers. Thirteen control groups and three cleft palate patients pre- and post operation were selected for these studies. The test words are composed by polysyllabic words: consonants between high vowel /i/ analysis. The cleft palate patients repeated test words pre- and post-operation from one, three and six month periods. The subjects repeated test words on Macquirer and on Nasometer Model 6200-3. The aerodynamic and acoustic results of nasalization show as follows: (1) The nasal rate in overall airflow of aspirated consonant for cleft palate patients shows higher levels than that of the control group. It had decreased since one month after operation. (2) The overall airflow of cleft palate patients is higher than in the control group, however oral air pressure is lower than control group. (3) The nasal airflow and the nasal rate in overall airflow of cleft palate patients has higher than the control group, however its decreased after operation. (4) The nasalance scores of cleft palate patients were 40% higher than that of the control group. The scores did not decrease after operation. The nasalance score of lateral and fricative sounds did not decrease after operation.

  • PDF

영어의 강음절(강세 음절)과 한국어 화자의 단어 분절 (Strong (stressed) syllables in English and lexical segmentation by Koreans)

  • 김선미;남기춘
    • 말소리와 음성과학
    • /
    • 제3권1호
    • /
    • pp.3-14
    • /
    • 2011
  • It has been posited that in English, native listeners use the Metrical Segmentation Strategy (MSS) for the segmentation of continuous speech. Strong syllables tend to be perceived as potential word onsets for English native speakers, which is due to the high proportion of strong syllables word-initially in the English vocabulary. This study investigates whether Koreans employ the same strategy when segmenting speech input in English. Word-spotting experiments were conducted using vowel-initial and consonant-initial bisyllabic targets embedded in nonsense trisyllables in Experiment 1 and 2, respectively. The effect of strong syllable was significant in the RT (reaction times) analysis but not in the error analysis. In both experiments, Korean listeners detected words more slowly when the word-initial syllable is strong (stressed) than when it is weak (unstressed). However, the error analysis showed that there was no effect of initial stress in Experiment 1 and in the item (F2) analysis in Experiment 2. Only the subject (F1) analysis in Experiment 2 showed that the participants made more errors when the word starts with a strong syllable. These findings suggest that Koran listeners do not use the Metrical Segmentation Strategy for segmenting English speech. They do not treat strong syllables as word beginnings, but rather have difficulties recognizing words when the word starts with a strong syllable. These results are discussed in terms of intonational properties of Korean prosodic phrases which are found to serve as lexical segmentation cues in the Korean language.

  • PDF

시.청각적 피드백을 이용한 언어중재가 북한이탈주민의 자연스러운 발화에 미치는 효과 (The effects of Speech Intervention for Speech Naturalness of North Korean Refugees Using Visual and Auditory Feedback)

  • 김태희;김수진
    • 말소리와 음성과학
    • /
    • 제2권4호
    • /
    • pp.213-221
    • /
    • 2010
  • The number of North Korean refugees entering South Korea is continuously increasing. North Korean speakers show significant differences in vowel and consonant phonetics, length of vowels, and the rhythm and intonation of sentences. The object of this research was to examine the effectiveness of a speech intervention program for North Korean refugees using visual feedback through acoustical analysis for intonation. The subjects were three adults with no speech disabilities who had been in South Korea for less than five years. They had not received any prior treatment for inflection change. The program was set in a discourse situation and used Praat to evaluate intonation and provide visual feedback as demonstrating proper intonation changes through pitch contour. The results after intervention are as follows. First, intonation was significantly improved according to a 5-point subjective evaluation scale. Second, the pitch contour was similar to the contour of standard South Korean pronunciation. The subjects were very satisfied with this initial treatment and showed a high level of motivation. In subsequent study, the development of intervention and the comparison of interventions will be needed as well.

  • PDF

RBFN을 이용한 음소인식에 관한 연구 (A study on the phoneme recognition using radial basis function network)

  • 김주성;김수훈;허강인
    • 한국통신학회논문지
    • /
    • 제22권5호
    • /
    • pp.1026-1035
    • /
    • 1997
  • 본 연구는 RBFN의 일종인 GPFN과 PNN을 이용한 음소인식에 관한 연구이다. RBFN의 구조는 계층형 신경망의 구조와 유사하지만, hidden층에서 활성화함수, 참조벡터 및 학습알고리듬의 선택이 다르다. 특히 PNN은 시그모이드 함수가 지수를 포함한 함수들의 한 분류로 대체된다는 것이며, 학습이 필요없으므로 전체계산 시간이 빠르게 수행된다. 5모음, 12자음을 대상으로 한 음소인식 실험에서 평가데이터, VQ와 LVQ에 의한 코드북 데이터를 사용한 경우에 음성의 통계적 특성을 잘 반영하고 있는 RBFN의 일종인 GPFN과 PNN의 인식결과가 MLP보다 우수하였다.

  • PDF

음성 및 음향분석 프로그램 Praat의 임상적 활용법 (Guidance to the Praat, a Software for Speech and Acoustic Analysis)

  • 성철재
    • 대한후두음성언어의학회지
    • /
    • 제33권2호
    • /
    • pp.64-76
    • /
    • 2022
  • Praat is a useful analysis tool for linguists, engineers, doctors, speech-language pathologits, music majors, and natural scientists. Basic parameters including duration, pitch, energy and perturbation parameters such as jitter and shimmer can be easily measured and manipulated in the sound editor. When a more in-depth analysis is needed, it is recommended to understand the advanced menus of the object window and learn how to use them. Among the object window menus, vowel formant analysis, spectrum analysis, and cepstrum analysis can be cited as useful ones in the clinical field. The spectrum object can be usefully used for voice quality measurement and diagnosis of patients with voice disorders by showing the energy distribution according to frequency axis (domain). A cepstrum object is useful for speech analysis when periodicity of the sound object is not measurable. The low to high ratio obtained from the spectral object and the CPPs measured from the cepstrum object have attracted many researchers, and it has been proven that the CPPs measured in Praat are relatively excellent.

한국어 자음생성의 생리음성학적 특성 (Physiologic Phonetics for Korean Stop Production)

  • 홍기환;양윤수
    • 대한후두음성언어의학회지
    • /
    • 제17권2호
    • /
    • pp.89-97
    • /
    • 2006
  • The stop consonants in Korean are classified into three types according to the manner of articulation as unaspirated (UA), slightly aspirated (SA) and heavily aspirated (HA) stops. Both the UA and the HA types are always voiceless in any environment. Generally, the voice onset time (VOT) could be measured spectrographically from release of consonant burst to onset of following vowel. The VOT of the UA type is within 20 msec of the burst, and about 40-50 msec in the SA and 50-70 msec in the HA. There have been many efforts to clarify properties that differentiate these manner categories. Umeda, et $al^{1)}$ studied that the fundamental frequency at voice onset after both the UA and HA consonants was higher than that for the SA consonants, and the voice onset times were longest in the HA followed by the SA and UA. Han, et $al^{2)}$ reported in their speech synthesis and perception studies that the SA and UA stops differed primarily in terms of a gradual versus a relatively rapid intensity build-up of the following vowel after the stop release. Lee, et $al^{3)}$ measured both the intraoral and subglottal air pressure that the subglottal pressure was higher for the HA stop than for the other two stops. They also compared the dynamic pattern of the subglottal pressure slope for the three categories and found that the HA stop showed the most rapid increase in subglottal pressure in the time period immediately before the stop release. $Kagaya^{4)}$ reported fiberscopic and acoustic studies of the Korean stops. He mentioned that the UA type may be characterized by a completely adducted state of the vocal folds, stiffened vocal folds and the abrupt decreasing of the stiffness near the voice onset, while the HA type may be characterized by an extensively abducted state of the vocal folds and a heightened subglottal pressure. On the other hand, none of these positive gestures are observed for the SA type. Hong, et $al^{5)}$ studied electromyographic activity of the thyroarytenoid and posterior cricoarytenoid (PCA) muscles during stop production. He reported a marked and early activation of the PCA muscle associated with a steep reactivation of the thyroarytenoid muscle before voice onset in the production of the HA consonants. For the production of the UA consonants, little or no activation of the PCA muscle and earliest and most marked reactivation of the thyroarytenoid muscle were characteristic. For the SA consonants, he reported a more moderate activation of the PCA muscle than for the UA consonant, and the least and the latest reactivation of the thyroarytenoid muscle. Hong, et $al^{6)}$ studied the observation of the vibratory movements of vocal fold edges in terms of laryngeal gestures according to the different types of stop consonants. The movements of vocal fold edges were evaluated using high speed digital images. EGG signals and acoustic waveforms were also evaluated and related to the vibratory movements of vocal fold edges during stop production.

  • PDF

소프라노의 성악 발성에 대한 음향학적 특징 연구 (A Study on Acoustical Properties of Soprano′s Singing)

  • 임동철;문소연;이행세
    • 한국음향학회지
    • /
    • 제19권5호
    • /
    • pp.60-64
    • /
    • 2000
  • 본 논문에서는 소프라노가 성악 발성으로 한국어 단모음을 발음할 때, 그 단모음들의 포르만트가 F0(Fundamental frequency)에 따라 어떻게 바뀌어지는지 연구되었다. 일반적으로 다른 파트의 경우와는 달리, 소프라노가 노래를 할 때에는 포르만트가 그 F0의 영향을 크게 받는 것으로 알려져 있다. 따라서, 성악발성에 대한 연구를 위해서는 소프라노가 발성할 수 있는 전 음역 대의 F0에서 각 모음에 대한 포르만트 분석이 필요하다. 이러한 분석 결과를 바탕으로 성악 발성의 특징들을 패턴화하여 성악발성 평가 시스템이나 성악발성 합성 시스템을 구축할 수 있다. 5명의 전문 소프라노를 대상으로 '아, 에, 이, 오, 우' 5모음의 성악발성을 A3(220.0Hz)에서부터 A5(880.0Hz)까지의 피치에서 포르만트 분석을 하였다. 또한, 일반적인 대화 시 이 5가지 모음의 포르만트를 분석하여 성악발성의 경우와 비교하였다. 연구 결과, '아, 에, 이'의 F2/F1의 그래프가, B4(493.8Hz)이상의 F0에서는 거의 직선으로 나타났다. B4는 Changing Voice가 시작되는 곳으로, 성악가의 음색 변화가 포르만트 형태의 변화와 밀접한 관계가 있음을 알 수 있다. 또한, A5에서는 '아, 에, 이, 오, 우'의 F1, F2의 수치가 거의 일치하는 것으로 나타났다. 즉, 최고음부에서 불려지는 모음들은 서로 구별되기가 어렵게 되는 것이다. 본 논문은 성악발성 평가 시스템이나 성악발성 합성 시스템을 구축할 때에, '아, 오, 우'의 경우에는 B4에서 A5의 F1, F2를 F0대한 기울기로 규정화할 것을 제안한다. 이와 같은 규정화를 통하여 성악발성과 관련된 시스템 구축에 필요한 노력과 비용을 줄일 수 있을 것이다.

  • PDF

기능성 음성장애의 진단을 위한 음향학적, 청지각적 평가 (Acoustic Analysis and Auditory-Perceptual Assessment for Diagnosis of Functional Dysphonia)

  • 김근효;이연우;배인호;이재석;이창윤;박희준;이병주;권순복
    • 임상이비인후과
    • /
    • 제29권2호
    • /
    • pp.212-222
    • /
    • 2018
  • Background and Objectives : The purpose of this study was to compare the measured values of acoustic and auditory perceptual assessments between normal and functional dysphonia (FD) groups. Materials and Methods : 102 subjects with FD and 59 normal voice groups were participated in this study. Mid-vowel portion of the sustained vowel /a/ and two sentences of 'Sanchaek' were edited, concatenated, and analyzed by Praat script. And then auditory-perceptual (AP) rating was completed by three listeners. Results : The FD group showed higher acoustic voice quality index version 2.02 and version 3.01 (AVQIv2 and AVQIv3), slope, Hammarberg index (HAM), grade (G) and overall severity (OS), values than normal group. Additionally, smoothed cepstral peak prominence in Praat (PraatCPPS), tilt, low-to high spectral band energies (L/H ratio), long-term average spectrum (LTAS) in FD group were lower than normal voice group. And the correlation among measured values ranged from -0.250 to 0.960. In ROC curve analysis, cutoff values of AVQIv2, AVQIv3, PraatCPPS, slope, tilt, L/H ratio, HAM, and LTAS were 3.270, 2.013, 13.838, -22.286, -9.754, 369.043, 27.912, and 34.523, respectively, and the AUC of each analysis was over .890 in AVQIv2, AVQIv3, and PraatCPPS, over 0.731 in HAM, tilt, and slope, over 0.605 in LTAS and L/H ratio. Conclusions : In conclusion, AVQI and CPPS showed the highest predictive power for distinguishing between normal and FD groups. Acoustic analyses and AP rating as noninvasive examination can reinforce the screening capability of FD and help to establish efficient diagnosis and treatment process plan for FD.

The Movements of Vocal Folds during Voice Onset Time of Korean Stops

  • Hong, Ki-Hwan;Kim, Hyun-Ki;Yang, Yoon-Soo;Kim, Bum-Kyu;Lee, Sang-Heon
    • 음성과학
    • /
    • 제9권1호
    • /
    • pp.17-26
    • /
    • 2002
  • Voice onset time (VOT) is defined as the time interval from the oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristic of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. There have been many reports on efforts to clarify the acoustical and physiological properties that differentiate the three types of Korean stops, including acoustic, fiberscopic, aerodynamic and electromyographic studies. In the acoustic and fiberscopic studies for stop consonants, the voice onset time and glottal width during the production of stops has been known as the longest and largest in the heavily aspirated type followed by the slightly aspirated type and unaspirated types. The thyroarytenoid and posterior cricoarytenoid muscles were physiologically inter-correlated for differentiating these types of stops. However, a review of the English literature shows that the fine movement of the mucosal edges of the vocal folds during the production of stops has not been well documented. In recent. years, a new method for high-speed recording of laryngeal dynamics by use of a digital recording system allows us to observe with fine time resolution. The movements of the vocal fold edges were documented during the period of stop production using a fiberscopic system of high speed digital images. By observing the glottal width and the visual vibratory movements of the vocal folds before voice onset, the heavily aspirated stop was characterized as being more prominent and dynamic than the slightly aspirated and unaspirated stops.

  • PDF