• Title/Abstract/Keyword search: Consonants

The Movements of Vocal Folds during Voice Onset Time of Korean Stops

  • Hong, Ki-Hwan; Kim, Hyun-Ki; Yang, Yoon-Soo; Kim, Bum-Kyu; Lee, Sang-Heon
    • Speech Sciences / v.9 no.1 / pp.17-26 / 2002
  • Voice onset time (VOT) is defined as the time interval from the oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristic of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. Many acoustic, fiberscopic, aerodynamic, and electromyographic studies have sought to clarify the acoustic and physiological properties that differentiate the three types of Korean stops. Acoustic and fiberscopic studies have reported that voice onset time and glottal width during stop production are longest and largest in the heavily aspirated type, followed by the slightly aspirated and unaspirated types, and that the thyroarytenoid and posterior cricoarytenoid muscles are physiologically inter-correlated in differentiating these stop types. However, a review of the English literature shows that the fine movement of the mucosal edges of the vocal folds during stop production has not been well documented. In recent years, a new method for high-speed recording of laryngeal dynamics with a digital recording system has made it possible to observe these movements with fine time resolution. The movements of the vocal fold edges were documented during stop production using a fiberscopic high-speed digital imaging system. Judging from the glottal width and the visible vibratory movements of the vocal folds before voice onset, the heavily aspirated stop was more prominent and dynamic than the slightly aspirated and unaspirated stops. (A sketch of the acoustic VOT measure appears below.)

  • PDF
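
The abstract defines VOT acoustically as the lag from the release burst to the first glottal pulse. The paper itself works from high-speed laryngeal images, so the following is only a minimal waveform-based sketch under assumed heuristics; the function `estimate_vot`, the energy threshold, and the periodicity threshold are all invented for illustration.

```python
# A rough, invented illustration of the acoustic VOT definition above:
# VOT = (onset of glottal pulsing) - (oral release burst). The paper itself
# measures vocal-fold behavior from high-speed images; this toy version
# works on a waveform with crude energy/periodicity heuristics.
import numpy as np

def estimate_vot(signal, sr, frame_ms=10.0):
    """Rough VOT in seconds for a single CV token (assumed heuristics)."""
    frame = int(sr * frame_ms / 1000)
    min_lag = int(sr * 0.0025)  # shortest plausible pitch period (~400 Hz)
    energies, periodicity = [], []
    for i in range(len(signal) // frame):
        x = np.asarray(signal[i * frame:(i + 1) * frame], dtype=float)
        energies.append(float(np.sum(x ** 2)))
        ac = np.correlate(x, x, mode="full")[len(x) - 1:]  # lags 0..frame-1
        periodicity.append(ac[min_lag:].max() / ac[0] if ac[0] > 0 else 0.0)
    energies = np.asarray(energies)
    burst = int(np.argmax(energies > 0.1 * energies.max()))  # release burst
    voiced = [i for i, p in enumerate(periodicity) if i > burst and p > 0.5]
    return (voiced[0] - burst) * frame / sr if voiced else None
```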

Nonlinear Interaction between Consonant and Vowel Features in Korean Syllable Perception (한국어 단음절에서 자음과 모음 자질의 비선형적 지각)

  • Bae, Moon-Jung
    • Phonetics and Speech Sciences / v.1 no.4 / pp.29-38 / 2009
  • This study investigated the interaction between consonants and vowels in Korean syllable perception using a speeded classification task (Garner, 1978). Experiment 1 examined whether listeners analytically perceive the component phonemes in CV monosyllables when classification is based on a component phoneme (a consonant or a vowel), and it found a significant redundancy gain and a Garner interference effect. These results imply that perception of the component phonemes in a CV syllable is not linear. Experiment 2 examined the relation between consonants and vowels at a subphonemic level by comparing classification times based on glottal features (aspiration and lax), place-of-articulation features (labial and coronal), and vowel features (front and back). All feature classifications showed significant but asymmetric interference effects: glottal-feature-based classification showed the least interference, vowel-feature-based classification showed moderate interference, and place-of-articulation-based classification showed the most. These results indicate that, in syllable perception, glottal features are relatively independent of vowels while place features are more dependent on them. To examine the three-way interaction among glottal, place-of-articulation, and vowel features, Experiment 3 used a modified Garner task. Its outcome indicated that glottal consonant features are independent of both place-of-articulation and vowel features, whereas place-of-articulation features are dependent on glottal and vowel features. These results were interpreted as showing that speech perception is not abstract and discrete but nonlinear, and that the perception of features corresponds to the hierarchical organization of articulatory features proposed in nonlinear phonology (Clements, 1991; Browman and Goldstein, 1989). (A toy computation of the Garner measures appears below.)

  • PDF
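
In Garner-style speeded classification, interference is conventionally the reaction-time cost of a filtering block (the irrelevant dimension varies orthogonally) over a baseline block, and the redundancy gain is the benefit of a correlated block. A minimal sketch of those two measures, with invented condition names and numbers:

```python
# Hypothetical illustration of the two Garner-task measures mentioned above:
# interference = RT(filtering) - RT(baseline); redundancy gain =
# RT(baseline) - RT(correlated). Condition names and numbers are invented.
from statistics import mean

rts_ms = {
    "baseline":   [512, 498, 505, 520],  # irrelevant dimension held constant
    "filtering":  [560, 548, 571, 555],  # irrelevant dimension varies orthogonally
    "correlated": [480, 476, 490, 471],  # dimensions covary redundantly
}

means = {cond: mean(v) for cond, v in rts_ms.items()}
interference = means["filtering"] - means["baseline"]   # > 0: integral processing
redundancy_gain = means["baseline"] - means["correlated"]
print(f"interference = {interference:.1f} ms, "
      f"redundancy gain = {redundancy_gain:.1f} ms")
```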

Neutralization of Vowels /ɨ/ and /u/ after a Labial Consonant in Korean: A Cross-generational Study

  • Kang, Hyunsook
    • Phonetics and Speech Sciences / v.6 no.1 / pp.3-10 / 2014
  • This study investigated whether the Korean vowels /ɨ/ and /u/ are distinctively perceived after a labial consonant, given that native and Sino-Korean nouns show only the vowel /u/ after a labial consonant while this pattern has been massively broken by the recent influx of loanwords. A perception experiment was conducted with $V_1C_1V_2$ sequences in which the vowels /a, i, u/ and the consonants /p, t, k/ occurred as $V_1$ and $C_1$ before the target $V_2$, /ɨ/ or /u/. The data were produced in a read speech style by six speakers from each of two age groups, Age20 and Age40/50. The results showed that in both age groups the consonant /p/ attracted significantly more /u/ responses for /VCɨ/ sequences and significantly fewer /u/ responses for /VCu/ sequences than the other consonants did. Furthermore, the Age20 group gave a significantly lower percentage of /u/ responses than the Age40/50 group when the preceding consonant was /p/, regardless of the target vowel. We therefore suggest that, contrary to the traditional account of labial assimilation, there is neutralization after a labial consonant in which the vowels /ɨ/ and /u/ are often realized as a sound anywhere between the two. That is, the vowel change is not categorical; rather, it produces an ambiguous stimulus that draws different responses from different listeners. The ambiguity arises from coarticulation in speech production and from perceptual compensation. We also argue for a generational difference: the Age40/50 speakers showed a stronger tendency to perceive /u/ after a labial consonant regardless of whether the target vowel was /ɨ/ or /u/. (A toy tabulation of such response percentages appears below.)
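
The dependent measure above is the percentage of /u/ responses per preceding consonant and age group. A minimal sketch of that tabulation, with invented trial records and field names:

```python
# Hypothetical tabulation of the kind of measure used above: the percentage
# of /u/ responses per preceding consonant and age group. Trial records and
# field names are invented for illustration.
from collections import defaultdict

# (age_group, preceding_consonant, response), one tuple per trial
trials = [
    ("Age20", "p", "u"), ("Age20", "p", "ɨ"), ("Age20", "t", "ɨ"),
    ("Age40/50", "p", "u"), ("Age40/50", "p", "u"), ("Age40/50", "k", "ɨ"),
]

counts = defaultdict(lambda: [0, 0])  # key -> [/u/ responses, total]
for group, consonant, response in trials:
    cell = counts[(group, consonant)]
    cell[1] += 1
    if response == "u":
        cell[0] += 1

for (group, consonant), (n_u, n) in sorted(counts.items()):
    print(f"{group} /{consonant}/: {100 * n_u / n:.0f}% /u/ responses ({n_u}/{n})")
```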

Perception and Production of English Geminate Graphemes by Korean Students (한국 학생들의 영어 겹자음 철자 인지와 발화)

  • Cho, Mi-Hui
    • Proceedings of the Korea Contents Association Conference / 2009.05a / pp.1092-1096 / 2009
  • While Korean allows the same consonant at the coda of one syllable and at the onset of the following syllable, English does not allow a geminate consonant in that position. Because of this difference, Korean learners of English tend to incorrectly produce geminate consonants for English geminate graphemes, as in summer. Based on this observation, a pilot study was designed to investigate how Korean learners of English perceive and produce English doubleton and singleton graphemes. Twenty Korean college students performed a forced-choice perception test and a production test on 36 real-word stimuli consisting of near-minimal pairs of singleton and doubleton graphemes. The results showed that accuracy rates for words with singleton graphemes were relatively high in both perception and production (78.6% and 76.1%, respectively), while those for words with doubleton graphemes were low in both (55.3% and 61.7%, respectively). Spectrographic analyses also revealed more production errors in doubleton-grapheme words than in singleton-grapheme words.

  • PDF

The Acoustic Changes of Voice after Uvulopalatopharyngoplasty (구개인두성형술 후 음성의 음향학적 변화)

  • Hong, K.H.; Kim, S.W.; Yoon, H.W.; Cho, Y.S.; Moon, S.H.; Lee, S.H.
    • Speech Sciences / v.8 no.2 / pp.23-37 / 2001
  • The primary sound produced by the vibration of the vocal folds reaches the velopharyngeal isthmus and is directed both nasally and orally. The proportion of each component is determined by the anatomical and functional status of the soft palate. Oral sounds are composed of oral vowels and consonants shaped by the status of the vocal tract, tongue, palate, and lips. Nasal sounds are composed of nasal consonants and nasal vowels and are further modified by the status of the nasal airway, so anatomical abnormalities in the nasal cavity influence nasal sound. The measurement of the nasal component of speech has relied on subjective scoring by listeners; nasal sounds are described in terms of nasality and nasalization. Nasality has generally been assessed perceptually when evaluating maxillofacial procedures for cleft palate, sleep apnea, snoring, and nasal disorders, whereas nasalization is considered an acoustic phenomenon. Snoring and sleep apnea are typical disorders caused by redundant velopharyngeal tissue; sleep apnea is defined as a cessation of breathing for at least 10 seconds during sleep. Several medical and surgical treatments for sleep apnea have been attempted. Uvulopalatopharyngoplasty (UPPP) involves removal of 1.0 to 3.0 cm of soft palate tissue together with redundant oropharyngeal mucosa and lateral tissue from the anterior, and sometimes posterior, faucial pillars. The procedure results in a shortened soft palate, and a possible risk following this surgery is velopharyngeal malfunction due to the shortened palate. Few researchers have systematically studied the effects of this surgery on speech production, although some changes in voice quality such as resonance (nasality), articulation, and phonation have been reported. In view of these conflicting reports, some uncertainty remains about the speech status of patients after snoring and sleep apnea surgery. The study was conducted in two phases: 1) acoustic analysis of oral and nasal sounds, and 2) evaluation of nasality. (A toy nasalance computation appears below.)

  • PDF
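
Objective nasality measures are commonly expressed as nasalance, the nasal share of the total acoustic energy picked up by separate nasal and oral microphones. The abstract does not state the authors' exact metric, so the following sketch shows only that generic definition; the array names and example data are invented.

```python
# A minimal sketch of nasalance, a standard objective correlate of nasality:
# nasalance (%) = nasal energy / (nasal + oral energy) * 100, computed from
# separately recorded nasal and oral channels. This is the generic textbook
# definition, not necessarily the metric used in the paper.
import numpy as np

def nasalance_percent(nasal, oral):
    """Whole-utterance energy nasalance from two channels (toy version)."""
    e_nasal = float(np.sum(np.asarray(nasal, dtype=float) ** 2))
    e_oral = float(np.sum(np.asarray(oral, dtype=float) ** 2))
    return 100.0 * e_nasal / (e_nasal + e_oral)

# Invented example: a strongly nasal channel vs. a weaker oral channel.
rng = np.random.default_rng(0)
nasal = 0.8 * rng.standard_normal(16000)
oral = 0.4 * rng.standard_normal(16000)
print(f"nasalance = {nasalance_percent(nasal, oral):.1f}%")  # about 80%
```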

Speech Signal Compression and Recovery Using Transition Detection and Approximate-Synthesis (천이구간 추출 및 근사합성에 의한 음성신호 압축과 복원)

  • Lee, Kwang-Seok; Lee, Byeong-Ro
    • Journal of the Korea Institute of Information and Communication Engineering / v.13 no.2 / pp.413-418 / 2009
  • In a speech coding system that uses voiced and unvoiced excitation sources, speech quality is degraded when voiced and unvoiced consonants coexist within a single frame. We therefore propose a method that searches for and extracts the transition segment (TS) containing the unvoiced consonant, so that voiced and unvoiced consonants do not coexist in one frame. This research presents a new method of TS approximate synthesis using least-mean-square estimation and frequency-band division. The method obtains high-quality approximate-synthesis waveforms within the TS using only the frequency components below 0.547 kHz and above 2.813 kHz. Importantly, even where the error signal is largest, the approximate-synthesis waveform within the TS shows low distortion. The method can be applied to a new voiced/silence/TS speech coding scheme as well as to speech analysis and synthesis. (A sketch of the band split appears below.)
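
As an illustration of the frequency-band division just described, the sketch below keeps only the components below 0.547 kHz and above 2.813 kHz of a transition segment and resynthesizes a waveform from them. FFT masking here stands in for the paper's least-mean-square approximation, and all names and example signals are assumptions.

```python
# Illustrative band split for the transition segment (TS): keep only the
# spectral components below 0.547 kHz and above 2.813 kHz, then resynthesize.
# FFT masking stands in for the paper's LMS-based approximation; names and
# parameters are assumptions.
import numpy as np

def band_limited_approximation(ts, sr, low_hz=547.0, high_hz=2813.0):
    """Zero out 547 Hz .. 2813 Hz and return the time-domain approximation."""
    spectrum = np.fft.rfft(np.asarray(ts, dtype=float))
    freqs = np.fft.rfftfreq(len(ts), d=1.0 / sr)
    keep = (freqs <= low_hz) | (freqs >= high_hz)
    return np.fft.irfft(spectrum * keep, n=len(ts))

# Invented example: three tones, one inside the discarded mid band.
sr = 8000
t = np.arange(sr // 10) / sr
ts = (np.sin(2 * np.pi * 300 * t)            # kept: below 547 Hz
      + 0.7 * np.sin(2 * np.pi * 1500 * t)   # discarded: mid band
      + 0.5 * np.sin(2 * np.pi * 3200 * t))  # kept: above 2813 Hz
approx = band_limited_approximation(ts, sr)
err = np.sqrt(np.mean((ts - approx) ** 2))
print(f"RMS error = {err:.3f} (energy of the discarded mid band)")
```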

Comparisons of Recognition Rates for the Off-line Handwritten Hangul using Learning Codes based on Neural Network (신경망 학습 코드에 따른 오프라인 필기체 한글 인식률 비교)

  • Kim, Mi-Young; Cho, Yong-Beom
    • Journal of IKEEE / v.2 no.1 s.2 / pp.150-159 / 1998
  • This paper describes the recognition of off-line handwritten Hangul with a neural network using a feature-extraction method. Features of Hangul are extracted by a 5×5 window method, a modification of the 3×3 mask method, and are coded as binary patterns so that they can serve efficiently as neural-network inputs. A Hangul character is recognized by its consonant, its vertical vowel, and its horizontal vowel separately. To verify the recognition rate, three different coding methods were compared: the fixed-code method, the learned-code I method, and the learned-code II method. The learned-code II method performed best, achieving recognition rates of 100% for vertical vowels, 100% for horizontal vowels, 98.33% for learned consonants, and 93.75% for new consonants. (A toy 5×5 feature extraction appears below.)

  • PDF
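
The abstract does not spell out how the 5×5 window yields binary features, so the following is only a guess at one plausible scheme: tile the binarized glyph into 5×5 blocks and emit 1 for each block whose ink density passes a threshold. The block layout, threshold, and names are all assumptions.

```python
# A guessed illustration of 5x5-window binary feature extraction: tile the
# binarized glyph image into 5x5 blocks and emit 1 where the block's ink
# density exceeds a threshold. The paper's exact windowing scheme is not
# given in the abstract; everything here beyond the block size is assumed.
import numpy as np

def block_features(img, block=5, density=0.3):
    """img: 2-D array of 0/1 ink pixels; returns a flat binary feature vector."""
    h, w = (img.shape[0] // block) * block, (img.shape[1] // block) * block
    tiles = img[:h, :w].reshape(h // block, block, w // block, block)
    ink = tiles.mean(axis=(1, 3))            # per-block ink density
    return (ink > density).astype(np.uint8).ravel()

# Invented example: a 20x20 glyph with a single vertical stroke.
glyph = np.zeros((20, 20), dtype=np.uint8)
glyph[:, 9:11] = 1
print(block_features(glyph))  # 16 binary features for a 4x4 grid of blocks
```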

Relationship between the Maximal Tongue and Lip Strength and Percentage of Correct Consonants and Speech Intelligibility in Dysarthric Adults with Cerebral Palsy (뇌성마비로 인한 마비말장애 성인의 최대 혀 및 입술 강도와 자음정확도 및 말명료도의 관계)

  • Choi, Yoejin; Sim, Hyunsub
    • Phonetics and Speech Sciences / v.5 no.2 / pp.11-22 / 2013
  • The purpose of this study was to identify the relationship between tongue and lip strength and speech production in dysarthric adults with cerebral palsy. The maximal tongue and lip strengths of 22 normal adults and 27 dysarthric adults (10 with mild, 10 with moderate, and 7 with severe dysarthria) were measured with the Iowa Oral Performance Instrument (IOPI). The percentage of correct consonants (PCC) and speech intelligibility were calculated from words and sentences spoken by the subjects. The results are as follows. First, both maximal tongue strength and maximal lip strength differed significantly between the control group and the dysarthric group: while the mild-dysarthria group did not differ meaningfully from the control group, the moderate and severe groups showed significantly weaker tongue and lip strength than both the control group and the mild group. Second, across all subjects with dysarthria there were significant correlations between maximal tongue and lip strength and both PCC and speech intelligibility. These findings can serve as a foundation for diagnosing dysarthria quickly and accurately, and they indicate that, in addition to maximal tongue strength, maximal lip strength can be an important index for predicting the speech intelligibility of dysarthric adults with cerebral palsy. (A minimal PCC computation appears below.)
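
PCC is conventionally computed as the share of target consonants produced correctly, multiplied by 100. A minimal sketch of that standard computation, with invented scoring data:

```python
# A minimal sketch of the standard percentage-of-correct-consonants (PCC)
# computation: correctly produced target consonants / total target
# consonants * 100. The scoring data below are invented.
def pcc(scored_consonants):
    """scored_consonants: iterable of booleans, one per target consonant."""
    scored = list(scored_consonants)
    return 100.0 * sum(scored) / len(scored)

# Invented example: 18 of 20 target consonants judged correct.
scores = [True] * 18 + [False] * 2
print(f"PCC = {pcc(scores):.1f}%")  # PCC = 90.0%
```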

Extraction of Unvoiced Consonant Regions from Fluent Korean Speech in Noisy Environments (잡음환경에서 우리말 연속음성의 무성자음 구간 추출 방법)

  • 박정임; 하동경; 신옥근
    • The Journal of the Acoustical Society of Korea / v.22 no.4 / pp.286-292 / 2003
  • Voice activity detection (VAD) is the process of separating the speech regions of an input signal from its silence or noise regions. Since unvoiced consonant signals have characteristics very similar to those of noise, carrying out VAD without paying special attention to unvoiced consonants may seriously distort them or lead to erroneous noise estimation. In this paper, we propose a method that explicitly extracts the boundaries between unvoiced consonants and noise in fluent speech so that more exact VAD can be performed. The proposed method is based on the frequency-domain histogram that Hirsch used successfully for noise estimation, and also on a similarity measure of frequency components between adjacent frames. To evaluate the method, experiments on unvoiced-consonant boundary extraction were performed on seven kinds of noisy speech signals at 10 dB and 15 dB SNR. (A sketch of the histogram noise estimate appears below.)
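
Hirsch-style histogram noise estimation, one ingredient of the method above, picks for each frequency band the most frequently occurring magnitude across frames as the noise level. A minimal sketch of that idea alone (not the authors' full boundary-extraction algorithm), with invented data and bin count:

```python
# A minimal sketch of Hirsch-style histogram noise estimation: per frequency
# band, histogram the frame magnitudes and take the most populated bin as the
# noise estimate. Frame data, bin count, and names are assumptions; this is
# not the paper's full unvoiced-consonant boundary-extraction algorithm.
import numpy as np

def histogram_noise_estimate(mag_frames, n_bins=40):
    """mag_frames: (n_frames, n_bands) magnitudes -> (n_bands,) noise levels."""
    n_frames, n_bands = mag_frames.shape
    noise = np.empty(n_bands)
    for b in range(n_bands):
        counts, edges = np.histogram(mag_frames[:, b], bins=n_bins)
        k = int(np.argmax(counts))                  # most frequent level
        noise[b] = 0.5 * (edges[k] + edges[k + 1])  # bin center
    return noise

# Invented example: a low noise floor with an occasional speech burst.
rng = np.random.default_rng(1)
frames = 0.1 * np.abs(rng.standard_normal((200, 8)))
frames[50:60] += 1.0  # a burst of speech energy should not shift the estimate
print(histogram_noise_estimate(frames).round(3))
```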

The Effect of Stimulus-Response Compatibility on Hangul Transcription Typing Behavior (한글타자 행동에서 자극-반응 합치도 효과)

  • 조양석; 황태웅
    • Korean Journal of Cognitive Science / v.5 no.2 / pp.25-45 / 1994
  • The present study investigated the effect of stimulus-response compatibility (S-R compatibility) on Hangul transcription typing. Two conditions were manipulated: a low S-R compatibility condition in which consonants were typed with the left hand and vowels with the right hand, and a high S-R compatibility condition in which the hand assignments for consonants and vowels were reversed. Subjects were asked to type the letter presented on the screen as accurately and quickly as possible. Compatibility was found to interact with vowel shape: in the high S-R compatibility condition, response times were shorter for letters with vertically shaped vowels than for letters with horizontally shaped vowels, whereas in the low S-R compatibility condition response times were shorter for letters with horizontally shaped vowels than for letters with vertically shaped vowels.

  • PDF