• Title/Summary/Keyword: vowel perception

Search Result 90, Processing Time 0.022 seconds

A perceptual study of the three-way contrast in Korean stops with cross-spliced syllables

  • Kim, Mi-Ryoung
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.343-348
    • /
    • 1996
  • This paper examines the contribution of vocalic information (after the onset of voicing) to the perception of Korean alveolar stops: the aspirated /$t^{h}$/, the lenis /t/, and the fortis /$t^{*}$/. These stops have been analyzed as differing in VOT (Abramson & Lisker, 1964), the glottal width or aspiration (Kim, 1970), and F0 and intensity build-up (Han & Weitzman, 1970). These studies focused on the articulatory and acoustic qualities of the consonants and often assumed that the consonantal portion before the onset of voicing plays the main role in maintaining the three-way distinction. In contrast, the role of the following vowels was given less attention. In order to investigate the contribution of the following vowels, a perceptual study was conducted using stimuli cross-spliced from three naturally produced syllables [$t^{h}al$] 'mask', [tal] 'moon', and [$t^{*}al$]) 'daughter'. Stimuli were presented to 12 Korean listeners for identification. Each subject responded to a total of 486 tokens. The results show that vowels play the primary role when the cut occurs at the star of voicing. Even with cuts at 10 ms and 40 ms into voicing, the following vowel still plays a clear role. This suggests that vowels carry the important information for distinguishing the three stops.

  • PDF

An Experimental Phonetic study of Perception of native Korean speakers on English and German $/\int/$ (한국인의 외국어 $/\int/$음에 대한 실험음성학적 연구)

  • Lee Sook-hyang;Kang Hyunsook
    • MALSORI
    • /
    • no.40
    • /
    • pp.1-12
    • /
    • 2000
  • This paper investigated how $/\int/$ in English and German is perceived and interpreted in the loanwords in Korean. $/\int/$ in these languages does not show one-to-one correspondence in Korean: $/\int/$ in the coda position in English and German is perceived as [swi] in Korean while $/\int/$ in the onset position is perceived as [syu]. This paper examined phonetic characteristics of $/\int/$ in English and German through its acoustic analysis and attempted to figure out which factor could explain this surface distribution of [swi] and [syu]; phonological (onset vs. coda) or phonetic (coarticulation) factor. Two acoustic features of $/\int/$ in English and German were examined: duration and energy Peak frequency of the frication noise. German $/\int/$ Perceived as [swi] in Korean showed higher energy Peak frequency and longer duration than that perceived as [syu] in Korean. English iii perceived as [swi] also showed longer duration than that Perceived as [syu] in Korean but energy Peak frequency showed different behavior. English $/\int/$ showed coarticulation with the preceding vowel rather than being affected by its position in the syllable in English. This paper concludes that 1)Phonetic characteristics used are duration and energy Peak frequency of its frication noise when $/\int/$ in English and German are adopted in Korean, 2)duration is used prior to energy peak frequency, which can be used as an enhancing feature.

  • PDF

Phonetic meaning of clarity and turbidity (청탁의 음성학적 의미)

  • Park, Hansang
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.77-89
    • /
    • 2017
  • This study investigates the phonetic meaning of clarity and turbidity(淸濁) that has been used in psychoacoustics, musicology, and linguistics in both the East and the West. With a view to clarifying the phonetic meaning of clarity and turbidity, this study conducts three perception tests. First, 34 subjects were asked to take one of Clear and Turbid by forced choice for 5 pure and complex tones, respectively, ranging from A2 to A6 differing by octave. Second, they were asked to select between the two choices for 25 pure and complex tones, respectively, ranging from A2 to A4 differing by semitone. Third, they were asked to opt for one of the two choices for 8 different vowels of different formant and fundamental frequencies. Results showed that there is a certain range of tone which is perceived as clear, that clarity level increases as fundamental frequency increases, and that pure tones have a higher level of clarity than complex ones, fundamental frequency being equal. Results also showed that vocal tract resonance enhances clarity level on the whole, and that lower vowels have a higher level of clarity than higher ones. This study is significant in that it demonstrates that clarity level is proportional to fundamental frequency and the first formant frequency, all else being equal.

A study on speech analysis of person with presbycusis (노인성 난청인의 음성특성에 관한 연구)

  • Lee, S.M.;Song, C.G.;Woo, H.C.;Lee, Y.M.;Kim, W.K.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.11
    • /
    • pp.67-70
    • /
    • 1997
  • In this paper, we evaluated the character of speech of hearing impaired person (HIP) who acquire his hearing loss after the youth. It is usually observed that severe HIP decreased not only speech perception but also vocalization. so there is a need for sensitive and quantitative measures or the assesment of the speech of the HIP to serve both diagnostic and prognosic purposes, 7 HIP and 12 normal hearing person(NHP) were studied with pure tone test and speaking test using word/sentence table which consists of vowel(a:), mono and two syllables and a sentence. we analyzed formant frequency, pitch, sound intensity, speech duration of HIP and NHP speech. According to the results, in the HIP's speech we find that formant frequency was shifted, first-formant prominence was reduced, the dynamic range of sound intensity was decreased, speech duration was prolonged. In the next, we expect the correlation between hearing and speech character of HIP is cleared through analysis of more acoustic parameters and precise selection of HIP group.

  • PDF

Perception of Spanish $/{\setminus}/$ - /r/ distinction by native Japanese

  • Mignelina Guirao Jorge A. Gurlekian;Maria A. Garcia Jurado
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.337-342
    • /
    • 1996
  • In prevoius works we have repored phonetic similarities between Japanese and Spanish voweis and syiiabic sounds. (1) (2) (3) (4). In the present communication we explore the relative importance of duration of the consonantal segment to elicit Spanish /l/ - /r/ distinction by native j Japanese talkers. Three Argentine and three trained native Japanese talkers recorded /l-r/ combined with /a/ in VCV sequences. Modifications of consonant duration and vowel context with transitions were m made by editing natural /ala/ sounds. Mixed VCV were produced by combining sounds of both languages. Perceptual tests were produced by combining sounds of both languages perceptual performed presenting the speech material, to native t trained and non trained Japanese listeners. In a tirst sessIOn a d discrimination procedure was applied. The items were arranged in pairs a and listeners Nere told to indicate the pair that sounded different. In the f following session they were asked to identify and type the letter corresponding to each one of the items. Responses arc examined in tenns of critical duration of the interval between vowels. Preliminary results indicate that the duration of intervocalic intervais was a relevant cue for the identification of /l/ and /r/. It seems that to differentiate the two sounds, Japanese listeners required relatively longer interval steps than the argentine suhjects. There was a tendency to conhlse more frequently /l/ for /r/ than viceversa.

  • PDF

Effects of phonological and phonetic information of vowels on perception of prosodic prominence in English

  • Suyeon Im
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.1-7
    • /
    • 2023
  • This study investigates how the phonological and phonetic information of vowels influences prosodic prominence among linguistically untrained listeners using public speech in American English. We first examined the speech material's phonetic realization of vowels (i.e., maximum F0, F0 range, phone rate [as a measure of duration considering the speech rate of the utterance], and mean intensity). Results showed that the high vowels /i/ and /u/ likely had the highest max F0, while the low vowels /æ/ and /ɑ/ tended to have the highest mean intensity. Both high and low vowels had similarly high phone rates. Next, we examined the effects of the vowels' phonological and phonetic information on listeners' perceptions of prosodic prominence. The results showed that vowels significantly affected the likelihood of perceived prominence independent of acoustic cues. The high and low vowels affected probability of perceived prominence less than the mid vowels /ɛ/ and /ʌ/, although the former two were more likely to be phonetically enhanced in the speech than the latter. Overall, these results suggest that perceptions of prosodic prominence in English are not directly influenced by signal-driven factors (i.e., vowels' acoustic information) but are mediated by expectation-driven factors (e.g., vowels' phonological information).

Physiologic Phonetics for Korean Stop Production (한국어 자음생성의 생리음성학적 특성)

  • Hong, Ki-Hwan;Yang, Yoon-Soo
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.17 no.2
    • /
    • pp.89-97
    • /
    • 2006
  • The stop consonants in Korean are classified into three types according to the manner of articulation as unaspirated (UA), slightly aspirated (SA) and heavily aspirated (HA) stops. Both the UA and the HA types are always voiceless in any environment. Generally, the voice onset time (VOT) could be measured spectrographically from release of consonant burst to onset of following vowel. The VOT of the UA type is within 20 msec of the burst, and about 40-50 msec in the SA and 50-70 msec in the HA. There have been many efforts to clarify properties that differentiate these manner categories. Umeda, et $al^{1)}$ studied that the fundamental frequency at voice onset after both the UA and HA consonants was higher than that for the SA consonants, and the voice onset times were longest in the HA followed by the SA and UA. Han, et $al^{2)}$ reported in their speech synthesis and perception studies that the SA and UA stops differed primarily in terms of a gradual versus a relatively rapid intensity build-up of the following vowel after the stop release. Lee, et $al^{3)}$ measured both the intraoral and subglottal air pressure that the subglottal pressure was higher for the HA stop than for the other two stops. They also compared the dynamic pattern of the subglottal pressure slope for the three categories and found that the HA stop showed the most rapid increase in subglottal pressure in the time period immediately before the stop release. $Kagaya^{4)}$ reported fiberscopic and acoustic studies of the Korean stops. He mentioned that the UA type may be characterized by a completely adducted state of the vocal folds, stiffened vocal folds and the abrupt decreasing of the stiffness near the voice onset, while the HA type may be characterized by an extensively abducted state of the vocal folds and a heightened subglottal pressure. On the other hand, none of these positive gestures are observed for the SA type. Hong, et $al^{5)}$ studied electromyographic activity of the thyroarytenoid and posterior cricoarytenoid (PCA) muscles during stop production. He reported a marked and early activation of the PCA muscle associated with a steep reactivation of the thyroarytenoid muscle before voice onset in the production of the HA consonants. For the production of the UA consonants, little or no activation of the PCA muscle and earliest and most marked reactivation of the thyroarytenoid muscle were characteristic. For the SA consonants, he reported a more moderate activation of the PCA muscle than for the UA consonant, and the least and the latest reactivation of the thyroarytenoid muscle. Hong, et $al^{6)}$ studied the observation of the vibratory movements of vocal fold edges in terms of laryngeal gestures according to the different types of stop consonants. The movements of vocal fold edges were evaluated using high speed digital images. EGG signals and acoustic waveforms were also evaluated and related to the vibratory movements of vocal fold edges during stop production.

  • PDF

Effects of Injection Laryngoplasty with Hyaluronic Acid in Patients with Vocal Fold Paralysis

  • Kim, Geun-Hyo;Lee, Jae-Seok;Lee, Chang-Yoon;Lee, Yeon-Woo;Bae, In-Ho;Park, Hee-June;Lee, Byung-Joo;Kwon, Soon-Bok
    • Osong Public Health and Research Perspectives
    • /
    • v.9 no.6
    • /
    • pp.354-361
    • /
    • 2018
  • Objectives: The purpose of this study was to explore the effects of injection laryngoplasty (IL) with hyaluronic acid in patients with vocal fold paralysis (VFP). Methods: A total of 50 patients with VFP participated in this study. Pre- and post-IL assessments were performed, which included analyzing the sustained vowel /a/ phonation, and the patient reading 1 Korean sentence from the "Walk" passage that comprised 25 syllables in 10 words. To investigate the effect of IL on vocal fold function, acoustic analysis (acoustic voice quality index, cepstral peak prominence, maximum phonation time, speaking fundamental frequency) was conducted and auditory-perceptual (grade and overall severity), visual judgment (gap), and self-questionnaire (voice handicap index-10) assessments were performed. Results: The patients with VFP showed statistically significant differences between pre-and post-IL assessments for acoustic and auditory-perception, visual judgment, and self-questionnaire assessments. Conclusion: The patients with VFP showed positive change in vocal fold function between pre- and post-IL measurements. The findings showed that IL with hyaluronic acid is an effective method to improve vocal fold function in patients with VFP.

Perception of lenis and aspirated stops in Seoul Korean by younger and older male and female listeners (한국어 서울 방언의 평음과 격음 변별 지각에서 연령과 성별에 따른 차이)

  • Kim, Jeahong;Kim, Soan;Ahn, Joohee;Nam, Kichun;Choi, Jiyoun
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.1-8
    • /
    • 2020
  • Traditionally it has been understood that the aspirated and lenis stops in Seoul Korean are distinguished primarily by voice onset time (VOT) and secondarily by other cues such as the fundamental frequency (F0) of the following vowel. However, recent studies on stop production have shown that the aspirated and lenis stops are currently merging in VOT and that they are now differentiated primarily by F0. In the present study, we examined whether the currently reported change in the production domain would be also found in the perception domain. To this end, an auditory identification task was conducted using speech materials of varying VOT and F0 values with young and older male and female Seoul listeners. Results revealed that all listener groups used both VOT and F0 to distinguish the lenis vs. aspirated stops but they used the F0 cue more reliably than the VOT cue in discriminating the stop contrast. The effects of gender and age were found only in the VOT cue (i.e., not in the F0 cue), with the greatest VOT cue weight in older males and the smallest in young females, which is in line with recent production studies.

Investigation of the listening environment of classrooms for elderly people using speech intelligibility tests (음성명료도 시험에 의한 노인 교육시설의 청취환경 조사)

  • Park, Chan-Jae;Kim, Bo-Gyeong;Haan, Chan-Hoon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.1
    • /
    • pp.18-30
    • /
    • 2021
  • The ultimate goal of the present study is to establish the acoustical performance standards of classroom for the elderly who are incomplete hearing people. As a pilot survey, the present study was conducted to investigate the listening environment and the actual condition of speech perception performance of education facilities for elderly, Acoustic performances of two education facilities for elderly in Cheongju were measured and questionnaire survey was done to elderly people. Also, speech intelligibility tests were undertaken by Consonant Vowel Consonant (CVC) and Phonetically Balanced Words (PBW) methods. The questionnaire survey showed that the elderly were satisfied with the listening environment of the educational facilities in general. Also, it was found that acoustical performances satisfy with the acoustic criteria of general classrooms in Korea. However, the results of the speech intelligibility test showed that the scores of elderly were significantly lower than twenties with normal hearing. It was also revealed that the scores are reduced as the age increases. Thus, it was concluded that the acoustical performance standards of educational facilities for the normal hearing were not suitable for educational facilities for the elderly.