• Title/Abstract/Keywords: Speech Proficiency

Search results: 90 items (processing time: 0.021 seconds)

L2 Proficiency Effect on the Acoustic Cue-Weighting Pattern by Korean L2 Learners of English: Production and Perception of English Stops

  • Kong, Eun Jong;Yoon, In Hee
    • 말소리와 음성과학 / Vol. 5, No. 4 / pp.81-90 / 2013
  • This study explored how Korean L2 learners of English utilize multiple acoustic cues (VOT and F0) in perceiving and producing the English alveolar stop voicing contrast. Thirty-four 18-year-old high-school students participated in the study. Their English proficiency level was classified as either 'high' (HEP) or 'low' (LEP) according to high-school English level standardization. Thirty synthesized syllables were presented as audio stimuli, created by combining six VOT steps with five F0 steps. The listeners judged how close each audio stimulus was to /t/ or /d/ in L2 using a visual analogue scale. The L2 /d/ and /t/ productions collected from 22 learners (12 HEP, 10 LEP) were acoustically analyzed by measuring VOT and F0 at the vowel onset. Results showed that LEP listeners were more sensitive to F0 in the stimuli than HEP listeners, suggesting that HEP listeners could inhibit less important acoustic dimensions better than LEP listeners in their L2 perception. The L2 production patterns also exhibited a group difference between HEP and LEP in that HEP speakers utilized the VOT dimension (the primary cue in L2) more effectively than LEP speakers. Taken together, the study showed that the relative cue-weighting strategies in L2 perception and production are closely related to the learner's L2 proficiency level, in that more proficient learners had better control over inhibiting and enhancing the relevant acoustic parameters.
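
The 30-stimulus design described in this abstract (six VOT steps crossed with five F0 steps) can be sketched as a simple grid. This is only an illustration: the abstract does not report the actual step values, so the millisecond and hertz values below, and the names `VOT_STEPS_MS`, `F0_STEPS_HZ`, and `build_stimulus_grid`, are hypothetical placeholders.

```python
from itertools import product

# Hypothetical step values; the abstract gives only the number of steps.
VOT_STEPS_MS = [10, 20, 30, 40, 50, 60]   # six VOT steps (assumed values)
F0_STEPS_HZ = [180, 200, 220, 240, 260]   # five F0 steps (assumed values)


def build_stimulus_grid(vot_steps, f0_steps):
    """Cross every VOT step with every F0 step, one dict per stimulus."""
    return [{"vot_ms": vot, "f0_hz": f0} for vot, f0 in product(vot_steps, f0_steps)]


stimuli = build_stimulus_grid(VOT_STEPS_MS, F0_STEPS_HZ)
print(len(stimuli))  # number of VOT-by-F0 combinations
```

Crossing the two dimensions fully is what allows the weight given to each cue to be estimated separately in the listeners' responses.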

An analysis of listening errors by Korean EFL learners from self-paced passage dictation

  • Cho, Hyesun
    • 말소리와 음성과학 / Vol. 13, No. 1 / pp.17-24 / 2021
  • In this study, listening errors by Korean EFL learners are comprehensively analyzed from self-paced passage dictation tasks. Fifty-five Korean EFL learners participated in the study. Listeners were asked to write down dictation passages as accurately as possible, while listening to the audio as much as they needed. The results show that (i) low-proficiency learners tend to misperceive longer phrases than high-proficiency learners, (ii) function words are more often omitted or misheard than content words, and (iii) low-proficiency learners have more difficulties with content words than high-proficiency learners do. The most frequent suffix errors were omissions of past or plural suffixes. Among the function words, the most frequent errors were found with auxiliary contractions, the infinitive marker to, and articles, mostly in the environment of linking and elision. It is also shown that C-V linking, C-C linking, and elision are the primary sources of the most frequent errors. C-V linking led to errors in correctly locating the word boundary, while C-C linking and elision resulted in omission. These errors show that Korean EFL listeners have difficulty detecting fine-grained phonetic details to the extent that native speakers can.

Relative Temporal Stability in English Speech Rhythm by Korean Learners with Low and High English Proficiency

  • 김희성;장영수;신지영;김기호
    • 대한음성학회: Conference Proceedings / Proceedings of the 2007 Joint Conference of 대한음성학회 and 한국음성과학회 / pp.213-216 / 2007
  • The purpose of this study is to observe how Korean learners with low (KL) and high (KH) English proficiency manifest English rhythm with respect to the relative temporal stability, or temporal constraint, of syllables. In this study, a speech cycling task, in which a short phrase is repeated in time with a series of beeps at equal intervals, was used to examine the temporal distribution of stressed beats.

The Contribution of Prosody to the Foreign Accent of Chinese Talkers' English Speech

  • Liu, Xing;Lee, Joo-Kyeong
    • 말소리와 음성과학 / Vol. 4, No. 3 / pp.59-73 / 2012
  • This study investigates the contribution of prosody to the foreign accent in Chinese speakers' English production by examining synthesized speech that crosses native and non-native talkers' prosody and segments. For the stimuli of the foreign accent ratings, we transplanted gender-matched native speakers' prosody onto non-native talkers' segments and vice versa, utilizing the TD-PSOLA algorithm. Eight native English listeners participated in judging the foreign accent and comprehensibility of the transplanted stimuli. Results showed that the synthesized stimuli were perceived as having a stronger foreign accent, regardless of speakers' proficiency, when English speakers' prosody was crossed with Chinese speakers' segments. This suggests that segments contribute more than prosody to native listeners' evaluation of foreign accent. When transplanted with English speakers' segments, Chinese speakers' prosody showed a difference in duration rather than pitch between high and low proficiency, such that a stronger foreign accent was detected when low-proficiency Chinese speakers' duration was crossed with English speakers' segments. This indicates that prosody, more specifically duration, plays a role, although overall the prosodic role is not as significant as that of segments. According to the subsequent acoustic analysis, the temporal features that made the duration parameter more prominent than pitch were speaking rate, pause duration, and pause frequency. Finally, foreign accent and comprehensibility showed no significant correlation, such that native listeners had no difficulty understanding heavily foreign-accented speech.

A Comparative Study of Relative Distances among English Front Vowels Produced by Korean and American Speakers

  • 양병곤
    • 말소리와 음성과학 / Vol. 5, No. 4 / pp.99-107 / 2013
  • The purpose of this study is to examine the relative distances among English front vowels in a message produced by 47 Korean and American speakers in order to improve the teaching of English vowel pronunciation to Korean learners of English. A Praat script was developed to collect the first and second formant values (F1 and F2) of eight words in each sound file, which was taken from an internet speech archive. Then, the Euclidean distances were measured for three vowel pairs: [i-ɛ], [i-ɪ], and [ɛ-æ]. The first vowel pair [i-ɛ] was set as the reference against which the relative distances of the other two vowel pairs were measured in percent, in order to compare the vowel sounds among speakers of different vocal tract lengths. Results show that F1 values of the front vowels produced by the Korean and American speakers increased from the high front vowel to the low front vowel, with differences among the groups. The Korean speakers generally produced the front vowels with smaller jaw openings than the American speakers did. Secondly, the relative distance of the high front vowel pair [i-ɪ] showed a significant difference between the Korean and American speakers, while that of the low front vowel pair [ɛ-æ] showed a non-significant difference. Finally, the Korean speakers at the higher proficiency level produced front vowels with higher F1 values than those at the lower proficiency level. The author concluded that Korean speakers should produce the high front vowels distinctively by securing sufficient relative distance between the formant values. Further studies would be desirable to examine how strongly the Korean speakers' English proficiency correlates with the relative distance of target words in comparable productions.
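
The relative-distance measure described in this abstract can be sketched as follows: compute the Euclidean distance between two vowels in the F1-F2 plane, then express the [i-ɪ] and [ɛ-æ] distances as a percentage of the reference pair [i-ɛ]. This is a minimal sketch, not the author's Praat script. The formant values are invented for illustration, and the function names are assumptions.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two (F1, F2) points, in Hz."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def relative_distances(formants):
    """Express the [i-ɪ] and [ɛ-æ] distances as a percent of the [i-ɛ] reference."""
    reference = euclidean(formants["i"], formants["ɛ"])
    return {
        "i-ɪ": 100 * euclidean(formants["i"], formants["ɪ"]) / reference,
        "ɛ-æ": 100 * euclidean(formants["ɛ"], formants["æ"]) / reference,
    }


# Hypothetical (F1, F2) values for one speaker; not data from the study.
speaker = {
    "i": (300, 2300),
    "ɪ": (390, 2180),
    "ɛ": (600, 1900),
    "æ": (720, 1740),
}
print(relative_distances(speaker))
```

Dividing by a within-speaker reference distance is what makes the measure comparable across speakers whose absolute formant values differ because of vocal tract length.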

Acoustic Measurement of English read speech by native and nonnative speakers

  • Choi, Han-Sook
    • 말소리와 음성과학 / Vol. 3, No. 3 / pp.77-88 / 2011
  • Foreign accent in second language production depends heavily on the transfer of features from the first language. This study examines acoustic variations in segments and suprasegmentals by native and nonnative speakers of English, searching for patterns of transfer and plausible indexes of foreign accent in English. The acoustic variations are analyzed in recorded read speech by 20 native English speakers and 50 Korean learners of English, in terms of vowel formants, vowel duration, and syllabic variation induced by stress. The results show that the acoustic measurements of vowel formants and of vowel and syllable durations differ between native and nonnative speakers. The difference is robust in the production of lax vowels, diphthongs, and stressed syllables, namely the English-specific features. L1 transfer on L2 specification is found at both the segmental and the suprasegmental levels. The transfer levels measured for groups and individuals further show a continuum of divergence from the native-like target. Overall, the oldest group, graduate students, shows more native-like patterns, suggesting a weaker foreign accent in English, whereas the high school students tend to show larger deviations from the native speakers' patterns. Individual results show interdependence between segmental transfer and prosodic transfer, and a correlation with self-reported proficiency levels. Additionally, experience factors such as length of English study and length of residence in English-speaking countries are discussed as factors explaining the acoustic variation.

English vowel production conditioned by probabilistic accessibility of words: A comparison between L1 and L2 speakers

  • Jonny Jungyun Kim;Mijung Lee
    • 말소리와 음성과학 / Vol. 15, No. 1 / pp.1-7 / 2023
  • This study investigated the influences of probabilistic accessibility of the word being produced - as determined by its usage frequency and neighborhood density - on native and high-proficiency L2 speakers' realization of six English monophthong vowels. The native group hyperarticulated the vowels over an expanded acoustic space when the vowel occurred in words with low frequency and high density, supporting the claim that vowel forms are modified in accordance with the probabilistic accessibility of words. However, temporal expansion occurred in words with greater accessibility (i.e., with high frequency and low density) as an effect of low phonotactic probability in low-density words, particularly in attended speech. This suggests that temporal modification in the opposite direction may be part of the phonetic characteristics that are enhanced in communicatively driven focus realization. Conversely, none of these spectral and temporal patterns were found in the L2 group, thereby indicating that even the high-proficiency L2 speakers may not have developed experience-based sensitivity to the modulation of sub-categorical phonetic details indexed with word-level probabilistic information. The results are discussed with respect to how phonological representations are shaped in a word-specific manner for the sake of communicatively driven lexical intelligibility, and what factors may contribute to the lack of native-like sensitivity in L2 speech.

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

  • Miran Kim
    • 말소리와 음성과학 / Vol. 15, No. 2 / pp.13-20 / 2023
  • This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.
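
The word error rate (WER) mentioned in this abstract is conventionally computed as the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the ASR output, divided by the number of reference words. The sketch below implements that standard definition. It is not the study's scoring pipeline, and the lowercasing and whitespace tokenization are simplifying assumptions.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Invented example: one substitution ("sat" -> "sit") and one deletion ("the").
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(round(wer, 3))
```

Because insertions count as errors, WER can exceed 1.0 when the recognizer outputs many more words than the reference contains.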

Korean Students' Repetition of English Sentences Under Noise and Speed Conditions

  • 김은지;양병곤
    • 음성과학 / Vol. 11, No. 2 / pp.105-117 / 2004
  • Recently, many scholars have emphasized the importance of English listening ability for smoother communication. Most audio materials, however, were recorded in a quiet sound-proof booth. Therefore, students who have spent much time listening to such ideal audio materials are expected to have difficulty communicating with native speakers in real life. In this study, we examined how well thirty-three Korean university students and five native speakers repeated recorded English sentences under noise and speed conditions. The subjects' production was scored by listening to each recorded sentence, counting the number of words correctly produced, and calculating the percentage of correctly produced words out of the total words in each sentence. Results showed that the student group correctly repeated around 65% of all the words in each sentence, while the native speakers demonstrated an almost perfect match. The students seemed to have difficulty perceiving and repeating function words in various conditions. Also, the high-proficiency student group outperformed the low-proficiency student group, particularly in their repetition of function words. In addition, the students' repetition accuracy dropped markedly when the normal sentences were both sped up and mixed with noise. Finally, the Korean students' percent correct fell as the stimulus sentence became longer.
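
The scoring described in this abstract (the percentage of words in each sentence that were correctly repeated) can be approximated as below. In the study the scoring was done by human listeners. This sketch instead assumes a written transcript of each repetition and counts a target word as correct if it appears in that transcript, ignoring word order, which is a simplification. The function name and example sentences are hypothetical.

```python
from collections import Counter


def percent_words_correct(target, repetition):
    """Percent of target words that also occur in the repetition (order ignored)."""
    target_words = target.lower().split()
    if not target_words:
        raise ValueError("target sentence must contain at least one word")
    # Multiset intersection: a repeated word is credited at most as many
    # times as it occurs in the target sentence.
    matched = Counter(target_words) & Counter(repetition.lower().split())
    return 100 * sum(matched.values()) / len(target_words)


# Invented example: the function words "has" and "for" are dropped.
score = percent_words_correct(
    "she has been waiting for the bus",
    "she been waiting the bus",
)
print(round(score, 1))
```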

The relationship between vowel production and proficiency levels in L2 English produced by Korean EFL learners

  • Lee, Seohee;Rhee, Seok-Chae
    • 말소리와 음성과학 / Vol. 11, No. 2 / pp.1-13 / 2019
  • This study explored the relationship between accurate vowel production and proficiency levels in L2 English produced by Korean EFL adult learners. To this end, nine English vowels /i, ɪ, ɛ, æ, ʌ, ɔ, ɑ, ʊ, u/ were selected and adjacent vowels were paired up (e.g., /i/-/ɪ/, /u/-/ʊ/, /ɛ/-/æ/, /ʌ/-/ɔ/, /ɔ/-/ɑ/). The spectral features of the pairs were measured instrumentally, namely F1 (indicating tongue height) and F2 (indicating tongue backness). In addition, the durations as well as the spectral features of the tense and lax counterparts in /i/-/ɪ/ and /u/-/ʊ/ were measured, as both temporal and spectral features are important in distinguishing them. The findings of this study confirm that higher-rated speakers were better able to distinguish the contrasts in the front vowel pairs /i/-/ɪ/ and /ɛ/-/æ/ than lower-rated learners, but in the central and back vowel pairs /u/-/ʊ/ and /ʌ/-/ɔ/ (though not /ɔ/-/ɑ/), Korean EFL learners generally showed difficulty distinguishing adjacent vowels with spectral cues. On the other hand, the durations of the tense and lax vowels showed that the lower-rated speakers were less able to use the temporal feature to differentiate tense vowels from their lax counterparts, in contrast to previous studies, which found that Korean learners in general depend excessively on the temporal cue to distinguish tense and lax vowels.