• Title/Summary/Keyword: Prosodic perception

Search Result 36, Processing Time 0.022 seconds

The role of prosody in dialect authentication Simulating Masan dialect with Seoul speech segments

  • Yoon, Kyu-Chul
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.234-239
    • /
    • 2007
  • The purpose of this paper is to examine the viability of simulating one dialect with the speech segments of another dialect through prosody cloning. The hypothesis is that, among Korean regional dialects, it is not the segmental differences but the prosodic differences that play a major role in authentic dialect perception. This work intends to support the hypothesis by simulating Masan dialect with the speech segments from Seoul dialect. The dialect simulation was performed by transplanting the prosodic features of Masan utterances unto the same utterances produced by a Seoul speaker. Thus, the simulated Masan utterances were composed of Seoul speech segments but their prosody came from the original Masan utterances. The prosodic features involved were the fundamental frequency contour, the segmental durations, and the intensity contour. The simulated Masan utterances were evaluated by four native Masan speakers and the role of prosody in dialect authentication and speech synthesis was discussed.

  • PDF

A Study Using Acoustic Measurement and Perceptual Judgment to identify Prosodic Characteristics of English as Spoken by Koreans (음향 측정과 지각 판단에 의한 한국인 영어의 운율 연구)

  • Koo, Hee-San
    • Speech Sciences
    • /
    • v.2
    • /
    • pp.95-108
    • /
    • 1997
  • The purpose of this experimental study was to investigate prosodic characteristics of English as spoken by Koreans. Test materials were four English words, a sentence, and a paragraph. Six female Korean speakers and five native English speakers participated in acoustic and perceptual experiments. Pitch and duration of word syllables were measured from signals and spectrograms made by the Signalize 3.04 software program for Power Mac 7200. In the perceptual experiment, accent position, intonation patterns, rhythm patterns and phrasing were evaluated by the five native English speakers. Preliminary results from this limited study show that prosodic characteristics of Koreans include (1) pitch on the first part of a word and sentence is lower than that of English speakers, but the pitch on the last part is the opposite; (2) word prosody is quite similar to that of an English speaker, but sentence prosody is quite different; (3) the weakest point of sentence prosody spoken by Koreans is in the rhythmic pattern.

  • PDF

Word class information in perception of prosodic prominence by Korean learners of English

  • Im, Suyeon
    • Phonetics and Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.1-8
    • /
    • 2019
  • This study aims to investigate how prosodic prominence is perceived in relation to word class information (or parts-of-speech) by Korean learners of English compared with native English speakers in public speech. Two groups, Korean learners of English and native English speakers, were asked to judge words perceived as prominent simultaneously while listening to a speech. Parts-of-speech and three acoustic cues (i.e., max F0, mean phone duration, and mean intensity) were analyzed for each word in the speech. The results showed that content words tended to be higher in pitch and longer in duration than function words. Both groups of listeners rated prominence on content words more frequently than on function words. This tendency, however, was significantly greater for Korean learners of English than for native English speakers. Among the parts-of-speech of the content words, Korean learners of English were more likely than native English speakers to judge nouns and verbs as prominent. This study presents evidence that Korean learners of English consider most, if not all, content words as landing locations of prosodic prominence, in alignment with the previous study on the production of prominence.

Perception of Transplanted English Prosody by American and Korean Listeners

  • Yi, So-Pae
    • Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.73-89
    • /
    • 2007
  • This study explored the perception of transplanted English prosody by thirty American and Korean, male and female listeners. The English utterances of various sentence types produced by Korean and American male speakers were employed to transplant the American prosody contours to Korean English utterances. Then, the thirty subjects were instructed to rate the transplanted prosodic components. Results showed that the interactions between the three factors (e.g., rater groups & transplantation types; transplantation types & sentence types; rater groups & transplantation types & sentence types) turned out to be meaningful. Both Americans and Koreans perceived the effectiveness of the combined effect of transplanted duration and pitch or duration and pitch and intensity. However, when perceiving individual prosodic components, Americans and Koreans showed different perceptual ratings. As for the overall prosody change, Americans perceived the change of intensity in a significant way but Koreans did not because intensity is not a crucial semantic factor in Korean. Americans rated the transplantation of duration alone as ineffective while Koreans rated otherwise. This was explained by the difference between English and Korean. The difference of perspective was also significant with different sentence types, especially with the three sentence types that had speech rates slower than other sentence types. A slower speech rate intensified the mismatch between the transplanted duration and the original pitch causing a negative impression on American listeners whereas this did not affect Korean listeners. Pedagogical implications of the findings are discussed.

  • PDF

Perceptual discrimination of wh-scopes in Gyeongsang Korean (경상 방언 의문문 작용역의 지각 구분)

  • Yun, Weonhee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.1-10
    • /
    • 2022
  • A wh-phrase positioned in an embedded clause can be interpreted as having a matrix scope if the sentence is produced with proper prosodic structures such as the wh-intonation. In a previous experiment, a sentence with a wh-phrase in an embedded clause was given to 40 speakers of Gyeongsang Korean. A script containing the sentence was provided to induce a matrix scope interpretation for the wh-phrase. These 40 utterances were prepared as stimuli for a perception test to verify whether the wh-phrases in the stimuli were perceived as having matrix scopes. Each utterance was played thrice to 24 subjects. The results showed that more than half of the 72 responses indicated a preference for an embedded scope rather than a matrix scope in 20 of the utterances. A multiple linear regression analysis showed that the matrix scope responses were best predicted by the magnitude of the pitch prominence in a prosodic word consisting of an embedded verb and a complementizer. The pitch prominence was calculated by subtracting the fundamental frequency (F0) at the right edge of the prosodic word from the peak F0 in the same prosodic word. The smaller the magnitude, the more matrix responses there were. These results suggest that the categorical perception of wh-scopes is based on the magnitude of pitch prominence.

Segmental Interpretation of Suprasegmental Properties in Non-native Phoneme Perception

  • Kim, Miran
    • Phonetics and Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.117-128
    • /
    • 2015
  • This paper investigates the acoustic-perceptual relation between Korean dent-alveolar fricatives and the English voiceless alveolar fricative /s/ in varied prosodic contexts (e.g., stress, accent, and word initial position). The denti-alveolar fricatives in Korean show a two-way distinction, which can be referred to as either plain (lenis) /s/ or fortis /$s^*$/. The English alveolar voiceless fricative /s/ that corresponds to the two Korean fricatives would be placed in a one-to-two non-native phoneme mapping situation when Korean listeners hear English /s/. This raises an interesting question of how the single fricative of English perceptually maps into the two-way distinction in Korean. This paper reports the acoustic-perceptual mapping pattern by investigating spectral properties of the English stimuli that are heard as either /s/ or /$s^*$/ by Korean listeners, in order to answer the two questions: first, how prosody influences fricatives acoustically, and second, how the resultant properties drive non-native listeners to interpret them as segmental features instead of as prosodic information. The results indicate that Korean listeners' responses change depending on the prosodic context in which the stimuli are placed. It implies that Korean speakers interpret some of the information provided by prosody as segmental one, and that the listeners take advantage of the information in their judgment of non-native phonemes.

Improvement of Naturalness for a HMM-based Korean TTS using the prosodic boundary information (운율경계정보를 이용한 HMM기반 한국어 TTS 자연성 향상 연구)

  • Lim, Gi-Jeong;Lee, Jung-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.9
    • /
    • pp.75-84
    • /
    • 2012
  • HMM-based Text-to-Speech systems generally utilize context dependent tri-phone units from a large corpus speech DB to enhance the synthetic speech. To downsize a large corpus speech DB, acoustically similar tri-phone units are clustered based on the decision tree using context dependent information. Context dependent information includes phoneme sequence as well as prosodic information because the naturalness of synthetic speech highly depends on the prosody such as pause, intonation pattern, and segmental duration. However, if the prosodic information was complicated, many context dependent phonemes would have no examples in the training data, and clustering would provide a smoothed feature which will generate unnatural synthetic speech. In this paper, instead of complicate prosodic information we propose a simple three prosodic boundary types and decision tree questions that use rising tone, falling tone, and monotonic tone to improve naturalness. Experimental results show that our proposed method can improve naturalness of a HMM-based Korean TTS and get high MOS in the perception test.

Identification of English labial consonants by Korean EFL learners (한국 EFL 학습자들의 영어 순자음의 인지)

  • Cho, Mi-Hui
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.788-791
    • /
    • 2006
  • The perception of English labial consonants was investigated via experiment where 40 Korean EFL learners identified nonwords with the target labial consonants [p, b, f, v] in 4 different prosodic locations. The results showed that there was a strong positional effect since the accuracy rates of the four target consonants differed by position. Specifically, the average accuracy rate for the target consonants was higher in the stressed intervocalic position and initial onset position than in the unstressed intervocalic position and final coda position. Further, the accuracy rate for [f] is was high in all prosodic locations except the unstressed intervocalic position. This is unexpected in markedness theory given that fricatives are assumed to be more difficult to learn than stops.

  • PDF

Patterns of categorical perception and response times in the matrix scope interpretation of embedded wh-phrases in Gyeongsang Korean (경상 방언 내포문 의문사의 작용역 범주 지각 양상과 반응 속도 연구)

  • Weonhee Yun
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.1-11
    • /
    • 2023
  • This study investigated the response time and patterns of categorical perception of the wh-scope of an embedded clause with the non-bridge verb, "gung-geum hada 'wonder'," in the matrix verb phrase in Gyeongsang Korean. Using the same procedure as Yun (2022), 72 responses and response times for each stimulus were collected from 24 participants over the course of three trials. The stimuli were recorded readings of 40 speakers (20 male, 20 female). Context was provided to induce a matrix scope interpretation of the embedded wh-phrase in the target sentence. We sorted the 40 stimuli according to the number of matrix scope responses each received, and charted the response times for each stimulus. Although there was considerable overlap for the different types of wh-scope interpretations, there was a clear difference in categorical perception between the matrix and embedded scopes. The 24 participants also differed in their categorical perceptions. The results suggested that response time and wh-scope interpretation were not directly related and that two main weighted factors affected wh-scope interpretation: morpho-syntactic constraints and prosodic structural integrity. The weighting of each of these factors was inversely correlated and varied among subjects.

Acoustic Realization of Metrical Structure in Orally Produced Korean Modern Poetry (한국 현대시 운율의 음향 발현)

  • Kim, Hyun-Gi;Hong, Ki-Hwan;Kim, Sun-Sook
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.181-192
    • /
    • 2004
  • The metrical structures in orally produced the poetry were generally analyzed by accent, metre and syllable. The purpose of this study is to investigate of metrical structures of Korean modem poetry using computer implemented speech analysis system. Two famous poet's poems confidential talk, Miloe and 'A buddhist dance, Sungmu' were selected for prosodic analysis. The informant is 60 years old professor in major of Korean and French poetry. The syllable structures of poems were analyzed primarily by vowel timbers, which can classified compact and diffuse vowels according to the distance of F2-F1. The perception cues of consonants were analyzed by VOT and tensity features of articulation. Rhythm is classified by dactyl, anapest, trochee, spondee and iambic. As a result, syllable structures of Korean modem poetry were mainly CV and CVC and the reading times of each lines were 3-4sec for 12 and 15 syllables. Main metre of Korean modem poems constructed the Imbic and Anapest. The break of each lines were demarcated by grammatical structure or meaning rather than phonetic structures.

  • PDF