• Title/Summary/Keyword: A prosodic phrase

Search Result 83, Processing Time 0.022 seconds

The Phonetic Realization of intermediate phrase in French Intonation (프랑스어 억양구조에서 중간구의 음성적 실현 양상)

  • Yuh, Hea-Oak;Lee, Eun-Yung
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.185-200
    • /
    • 2002
  • The current study confirmed the existence of an ip prosodic level in French intonation structure, as previously proposed by Sun-Ah Jun & $C\acute{e}cile$cile Fougeron (2000). However, in contrast to the previous suggestion of the plateau realized in an ip in several syntactic structures, the current study supposed that the plateau doesn't come from the different type of syntactic structures but arise from the unspecified syllables without any PA in an ip. Because if we limited ip phrasal tone to the syntactic structure, it would be difficult to find the more general reasons of ip level. Besides /Hi/ and /$H^*$/ we also used /$Hi^*$/ for the focused syllable in the current study. In emphasized sentences, in general, /$Hi^*$/ appeared in the first or second syllable of a leftward AP in an ip and /$H^*$/ in the final syllable of a rightmost AP of an ip, In contrast to these PAs, /$Hi^*$/ might appear in any syllable in an ip, but not to far from /$H^*$/ because the duration time and length t of plateau realized between /$Hi^*$/ and /$H^*$/ or /Hi/ and /$H^*$/ would make an essential harmonious rhythmic unit, Therefore, the current study determined the duration time and the number of syllables realized in each plateau in an ip level composed of more than one AP. As a phrase constituent structure, there is a practical need for intermediate prosodic units to allow for generalization over the many possible combinations of prosodic patterns that can occur. Further evidence is still needed to analyze and relate the different pitch ranges of the plateau of an ip according to the syntactic structure, to identify the considerable character in the French prosodic hierarchy.

  • PDF

Prosodic Annotation in a Thai Text-to-speech System

  • Potisuk, Siripong
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2007.11a
    • /
    • pp.405-414
    • /
    • 2007
  • This paper describes a preliminary work on prosody modeling aspect of a text-to-speech system for Thai. Specifically, the model is designed to predict symbolic markers from text (i.e., prosodic phrase boundaries, accent, and intonation boundaries), and then using these markers to generate pitch, intensity, and durational patterns for the synthesis module of the system. In this paper, a novel method for annotating the prosodic structure of Thai sentences based on dependency representation of syntax is presented. The goal of the annotation process is to predict from text the rhythm of the input sentence when spoken according to its intended meaning. The encoding of the prosodic structure is established by minimizing speech disrhythmy while maintaining the congruency with syntax. That is, each word in the sentence is assigned a prosodic feature called strength dynamic which is based on the dependency representation of syntax. The strength dynamics assigned are then used to obtain rhythmic groupings in terms of a phonological unit called foot. Finally, the foot structure is used to predict the durational pattern of the input sentence. The aforementioned process has been tested on a set of ambiguous sentences, which represents various structural ambiguities involving five types of compounds in Thai.

  • PDF

Boundary Tones of Intonational Phrase-Final Morphemes in Dialogues (대화체 억양구말 형태소의 경계성조 연구)

  • Han, Sun-Hee
    • Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.219-234
    • /
    • 2000
  • The study of boundary tones in connected speech or dialogues is one of the most underdeveloped areas of Korean prosody. This. paper concerns the boundary tones of intonational phrase-final morphemes which are shown in the speech corpus of dialogues. Results of phonetic analysis show that different kinds of boundary tones are realized, depending on the positions of the intonational phrase-final morphemes in the sentences.. This study has also shown that boundary tone patterning is somewhat related to the sentence structure, and for better speech recognition and speech synthesis, it presents a simple model of boundary tones based on the fundamental frequency contour. The results of this study will contribute to our understanding of the prosodic pattern of Korean connected speech or dialogues.

  • PDF

A Study on the Characteristics of the Intonational Slope of the Korean Broadcasting News Utterances (한국어 방송 뉴스 발화의 억양 기울기 특성 연구)

  • In, Ji-Young;Seong, Cheol-Jae
    • MALSORI
    • /
    • no.66
    • /
    • pp.21-39
    • /
    • 2008
  • The purpose of this study is to analyze the intonational slope characteristics of the Korean news utterances. Prosodic phrases were analyzed in terms of the K-ToBI labeling system. In addition, the change of intonation contour that occurs throughout the sentences was discussed in terms of types of media and gender. Results showed that the overall declination of the intonation contour of radio and male revealed a gentler slope than that of TV and female, respectively. While the regression of the top line slope showed male's higher $R^2$ with the number of words, the base line slope of the radio and female was proved to be highly influenced from the number of syllables, words, and prosodic phrases. A lot more independent variables statistically affected to the base line slope. This means that the base line slope was strongly related to the variables, the top line slope, otherwise, could be more freely fluctuated due to the light correlation with them.

  • PDF

Perceptual discrimination of wh-scopes in Gyeongsang Korean (경상 방언 의문문 작용역의 지각 구분)

  • Yun, Weonhee
    • Phonetics and Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.1-10
    • /
    • 2022
  • A wh-phrase positioned in an embedded clause can be interpreted as having a matrix scope if the sentence is produced with proper prosodic structures such as the wh-intonation. In a previous experiment, a sentence with a wh-phrase in an embedded clause was given to 40 speakers of Gyeongsang Korean. A script containing the sentence was provided to induce a matrix scope interpretation for the wh-phrase. These 40 utterances were prepared as stimuli for a perception test to verify whether the wh-phrases in the stimuli were perceived as having matrix scopes. Each utterance was played thrice to 24 subjects. The results showed that more than half of the 72 responses indicated a preference for an embedded scope rather than a matrix scope in 20 of the utterances. A multiple linear regression analysis showed that the matrix scope responses were best predicted by the magnitude of the pitch prominence in a prosodic word consisting of an embedded verb and a complementizer. The pitch prominence was calculated by subtracting the fundamental frequency (F0) at the right edge of the prosodic word from the peak F0 in the same prosodic word. The smaller the magnitude, the more matrix responses there were. These results suggest that the categorical perception of wh-scopes is based on the magnitude of pitch prominence.

Intonational Pattern Frequency of Seoul Korean and Its Implication to Word Segmentation

  • Kim, Sa-Hyang
    • Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.21-30
    • /
    • 2008
  • The current study investigated distributional properties of the Korean Accentual Phrase and their implication to word segmentation. The properties examined were the frequency of various AP tonal patterns, the types of tonal patterns that are imposed upon content words, and the average number and temporal location of content words within the AP. A total of 414 sentences from the Read speech corpus and the Radio corpus were used for the data analysis. The results showed that the 84% of the APs contained one content word, and that almost 90% of the content words are located in AP-initial position. When the AP-initial onset was not an aspirated or tense consonant, the most common AP patterns were LH, LHH, and LHLH (78%), and 88% of the multisyllabic content words start with a rising tone in AP-initial position. When the AP-initial onset was an aspirated or tense consonant, the most common AP patterns were HH, HHLH, and HHL (72%), and 74% of the multisyllabic content words start with a level H tone in AP-initial position. The data further showed that 84.1% of APs end with the final H tone. The findings provide valuable information about the prosodic pattern and structure of Korean APs, and account for the results of a previous study which showed that Korean listeners are sensitive to AP-initial rising and AP-final high tones (Kim, 2007). This is in line with other cross-linguistic research which has revealed the correlation between prosodic probability and speech processing strategy.

  • PDF

English listening error analyses based on intonation phrases (억양단위에 기초한 영어 청해 오류분석)

  • Lee Kyungmi
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.163-167
    • /
    • 2003
  • Intonation as suprasegmental phonetic features conveys meanings on the postlexical or utterance level in a linguistically structured way. It includes three aspects: tunes, relative prominence, and intonational phrasing. In this article, I will treat how prosodic phrasing is functionally related to the listening comprehension of English by analysing the students' errors of listening comprehension. When utterance meaning is conveyed, it is realized to be divided into intonational phrases. The small intonational phrase is regarded as an intermediate phrase which has a primary accent and a phrase tone or audible break. Most students' errors of listening occurred with linking pronunciation in the intermediate phrases of the fast speech. Thus through the smallest unit with tune we can help students improve their pronunciation and listening ability of English.

  • PDF

A study on the change of prosodic units by speech rate and frequency of turn-taking (발화 속도와 말차례 교체 빈도에 따른 운율 단위 변화에 관한 연구)

  • Won, Yugwon
    • Phonetics and Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.29-38
    • /
    • 2022
  • This study aimed to analyze the speech appearing in the National Institute of Korean Language's Daily Conversation Speech Corpus (2020) and reveal how the speech rate and the frequency of turn-taking affect the change in prosody units. The analysis results showed a positive correlation between intonation phrase, word phrase frequency, and speaking duration as the speech speed increased; however, the correlation was low, and the suitability of the regression model of the speech rate was 3%-11%, which was weak in explanatory power. There was a significant difference in the mean speech rate according to the frequency of the turn-taking, and the speech rate decreased as the frequency of the turn-taking increased. In addition, as the frequency of turn-taking increased, the frequency of intonation phrases, the frequency of word phrases, and the speaking duration decreased; there was a high negative correlation. The suitability of the regression model of the turn-taking frequency was calculated as 27%-32%. The frequency of turn-taking functions as a factor in changing the speech rate and prosodic units. It is presumed that this can be influenced by the disfluency of the dialogue, the characteristics of turn-taking, and the active interaction between the speakers.

The Role of Post-lexical Intonational Patterns in Korean Word Segmentation

  • Kim, Sa-Hyang
    • Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.37-62
    • /
    • 2007
  • The current study examines the role of post-lexical tonal patterns of a prosodic phrase in word segmentation. In a word spotting experiment, native Korean listeners were asked to spot a disyllabic or trisyllabic word from twelve syllable speech stream that was composed of three Accentual Phrases (AP). Words occurred with various post-lexical intonation patterns. The results showed that listeners spotted more words in phrase-initial than in phrase-medial position, suggesting that the AP-final H tone from the preceding AP helped listeners to segment the phrase-initial word in the target AP. Results also showed that listeners' error rates were significantly lower when words occurred with initial rising tonal pattern, which is the most frequent intonational pattern imposed upon multisyllabic words in Korean, than with non-rising patterns. This result was observed both in AP-initial and in AP-medial positions, regardless of the frequency and legality of overall AP tonal patterns. Tonal cues other than initial rising tone did not positively influence the error rate. These results not only indicate that rising tone in AP-initial and AP_final position is a reliable cue for word boundary detection for Korean listeners, but further suggest that phrasal intonation contours serve as a possible word boundary cue in languages without lexical prominence.

  • PDF

Realizations of Discourse Focus and Structure of Intonation in Japanese (일본어의 초점 실현과 인토네이션의 구조)

  • Choi, Young-Sook
    • Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.187-200
    • /
    • 2002
  • The purpose of the present study is to see in terms of $F_{0}$ variation in Japanese how discourse focus and the lexical word accent interact with each other in realizing overall intonation patterns. Discourse focus causes prosodic restructuring of phrase structures and, as a result, largely affects pitch contours, whereas the lexical word accent is said to delimit the $F_{0}$ into a certain range. Measurement of $F_{0}$ was made of utterances of Japanese sentences to observe behavior of pitch contours with varied focus assignment and lexical accent specifications. The utterances were obtained in question-answer discourse contexts so that in a sentence, either one NP was always focused or no focus was assigned. I set four points for $F_{0}$ measurement; $F_{1s},F_{1m}, F_{2s}$, and $F_{2m}$, two for each noun phrase corresponding to $F_{0}$ at the beginning of the first syllable and that of the vocalic portion of the second syllable in the two NP's. The results of present study were as follows: (1) for all combination of lexical accent types, the $F_{0}$ rise both in NP1 and NP2 are higher when focused than when not focused. (2) NP2 starts a new accentual phrase when focused, showing even higher $F_{0}$ than NP1, the latter of which implies that in forming a new accentual phrase by focusing, catathesis does not seem to take effect on NP2 preceded by accented NP1. (3) unfocused NP2 preceded by unaccented NP1 has higher $F_{0}$ than those preceded by accented NP1.

  • PDF