• Title/Summary/Keyword: prosodic phrasing

Search Result 27, Processing Time 0.017 seconds

The Modelling of Prosodic Phrasing and Segmental Duration using CART (CART를 이용한 운율구 추출 및 음소 지속 시간 모델링)

  • 이상호
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.135-138
    • /
    • 1998
  • 본 논문에서는 트리 기반 모델링 기법 중 하나인 CART(Classification And Regression Trees) 방법을 이용하여, 운율구 추출, 운율구 사이의 휴지 기간, 음소 지속 시간을 모델링 하고자 한다. 총 400문장(약 33분)의 코퍼스를 수집한 후, 그 중 240문장(약 20분)을 이용하여 결정 트리와 회귀 트리를 학습시키고 160문장(약 13분)에 대해 실험하였다. 운율구 경계를 결정하는 결정 트리의 오류율은 14.6%이었고, 운율구 사이의 휴지 기간과 음소 지속 시간을 예측하는 회귀 트리들의 평균 제곱 오류근(RMSE)이 각각 132.61msec, 21.97msec이었다.

The Modelling of Prosodic Phrasing and Pause Duration using CART (CART를 이용한 운율구 추출 및 휴지기간 모델링)

  • 이상호
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.08a
    • /
    • pp.81-86
    • /
    • 1998
  • 트리 기반 모델링 기법 중 하나인 CART 방법을 이용하여, 운율구 추출과 운율구 사이의 휴지 기간을 모델링 하고자 한다. 모델링을 위한 특징 변수들의 유효성을 실험에 앞서 알아본 후, 생성된 트리들을 해석함으로써 제안하는 특징 변수들이 효과적임을 보인다. 음성 정보를 제외한 문서 정보만을 이용하여 실험한 결과, 운율구 경계 결정 오류율은 14.46% 이었고, 휴지 기간 예측 RMSE 가 132.61 msec 이었다.

  • PDF

An acoustic study of Korean lenis stop voicing - in relation to prosodic structure - (국어 파열연자음 유성음화에 관한 음향음성학적 고찰 -운율구조와 관련하여-)

  • Kim Hyo Sook;Kim Sun Ju;Kim Sunmi
    • MALSORI
    • /
    • no.39
    • /
    • pp.15-24
    • /
    • 2000
  • This study aims to reexamine Korean Lenis Stop Voicing (henceforth, LSV) and to specify its phonetic conditions in phonetic terms. LSV optionally occurs within certain prosodic domains. They are called 'Malthomak'(Lee, 1996),'phonological phrase'(Kang, 1992), or 'accentual phrase'(Jun, 1993). On the basis of Jun's phrasing, this study focuses on the more specific phonetic conditions of LSV in the accentual phrase medial position, sub-classifying voicing as complete and partial. The results shows that whether the stops become completely voiced or partially voiced was determined by the various phonetic environments, such as adjacent segments and following intonational phrase boundaries. It is shown that the conditions of LSV should be described in terms of more detailed phonetic environments and that they could be used in predicting the class of voicing.

  • PDF

An Acoustic Analysis of the Aspiration Merger in Korean

  • Mi, Jang
    • Phonetics and Speech Sciences
    • /
    • v.3 no.1
    • /
    • pp.67-75
    • /
    • 2011
  • In Korean, 'Aspiration Merger' is the result of the heteromorphemic sequence of lenis stop and /h/ becoming a single aspirated stop word-medially. However, the contrast between lenis stop-plus-/h/ and an underlying aspirated stop is maintained when they span Phonological Phrase boundaries. By varying the position in the prosodic domain such as APP (Across Phonological Phrase) and PPM (Phonological Phrase Medial) positions, the phonetic properties of the two categories are compared. In the results from noise duration and change of intensity, lenis stop-plus-/h/ show a large difference between the APP and PPM positions. The results from a noise duration comparison show that the two categories are completely neutralized into aspirated stop in the PPM position and the complete neutralization is sensitive to prosodic phrasing.

  • PDF

The acquisition of boundary tones in spontaneous speech by Korean learners of English

  • Choe, Wook Kyung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.47-55
    • /
    • 2020
  • The current study was designed to investigate which type of phrase boundary tones high-intermediate Korean learners of English used in their spontaneous speech. These boundary tones were compared to those used in native speakers' spontaneous speech to examine whether the learners successfully acquired the use of boundary tones. To achieve this purpose, 10 Korean learners of English and four native speakers of English participated in the current study. The participants were asked to summarize the stories of short videos, and the tonal and the phrasing patterns of the obtained spontaneous speech were analyzed using Tone and Break Indices (ToBI) transcription conventions. The results indicated that both the native speakers and the Korean learners frequently marked their intonational phrase boundaries with high boundary tones. However, regarding the prosodic phrase positions within a sentence, Korean learners frequently used steep rising tones (i.e., H-H%) while native speakers used gradual rising tones (i.e., L-H%) for sentence-final intonational phrases. Overall, the findings suggested that high-intermediate Korean learners understood the forward-looking function of the high boundary tones and that they were able to make use of these tones to mark intonational phrases in their spontaneous speech.

Tree-based Modeling of Prosodic Phrasing and Segmental Duration (운율구 추출 및 음소 지속 시간의 트리 기반 모델링)

  • 이상호;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.6
    • /
    • pp.43-53
    • /
    • 1998
  • 본 논문에서는 한국어 TTS시스템을 위한 운율구 추출, 운율구 사이의 휴지 기간, 음소의 지속 시간 모델링 방법을 설명한다. 실험을 위해 여러 장르로 구성된 400문장을 선 정하고, 이를 전문 여성 아나운서가 발성하였다. 녹음된 음성 신호에 대해 음소 및 운율구 경계를 결정하고, 문장에 대해서는 형태소 분석, 발음표기 변환, 구문 분석을 수행하였다. 400문장(약33분) 중 240문장(약20분)을 이용하여 결정 트리 및 회귀 트리를 학습시킨 후, 160분장(약13분)에 대해 실험하였다. 운율 모델링을 위한 특징들이 제안되었고, 학습된 트리 들을 해석함으로써 특징들의 유효성이 평가되었다. 실험 문장에 대해 운율구 경계의 유무를 결정하는 결정 트리의 오류율은 14.46%이었고, 운율구 사이의 휴지 기간과 음소 지속 시간 을 예측하기 위한 회귀 트리들의 평균 제곱 오류근(RMSE)이 각각 132msec, 22msec이었다. 수집된 모든 자료(400문장)로 학습한 결과, 운율구 경계 결정 오류율, 휴지 기간 및 지속시 간 RMSE의 10-fold cross-validation 추정치가 각각 13.77%, 127.91msec, 21.54msec이었다.

  • PDF

Prediction of Break Indices in Korean Read Speech (국어 낭독체 발화의 운율경계 예측)

  • Kim Hyo Sook;Kim Chung Won;Kim Sun Ju;Kim Seoncheol;Kim Sam Jin;Kwon Chul Hong
    • MALSORI
    • /
    • no.43
    • /
    • pp.1-9
    • /
    • 2002
  • This study aims to model Korean prosodic phrasing using CART(classification and regression tree) method. Our data are limited to Korean read speech. We used 400 sentences made up of editorials, essays, novels and news scripts. Professional radio actress read 400sentences for about two hours. We used K-ToBI transcription system. For technical reason, original break indices 1,2 are merged into AP. Differ from original K-ToBI, we have three break index Zero, AP and IP. Linguistic information selected for this study is as follows: the number of syllables in ‘Eojeol’, the location of ‘Eojeol’ in sentence and part-of-speech(POS) of adjacent ‘Eojeol’s. We trained CART tree using above information as variables. Average accuracy of predicting NonIP(Zero and AP) and IP was 90.4% in training data and 88.5% in test data. Average prediction accuracy of Zero and AP was 79.7% in training data and 78.7% in test data.

  • PDF