• Title/Abstract/Keywords: intonation phrase


일본어의 초점 실현과 인토네이션의 구조 (Realizations of Discourse Focus and Structure of Intonation in Japanese)

  • 최영숙
    • 음성과학
    • /
    • Vol. 9, No. 4
    • /
    • pp.187-200
    • /
    • 2002
  • The purpose of the present study is to examine, in terms of $F_{0}$ variation in Japanese, how discourse focus and lexical word accent interact in realizing overall intonation patterns. Discourse focus causes prosodic restructuring of phrase structures and, as a result, largely affects pitch contours, whereas the lexical word accent is said to confine $F_{0}$ to a certain range. $F_{0}$ was measured in utterances of Japanese sentences to observe the behavior of pitch contours under varied focus assignments and lexical accent specifications. The utterances were obtained in question-answer discourse contexts so that, in each sentence, either one NP was focused or no focus was assigned. I set four points for $F_{0}$ measurement, $F_{1s}$, $F_{1m}$, $F_{2s}$, and $F_{2m}$, two for each noun phrase, corresponding to $F_{0}$ at the beginning of the first syllable and at the beginning of the vocalic portion of the second syllable in the two NPs. The results of the present study were as follows: (1) For all combinations of lexical accent types, the $F_{0}$ rises in both NP1 and NP2 are higher when focused than when not focused. (2) NP2 starts a new accentual phrase when focused, showing even higher $F_{0}$ than NP1, which implies that when a new accentual phrase is formed by focusing, catathesis does not seem to take effect on NP2 preceded by accented NP1. (3) Unfocused NP2 preceded by unaccented NP1 has higher $F_{0}$ than unfocused NP2 preceded by accented NP1.


일본어 악센트 특징을 이용한 합성단위 선택 기반 일본어 TTS의 후보 합성단위의 사전선택 방법 (A Pre-Selection of Candidate Units Using Accentual Characteristic In a Unit Selection Based Japanese TTS System)

  • 나덕수;민소연;이광형;이종석;배명진
    • 한국음향학회지
    • /
    • Vol. 26, No. 4
    • /
    • pp.159-165
    • /
    • 2007
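  • This paper proposes a new pre-selection method for the candidate units needed by a unit-selection-based Japanese synthesizer. Conventional pre-selection computes and uses a cost over the phoneme sequence within a single intonation phrase. Japanese, however, unlike other languages, has an accent realized as relative pitch height, and several words form a single accentual phrase. Because Japanese prosody also varies with the accentual phrase as its basic unit, reflecting accentual-phrase-level prosodic variation in pre-selection can improve sound quality, and computing the phoneme-sequence cost within accentual phrases requires less computation than doing so within intonation phrases. The proposed method defines the Japanese accentual phrase, identifies accentual phrases in the phoneme sequence, and performs pre-selection through accentual phrase matching, which computes the CCL (Connected Context Length) for each candidate of every phoneme to be synthesized in each accentual phrase. The method was implemented using VoiceText, Voiceware's synthesizer, as the baseline system and was evaluated for perceptual errors (intonation errors and concatenation errors) and synthesis time. The experiments showed that the proposed method made the synthesized speech sound more natural and improved synthesis speed.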

L2 억양에 나타나는 L1억양의 긍정적 전이와 부정적 전이 양상 - 일본인 한국어 학습자들을 중심으로 - (Positive and negative transfer of first language in producing second language - Focusing on Japanese learners of Korean -)

  • 윤영숙
    • 말소리와 음성과학
    • /
    • Vol. 8, No. 4
    • /
    • pp.71-78
    • /
    • 2016
  • The purpose of this study is to investigate the effect of Japanese (L1) on the production of Korean (L2) accentual phrases. Korean and Japanese have similar prosodic structures. Unlike Korean, however, Japanese is a pitch accent language: each word has its own pitch accent, and pitch accents are maintained in sentence intonation. This difference is expected to have a negative influence on the production of Korean sentence intonation. Four native Korean speakers and 10 advanced Japanese learners of Korean participated in the production test. The materials consisted of 11 Korean sentences, six of which contain formally identical Sino-Korean and Sino-Japanese words. The results show that the initial pitch pattern of Korean accentual phrases was affected by Japanese pitch accent types and that this interference was greater for formally identical Sino-Korean and Sino-Japanese words. Apart from the initial tones of the accentual phrase, however, some positive transfer was observed in the internal tonal pattern of the accentual phrase. In the phonetic realization, the internal pitch range and the initial pitch rise of accentual phrases were greater for Japanese learners of Korean than for native speakers of Korean.

영어 명사구와 복합명사의 억양 실현 양상과 지각 (Intonational Realization and Perception of English Noun Phrases and Compound Nouns)

  • 강선미;김미혜;전윤실;김기호
    • 음성과학
    • /
    • Vol. 12, No. 4
    • /
    • pp.153-166
    • /
    • 2005
  • This paper examines the accent implementation and perception of noun phrases and compound nouns in English sentences, arguing that the primary stress of a noun phrase or compound noun is realized as relative prominence in intonation. The production test examines how the stress patterns of noun phrases and compound nouns are realized in the intonation of native English speakers' utterances. The perception test investigates English and Korean listeners' comprehension of the intonation of noun phrases and compound nouns. The results show that speakers and listeners produce and perceive the primary stress as a relatively prominent accent, although, in contrast to English listeners, Korean learners have difficulty using the cue of pitch accent location to distinguish compound nouns from noun phrases.


한국어 운율구 기반의 피치궤적 변환의 통계적 접근 (Statistical Approaches to Convert Pitch Contour Based on Korean Prosodic Phrases)

  • Lee, Ki-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • Vol. 23, No. 1E
    • /
    • pp.10-15
    • /
    • 2004
  • In converting speech from a source speaker to a target speaker, it is important that the pitch contour of the source speaker's utterance be converted into that of the target speaker, because the pitch contour of an utterance plays an important role in expressing the speaker's individuality and the meaning of the utterance. This paper describes statistical algorithms for pitch contour conversion in Korean. Pitch contour conversion is investigated at two levels of prosodic phrases: the intonational phrase and the accentual phrase. The basic algorithm is Gaussian normalization [7] within an intonational phrase. The first presented algorithm combines this with a declination line of the pitch contour in an intonational phrase. The second is Gaussian normalization within accentual phrases, which compensates for local pitch variations. Experimental results show that Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms, which operate on intonational phrases.
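
The abstract above names Gaussian normalization but does not give its formula. A minimal sketch, assuming the commonly used form in which each source pitch value is standardized with the source statistics and rescaled with the target statistics, $f_t = (f_s - \mu_s)/\sigma_s \cdot \sigma_t + \mu_t$, might look as follows. The function names, the use of linear Hz values, and the assumption that per-phrase target statistics are already available are illustrative choices, not details taken from the paper.

```python
from statistics import mean, pstdev


def gaussian_normalize(source_f0, target_mean, target_std):
    """Map source pitch values onto an assumed target mean and std."""
    src_mean = mean(source_f0)
    src_std = pstdev(source_f0)
    if src_std == 0:
        # A flat contour carries no variation to rescale.
        return [target_mean for _ in source_f0]
    return [
        (f - src_mean) / src_std * target_std + target_mean
        for f in source_f0
    ]


def convert_by_phrase(source_phrases, target_stats):
    """Apply the normalization separately inside each phrase.

    source_phrases: list of pitch sequences, one per phrase (hypothetical
    segmentation). target_stats: list of (mean, std) pairs for the target
    speaker, one per phrase (assumed given).
    """
    return [
        gaussian_normalize(phrase, t_mean, t_std)
        for phrase, (t_mean, t_std) in zip(source_phrases, target_stats)
    ]
```

Passing one sequence per accentual phrase to `convert_by_phrase` corresponds to normalizing within accentual phrases, whereas calling `gaussian_normalize` once on a whole intonational phrase corresponds to the basic algorithm.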

발화 속도와 말차례 교체 빈도에 따른 운율 단위 변화에 관한 연구 (A study on the change of prosodic units by speech rate and frequency of turn-taking)

  • 원유권
    • 말소리와 음성과학
    • /
    • Vol. 14, No. 2
    • /
    • pp.29-38
    • /
    • 2022
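  • This study analyzed utterances in the National Institute of Korean Language Daily Conversation Speech Corpus (2020) to determine how speech rate and turn-taking frequency affect changes in prosodic units. As speech rate increased, the frequency of intonation phrases, the frequency of eojeols, and utterance length all increased, a positive correlation, but the correlation was low and the regression models had weak explanatory power, with a goodness of fit of 3%-11%. Mean speech rate differed significantly according to turn-taking frequency, and speech rate decreased as turn-taking frequency increased. In addition, as turn-taking frequency increased, the frequencies of intonation phrases and eojeols and utterance length decreased, showing a strong negative correlation. The goodness of fit of the regression models was 27%-32%. Turn-taking frequency may have acted as a factor that changes speech rate and prosodic units. This is presumed to reflect the influence of disfluency, turn-taking characteristics, and active interaction between speakers in conversational speech.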

음성 코퍼스 구축을 위한 SiTEC 분절음.운율 레이블링 기준의 검토 및 제안 (Some considerations on SiTEC segmental and prosodic labeling convention for Korean)

  • 이숙향;신지영;김봉완;이용주
    • 대한음성학회지:말소리
    • /
    • No. 46
    • /
    • pp.127-143
    • /
    • 2003
  • This paper presents the segmental labeling conventions proposed by SiTEC (Speech Information Technology Engineering Center) in 2002 and proposes a new direction for revising them into a simpler version. The paper also reviews one of the prosodic labeling conventions for Korean, the K-ToBI convention (ver. 3.1), and proposes a couple of modifications and suggestions.


K-ToBI (Korean ToBI) Labelling Conventions (Version 3.0)

  • Jun, Sun-Ah
    • 음성과학
    • /
    • Vol. 7, No. 1
    • /
    • pp.143-169
    • /
    • 2000
  • This chapter presents an overview of Korean intonational structure and proposes a revised version of K-ToBI (Korean TOnes and Break Indices), a prosodic transcription convention for Seoul Korean. In the new version of K-ToBI, the tone tier is separated into two tiers: a phonological tone tier and a phonetic tone tier. The phonological tone tier labels tones marking the prosodic structure of an utterance, and the phonetic tone tier labels the individual tones of an AP and an IP in conformity with the surface pitch contour. Labelling surface tonal patterns will provide data to test the underlying tonal patterns and to build phonetic implementation rules.


Prosodic Conditions for Epenthetic Nasals

  • Kim, Soo-Jung
    • 음성과학
    • /
    • Vol. 7, No. 4
    • /
    • pp.123-148
    • /
    • 2000
  • This paper investigates the prosodic conditions for epenthetic /n/ in Korean. It has been claimed that an epenthetic /n/ appears across prosodic words (Han 1994, Lee 1996). However, using acoustic as well as aerodynamic data, I argue that the epenthetic /n/ does not surface across all prosodic words and that its appearance is prosodically restricted. I further demonstrate that it appears only across prosodic words within an accentual phrase. This finding provides empirical support for the intonation-based model of Korean prosodic structure.


모음길이 비율에 따른 발화속도 보상을 이용한 한국어 음성인식 성능향상 (An Improvement of Korean Speech Recognition Using a Compensation of the Speaking Rate by the Ratio of a Vowel length)

  • 박준배;김태준;최성용;이정현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2003년도 컴퓨터소사이어티 추계학술대회논문집
    • /
    • pp.195-198
    • /
    • 2003
  • The accuracy of an automatic speech recognition system depends on background noise and on speaker variability such as sex, intonation, and speaking rate. In particular, variation in speaking rate, both between and within speakers, is a serious cause of misrecognition. In this paper, we propose a method that compensates for speaking rate using the ratio of each vowel's length in a phrase. First, the number of feature vectors in a phrase is estimated from speaking-rate information. Second, the estimated number of feature vectors is distributed among the syllables of the phrase according to the ratio of each syllable's vowel length. Finally, feature vectors are extracted according to the number assigned to each syllable in the phrase. As a result, the accuracy of automatic speech recognition improved with the proposed speaking-rate compensation method.
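
The second step described above, distributing an estimated number of feature vectors among syllables in proportion to vowel length, can be illustrated with a short sketch. This is not the authors' implementation: the function name `allocate_frames`, the largest-remainder rounding rule, and the assumption that the phrase-level frame count has already been estimated are all hypothetical choices made for illustration.

```python
def allocate_frames(total_frames, vowel_lengths):
    """Split total_frames among syllables in proportion to vowel length.

    total_frames: number of feature vectors for the phrase (assumed to be
    estimated beforehand from the speaking rate).
    vowel_lengths: vowel duration of each syllable, in any consistent unit.
    """
    total_length = sum(vowel_lengths)
    if total_length <= 0:
        raise ValueError("vowel lengths must sum to a positive value")

    # Ideal (fractional) share of frames for each syllable.
    exact = [total_frames * v / total_length for v in vowel_lengths]
    counts = [int(x) for x in exact]  # round each share down

    # Hand out the frames lost to rounding, largest fractional part first.
    leftover = total_frames - sum(counts)
    order = sorted(
        range(len(exact)),
        key=lambda i: exact[i] - counts[i],
        reverse=True,
    )
    for i in order[:leftover]:
        counts[i] += 1
    return counts
```

The largest-remainder step keeps the per-syllable counts summing exactly to the phrase total, which plain rounding does not guarantee.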
