• 제목/요약/키워드: Speech Tone

검색결과 200건 처리시간 0.025초

발화 속도에 따른 국어의 경계 성조 연구 (Study of Boundary Tone according to Speech Rate in Korean)

  • 박미영
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2002년도 11월 학술대회지
    • /
    • pp.73-76
    • /
    • 2002
  • The purpose of this paper is to research Korean boundary tone of sentence type and perceptive speaker's attitude according to speech rate - three type. In view of the preceding study, Korean intonation's meaning is determined by boundary tone. Also, in my experimental results, Korean boundary tone of sentence type has preferential tone. However, Korean boundary tone of sentence type is not influential according to speech rate. The speech rate's change of three pattern is influential in auditor's perceptual response. The relationship between the pitch contour of boundary tone and speech rate is not significant.

  • PDF

코퍼스 기반 한국어 합성기의 억양 구현 방안 (A Method of Intonation Modeling for Corpus-Based Korean Speech Synthesizer)

  • 김진영;박상언;엄기완;최승호
    • 음성과학
    • /
    • 제7권2호
    • /
    • pp.193-208
    • /
    • 2000
  • This paper describes a multi-step method of intonation modeling for corpus-based Korean speech synthesizer. We selected 1833 sentences considering various syntactic structures and built a corresponding speech corpus uttered by a female announcer. We detected the pitch using laryngograph signals and manually marked the prosodic boundaries on recorded speech, and carried out the tagging of part-of-speech and syntactic analysis on the text. The detected pitch was separated into 3 frequency bands of low, mid, high frequency components which correspond to the baseline, the word tone, and the syllable tone. We predicted them using the CART method and the Viterbi search algorithm with a word-tone-dictionary. In the collected spoken sentences, 1500 sentences were trained and 333 sentences were tested. In the layer of word tone modeling, we compared two methods. One is to predict the word tone corresponding to the mid-frequency components directly and the other is to predict it by multiplying the ratio of the word tone to the baseline by the baseline. The former method resulted in a mean error of 12.37 Hz and the latter in one of 12.41 Hz, similar to each other. In the layer of syllable tone modeling, it resulted in a mean error rate less than 8.3% comparing with the mean pitch, 193.56 Hz of the announcer, so its performance was relatively good.

  • PDF

Effects of Age and Type of Stimulus on the Cortical Auditory Evoked Potential in Healthy Malaysian Children

  • Mukari, Siti Zamratol-Mai Sarah;Umat, Cila;Chan, Soon Chien;Ali, Akmaliza;Maamor, Nashrah;Zakaria, Mohd Normani
    • 대한청각학회지
    • /
    • 제24권1호
    • /
    • pp.35-39
    • /
    • 2020
  • Background and Objectives: The cortical auditory evoked potential (CAEP) is a useful objective test for diagnosing hearing loss and auditory disorders. Prior to its clinical applications in the pediatric population, the possible influences of fundamental variables on the CAEP should be studied. The aim of the present study was to determine the effects of age and type of stimulus on the CAEP waveforms. Subjects and Methods: Thirty-five healthy Malaysian children aged 4 to 12 years participated in this repeated-measures study. The CAEP waveforms were recorded from each child using a 1 kHz tone burst and the speech syllable /ba/. Latencies and amplitudes of P1, N1, and P2 peaks were analyzed accordingly. Results: Significant negative correlations were found between age and speech-evoked CAEP latency for each peak (p<0.05). However, no significant correlations were found between age and tone-evoked CAEP amplitudes and latencies (p>0.05). The speech syllable /ba/ produced a higher mean P1 amplitude than the 1 kHz tone burst (p=0.001). Conclusions: The CAEP latencies recorded with the speech syllable became shorter with age. While both tone-burst and speech stimuli were appropriate for recording the CAEP, significantly bigger amplitudes were found in speech-evoked CAEP. The preliminary normative CAEP data provided in the present study may be beneficial for clinical and research applications in Malaysian children.

Effects of Age and Type of Stimulus on the Cortical Auditory Evoked Potential in Healthy Malaysian Children

  • Mukari, Siti Zamratol-Mai Sarah;Umat, Cila;Chan, Soon Chien;Ali, Akmaliza;Maamor, Nashrah;Zakaria, Mohd Normani
    • Journal of Audiology & Otology
    • /
    • 제24권1호
    • /
    • pp.35-39
    • /
    • 2020
  • Background and Objectives: The cortical auditory evoked potential (CAEP) is a useful objective test for diagnosing hearing loss and auditory disorders. Prior to its clinical applications in the pediatric population, the possible influences of fundamental variables on the CAEP should be studied. The aim of the present study was to determine the effects of age and type of stimulus on the CAEP waveforms. Subjects and Methods: Thirty-five healthy Malaysian children aged 4 to 12 years participated in this repeated-measures study. The CAEP waveforms were recorded from each child using a 1 kHz tone burst and the speech syllable /ba/. Latencies and amplitudes of P1, N1, and P2 peaks were analyzed accordingly. Results: Significant negative correlations were found between age and speech-evoked CAEP latency for each peak (p<0.05). However, no significant correlations were found between age and tone-evoked CAEP amplitudes and latencies (p>0.05). The speech syllable /ba/ produced a higher mean P1 amplitude than the 1 kHz tone burst (p=0.001). Conclusions: The CAEP latencies recorded with the speech syllable became shorter with age. While both tone-burst and speech stimuli were appropriate for recording the CAEP, significantly bigger amplitudes were found in speech-evoked CAEP. The preliminary normative CAEP data provided in the present study may be beneficial for clinical and research applications in Malaysian children.

K-ToBI (Korean ToBI) Labelling Conventions (Version 3.0)

  • Juo, Suo-Ah
    • 음성과학
    • /
    • 제7권1호
    • /
    • pp.143-169
    • /
    • 2000
  • This chapter presents an overview of Korean intonational structure and proposes a revised version of K -ToBI (Korean TOnes and Break Indices), a prosodic transcription convention for Seoul Korean. In the new version of K-ToBI, a tone tier is separated into two tiers: a phonological tone tier and a phonetic tone tier. A phonological tone tier labels tones marking the prosodic structure of an utterance, and a phonetic tone tier labels individual tones of an AP and an IP conforming to the surface pitch contour. Labelling surface tonal patterns will provide us data to test the underlying tonal patterns and to build phonetic implementation rules.

  • PDF

A Relationship of Tone, Consonant, and Speech Perception in Audiological Diagnosis

  • Han, Woo-Jae;Allen, Jont B.
    • 한국음향학회지
    • /
    • 제31권5호
    • /
    • pp.298-308
    • /
    • 2012
  • This study was designed to examine the phoneme recognition errors of hearing-impaired (HI) listeners on a consonant-by-consonant basis, to show (1) how each HI ear perceives individual consonants differently and (2) how standard clinical measurements (i.e., using a tone and word) fail to predict these differences. Sixteen English consonant-vowel (CV) syllables of six signal-to-noise ratios in speech-weighted noise were presented at the most comfortable level for ears with mild-to-moderate sensorineural hearing loss. The findings were as follows: (1) individual HI listeners with a symmetrical pure-tone threshold showed different consonant-loss profiles (CLPs) (i.e., over a set of the 16 English consonants, the likelihood of misperceiving each consonant) in right and left ears. (2) A similar result was found across subjects. Paired ears of different HI individuals with identical pure-tone threshold presented different CLPs in one ear to the other. (3) Paired HI ears having the same averaged consonant score demonstrated completely different CLPs. We conclude that the standard clinical measurements are limited in their ability to predict the extent to which speech perception is degraded in HI ears, and thus they are a necessary, but not a sufficient measurement for HI speech perception. This suggests that the CV measurement would be a useful clinical tool.

인공와우이식 아동의 운율 특성 - 발화속도와 억양기울기를 중심으로 - (The Prosodic Characteristics of Children with Cochlear Implants with Respect to Speech Rate and Intonation Slope)

  • 오순영;성철재;최은아
    • 말소리와 음성과학
    • /
    • 제3권3호
    • /
    • pp.157-165
    • /
    • 2011
  • This study investigated speech rate and intonation slope (least square method; F0, quarter-tone) in normal and CI children's utterances. Each group consisted of 12 people and were divided into groups of children with CI operation (before 3;00), children with CI operation (after 3;00), and normal children. Materials are composed of four kinds of grammatical dialogue sentences which are lacking in respect. Given three groups as independent variables and both speech rate and intonation slope as dependent variables, a one-way ANOVA showed that normal children had faster speech rates and steeper intonation slopes than those of the CI group. More specifically, there was a statistically significant speech rate difference between normal and CI children in all of the sentential patterns but imperative form (p<.01). Additionally, F0 and qtone slope observed in sentential final word showed a significant statistical difference between normal and CI children in imperative form (f0: p<.01; q-tone: p<.05).

  • PDF

F0 변화율로 본 한국어 억양 패턴의 음향 특성 (Korean Intonation Patterns from the Viewpoint of F0 Percentage Change)

  • 이지연;이호영
    • 말소리와 음성과학
    • /
    • 제5권1호
    • /
    • pp.123-130
    • /
    • 2013
  • Previous researches on Korean intonation have been mainly focused on $F_0$ target frequencies, $F_0$ slope, and the duration of intonation patterns. This study investigated Korean intonation patterns, both boundary and phrasal tones, in relation to the $F_0$ percentage change between pitch targets. We measured the percentage change between the pitch targets of both boundary and phrasal tones. Additionally, the $F_0$ change between the preceding pitch target and the first pitch target of the boundary tone and the $F_0$ targets of the sequence of two LH phrasal tones ('LH + LH') were also measured. Two phrasal tones, LHLH and HLH, were compared with 'LH + LH' and the 'HLH' in the LHLH pattern respectively. We found that the percentage change between pitch targets in the phrasal tone is fixed to some extent. This helped explain why the slope of the phrasal tone is closely related to the number of syllables and the duration of the phrasal tone as discussed in previous studies. Since we analyzed the intonation patterns with the utterances from a large speech corpus, the results of this paper are expected to be used in building a larger annotated corpus of Korean.

음성으로부터 감성인식 요소분석 (Analyzing the element of emotion recognition from speech)

  • 심귀보;박창현
    • 한국지능시스템학회논문지
    • /
    • 제11권6호
    • /
    • pp.510-515
    • /
    • 2001
  • 일반적으로 음성신호로부터 사람의 감정을 인식할 수 있는 요소는(1)대화의 내용에 사용한 단어, (2)톤 (tore), (3)음성신호의 피치(Pitch), (4)포만트 주파수(Formant Frequencey)그리고 (5)말의 빠르기(Speech Speed)(6)음질(Voice Quality)등이다. 사람의 경우는주파수 같은 분석요소 보다 톤과 단어 빠르기, 음질로 감정을 받아들이게 되는것이 자연스러운 방법이므로 당연히 후자의 요소들이 감정을 분류하는데 중요한 인자로쓰일 수있다. 그리고, 종래는 주로 후자의 효소들을 이용하였는데, 기계로써 구현하기 위해서는 포만트 주파수를 사용할 수있게 되는것이 도움이 된다. 그러므로, 본 연구는 음성 신호로부터 피치와 포만트, 그리고 말의 빠르기 등을 이용하여 감성인식시스템을 구현하는것을 목표로 연구를 진행하고 있으며, 그 1단계 연구로서 본 논문에서는 화가 나서 내뱉는 말을 기반으로 하여 화난 감정의 독특한 특성을 찾아내었다.

  • PDF

The Phonetic Realization of High Tone in North Kyungsang Korean

  • Chang, Woo-Hyeok
    • 음성과학
    • /
    • 제11권3호
    • /
    • pp.37-54
    • /
    • 2004
  • The main goal of this study is to examine the current issue of the deletion of high tone vs. the downstep or upstep of high tone in North Kyungsang Korean (NKK). In this phonetic experiment, five native speakers of North Kyungsang Korean participated and two categories, such as compounds and two-word phrases were included as a test material. This experiment shows that when the first word belongs to the nonfinal class, the high tone of the second word is overwhelmingly deleted. When the first word belongs to the final class, the high tone of it is also overwhelmingly deleted. It is thus concluded that when two words are combined into a phrase, the peak of one word retains, whereas the peak of the other is deleted. It is confirmed that a single high tone prominence in a phonological phrase in NKK is not due to the processes of down step or upstep but the deletion process.

  • PDF