• 제목/요약/키워드: Female speakers

검색결과 124건 처리시간 0.025초

Growth curve modeling of nucleus F0 on Korean accentual phrase

  • Yoon, Tae-Jin
    • 말소리와 음성과학
    • /
    • 제9권3호
    • /
    • pp.17-23
    • /
    • 2017
  • The present study investigates the effect of Accentual Phrase on F0 using a subset of large-scale corpus of Seoul Korean. Four syllable words which were neither preceded nor followed by silent pauses were presumed to be canonical exemplars of Accentual Phrases in Korean. These four syllable words were extracted from female speakers' speech samples. Growth curve analyses, combination of regression and polynomial curve fitting, were applied to the four syllable words. Four syllable words were divided into four groups depending on the categorical status of the initial segment: voiceless obstruents, voiced obstruents, sonorants, and vowels. Results of growth curve analyses indicate that initial segment types have an effect on the F0 (in semitone) in the nucleus of the initial syllable, and the cubic polynomial term revealed that some of the medial low tones in the 4 syllable words may be guided by the principle of contrast maximization, while others may be governed by the principle of ease of articulation.

서울 방언 어두 폐쇄음의 후속모음 F0 (F0 as a primary cue for signaling word-initial stops of Seoul Korean)

  • 변희경
    • 말소리와 음성과학
    • /
    • 제8권1호
    • /
    • pp.25-36
    • /
    • 2016
  • Previous studies showed that the voice onset time (VOT) of aspirated and lenis stops has been merged, and post-stop fundamental frequency (F0) has emerged as a primary cue to distinguish the two stops in the younger generation and female speech. The purpose of this study is to demonstrate that VOT merger in aspirated and lenis stops occurs after an F0 difference between the two stops becomes stabilized. In other words, unless post-stop F0, which is a redundant feature, is fully developed, it is hard for VOT merger to happen. Females have got a stable F0 difference in stops earlier than males. Therefore, VOT merger could happen, and as a result, females could take the lead in changing from VOT to F0 in initial stops. This study also shows that speakers who acquired F0 as a primary cue use F0 to the full to distinguish lenis stops from two other stops (aspirated and fortis).

벅아이 코퍼스를 이용한 영어 무성파열음의 VOT 연구 (A Study on the Voice Onset Time of English Voiceless Stops in the Buckeye Corpus)

  • 윤규철
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.33-40
    • /
    • 2012
  • The purpose of this paper is to investigate the voice onset time (VOT) of the English voiceless stops [p, t, k] found in the Buckeye Corpus of Conversational Speech [1]. Three young female speakers were chosen for this study and their VOT values were semi-automatically extracted along with other factors. The factors used for the analysis were place of articulation, location in word, syllabic stress, content word or not, word frequency calculated from the corpus, and the speech rate expressed in syllables per second. Results showed that, for the three places of articulation of each speaker, all the factors had a statistically significant effect on the VOT values. This paper has significance in that the materials used for the analysis were from a corpus of spontaneous natural English speech.

Momel을 이용한 한국어의 억양 연구 (A Study on Korean Intonation Using Momel)

  • 김선희;유현지;홍혜진;이호영
    • 대한음성학회지:말소리
    • /
    • 제63호
    • /
    • pp.85-100
    • /
    • 2007
  • This paper aims to propose how to extract intonation patterns using Momel, a pitch stylization algorithm, and to present results of analyzing speech corpora in comparison with those in earlier researches. Two speech corpora are used: one is the sound files obtained from the K-ToBI web site, and the other consists of 80 passages pronounced by 4 speakers (2 male and 2 female). The results show that Momel provides significant pitch targets which can be labeled as H and L tones within prosodic units such as Accentual Phrase (AP) and Intonation Phrase (IP). The resulting AP patterns and IP boundary tone patterns correspond to those in earlier researches. Thus, this study will contribute to the study of intonation as well as to the development of automatic intonation labeling systems.

  • PDF

낮은 차원의 벡터 변환을 통한 음성 변환 (Voice conversion using low dimensional vector mapping)

  • 이기승;도원;윤대희
    • 전자공학회논문지S
    • /
    • 제35S권4호
    • /
    • pp.118-127
    • /
    • 1998
  • In this paper, we propose a voice personality transformation method which makes one person's voice sound like another person's voice. In order to transform the voice personality, vocal tract transfer function is used as a transformation parameter. Comparing with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, mean LPC cepstrum coefficients and mean removed LPC cepstrum modeled by the low dimensional vector are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male and one female speakers. These rules are then applied to 9 sentences uttered by the same persons, and objective evaluation and subjective listening tests for the transformed speech are performed.

  • PDF

발화속도가 경계앞 음절 길이에 미치는 영향 (The Effects of the Speaking Rate on the Duration of Syllable before Boundary)

  • 이순향;구희산
    • 음성과학
    • /
    • 제1권
    • /
    • pp.103-111
    • /
    • 1997
  • The purpose of this study was to investigate the effect of the speaking rate on the duration of syllable before boundary. The materials used were four types of syllable-boundary sequences(Go-'Ga' Boundary-Gu) in a paragraph. The duration of 'Ga' syllables before 4 level of boundary was measured, and all of the measurements were taken from signals and spectrograms made by the $Signalyze^{TM}$ 3.04 for Power Mac 7200. Subjects were six female speakers who read the materials at fast, normal, and slow speed five times. The results show that (1) the slower the speaking rate becomes, the longer the duration of syllable before boundary, (2) the duration rank of syllable before each boundary does not correspond to the level of boundary, eg. at fast speed, = < #, + < $ ; at normal speed, +, #, = < $ ; at slow speed, + < =, #, $, and (3) the syllable before sentence boundary is less influenced than syllable before another boundary.

  • PDF

시공간 패턴인식 신경망에 의한 단어 인식에 관한 연구 (A Study on Recognition of Spoken Numbers Using Spatio-Tempora1 Pattern Recognizer)

  • 박경철;김헌기;이종호
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1993년도 하계학술대회 논문집 A
    • /
    • pp.495-497
    • /
    • 1993
  • This paper presents spoken numbers recognition method using a spatio-temporal network This network is efficient in processing the spectrum sequences of speech patterns as spatio-temporal patterns. The number of windows and channels is experimentally determined. The recognition rate has been improved by experiments done on various parameters. The test data is collected form 10 numbers spoken by 2 male and female speakers. A recognition rate of 80% was obtained on a test set of 50 words.

  • PDF

음성인식 시스템의 성능 향상을 위한 잡음음성의 남성 및 여성화자의 음성식별 (Speech Identification of Male and Female Speakers in Noisy Speech for Improving Performance of Speech Recognition System)

  • 최재승
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2017년도 추계학술대회
    • /
    • pp.619-620
    • /
    • 2017
  • 본 논문에서는 음성인식 알고리즘에 매우 중요한 정보를 제공하는 화자의 성별인식을 위하여 신경회로망을 사용하여 잡음 환경 하에서 남성음성 및 여성음성의 화자를 식별하는 성별인식 알고리즘을 제안한다. 본 논문에서 제안하는 신경회로망은 MFCC의 계수를 사용하여 음성의 각 구간에서 남성음성 및 여성음성의 화자를 인식할 수 있는 알고리즘이다. 실험결과로부터 백색잡음이 중첩된 잡음환경 하에서 음성신호의 MFCC의 특징벡터를 사용함으로써 남성음성 및 여성음성의 화자에 대해서 양호한 성별인식 결과가 구해졌다.

  • PDF

경상방언 대학생들이 발음한 국어 한자어 장단음 분석 (An Analysis of Short and Long Syllables of Sino-Korean Words Produced by College Students with Kyungsang Dialect)

  • 양병곤
    • 말소리와 음성과학
    • /
    • 제7권4호
    • /
    • pp.131-138
    • /
    • 2015
  • The initial syllables of a pair of Sino-Korean words are generally differentiated in their meaning by either short or long durations. They are realized differently by the dialect and generation of speakers. Recent research has reported that the temporal distinction has gradually faded away. The aim of this study is to examine whether college students with Kyungsang dialect made the distinction temporally using a statistical method of Mixed Effects Model. Thirty students participated in the recording of five pairs of Korean words in clear or casual speaking styles. Then, the author measured the durations of the initial syllables of the words and made a descriptive analysis of the data followed by applying Mixed Effects Models to the data by setting gender, length, and style as fixed effects, and subject and syllable as random effects, and tested their effects on the initial syllable durations. Results showed that college students with Kyungsang dialect did not produce the long and short syllables distinctively with any statistically significant difference between them. Secondly, there was a significant difference in the duration of the initial syllables between male and female students. Thirdly, there was also a significant difference in the duration of the initial syllables produced in the clear or casual styles. The author concluded that college students with Kyungsang dialect do not produce long and short Sino-Korean syllables distinctively, and any statistical analysis on the temporal aspect should be carefully made considering both fixed and random effects. Further studies would be desirable to examine production and perception of the initial syllables by speakers with various dialect, generation, and age groups.

영어의 기본모음과 한국인 영어학습자의 영어모음 발화비교 (The comparison of cardinal vowels between Koreans and native English speakers)

  • 강성관;손현성;전병만;김현기
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
    • /
    • pp.71-73
    • /
    • 2007
  • The Purpose of the study is to give Korean-English leaners better knowledge on vowel sounds in their learning English. The traditional description of the cardinal vowel system developed by Daniel Johns in 1917 is not enough to provide English learners with clear ideas in producing native like vowel sounds. For the reason, three Korean-native subjects, one male, one female and one child are chosen to produce 7 cardinal vowels and compare them with native English and American speaker's vowel sounds. The difference of produced vowels sounds is quantified and visualized by employing Sona-match program. The results have been fairly remarkable. Firstly, Korean-English learner's vowel sounds are articulated differently from their intention of vowel production. Secondly, the tongue positions of Koreans are placed slightly more down and forward to the lips than those of English and Americans. However, the front vowel /i/ sound is quite close to English and Americans. Lastly the mid-vowel /${\partial}$/ sound is not produced in any articulations of Korean-native speakers. It is thought that the mid vowel, /${\partial}$/ is a type of a weak sound regarded as 'schwa' which needs a great deal of exposure to the language to acquire a physical skill of articulation.

  • PDF