• Title/Abstract/Keywords: Formant Frequency

183 search results (processing time: 0.021 s)

폐쇄자음군의 폐쇄구간 축소에 따른 위치성 지각에 대한 재해석 (Reinterpretation of the Perception of Place Cues in the Reduced Closure Duration of Stop Consonant Clusters)

  • 이석재
    • 대한음성학회지:말소리 / No. 45 / pp. 1-14 / 2003
  • This paper criticizes S. Kim (1992), claiming that the perception of place cues in reduced stop consonant clusters ('reducing' means 'cutting off' the acoustic silence in stop clusters) depends largely on acoustic characteristics such as the formant transition and the noise frequency distribution of the stop burst, rather than on the closure duration advocated by S. Kim (1992). The claim is based on a perception test conducted with 111 stimuli and 10 subjects. The finding is that, when the closure duration is cut down to the point where only one stop is perceived, the place of the second stop in the cluster, not the first, is perceived in most cases, regardless of the places of the two stops. It is likely that the place cues of the stop in the prevocalic position mask those in the postvocalic position.


음장과 외이도 내부에서의 음성 비교 (The comparison of the voice between the free field and the external auditory canal)

  • 허승덕;김리석;고도흥;이정학
    • 음성과학 / Vol. 7, No. 4 / pp. 83-90 / 2000
  • The purpose of this study was to examine some acoustic characteristics in the ear canal. It was assumed that a sound outside the external auditory canal could differ from the sound inside it. The acoustic signals were captured by a probe microphone placed within 1 cm of the tympanic membrane, and a reference microphone was placed over the upper pinna. Three vowels, /a/, /i/, and /u/, were recorded from a normal adult male speaker. Parameters such as the formant frequencies ($F_1 \sim F_5$) and the peak intensity were measured using a speech analyser, PCquirer. It was found that the entrance of the external auditory canal functions as a narrowing point for speech arriving from the free field. The results show that the acoustic characteristics were changed for speech discrimination rather than speech perception.


대구 방언 단모음의 세대 간 차이에 대한 음향 음성학적 연구 (An Acoustic Study on the Generational Difference of the Monophthongs in the Daegu Dialect)

  • 장혜진;신지영
    • 대한음성학회지:말소리 / No. 57 / pp. 15-30 / 2006
  • This paper investigates differences between generations in the vowel system of the Daegu dialect in terms of F1 and F2 of the monophthongs. Three groups of subjects participated in the study: 20 female native speakers of the Daegu dialect (10 in their 20s and 10 in their 40s) and 10 female native speakers of the Seoul dialect as a control group. It has been assumed that the Daegu dialect has six vowels. However, the younger generation appears to have seven vowels, unlike the older generation. The results showed that the vowel system of the Daegu dialect differs between generations: speakers in their 40s have six vowels, while speakers in their 20s have seven. These differences seem to be attributable to the influence of the Seoul dialect.


포먼트 주파수 특성에 근거한 신장 질환과 순음(層音)간의 비교·분석 (A Comparison and Analysis of Kidney Diseases and a Labial Sound Based on Formant Frequency Extraction)

  • 김봉현;가민경;이세환;곽지현;조동욱
    • 한국정보처리학회:학술대회논문집 / Korea Information Processing Society 2008 Fall Conference / pp. 137-140 / 2008
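  • Modern society demands a great deal of physical and mental activity, and as a result stress is increasing and syndromes of unknown cause are becoming more widespread. In particular, fatigue accumulated through stress affects the concentration and circulation of the blood, which can cause the condition of the kidneys to deteriorate. It is therefore of the utmost importance to determine early whether the kidneys are abnormal and to take appropriate measures. To this end, this paper forms subject groups of patients with kidney disease and of healthy individuals, carries out experiments that produce morphological and numerical analyses of voice information, and compares and analyzes the resulting values. Ultimately, the experiments aim to establish the relationship between the kidneys and the voice.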

A Study on Correcting Korean Pronunciation Error of Foreign Learners by Using Supporting Vector Machine Algorithm

  • Jang, Kyungnam;You, Kwang-Bock;Park, Hyungwoo
    • International Journal of Advanced Culture Technology / Vol. 8, No. 3 / pp. 316-324 / 2020
  • Anyone who has learned a foreign language has experienced how difficult it is to pronounce a new language that differs from one's native language. The goal of many foreigners who want to learn Korean is to speak Korean as well as their native language so that they can communicate smoothly. However, the vocal habits of each native language also appear in Korean pronunciation, which prevents accurate transmission of information. In this paper, the pronunciation of Chinese learners was compared with that of Koreans. For the comparison, the fundamental frequency of the speech signal and its variation were examined, and the spectrogram was analyzed. The formant frequencies, known as the resonant frequencies of the vocal tract, were calculated. Based on these characteristic parameters, a Support Vector Machine classifier was used to distinguish the pronunciation of Koreans from that of Chinese learners. In particular, the linguistic proposition was scientifically proved by examining the Korean pronunciation of /ㄹ/, which Chinese speakers have difficulty pronouncing.

소음이 외국어 학습에 미치는 영향 (Noise Effects on Foreign Language Learning)

  • 임은수;김현기;김병삼;김종교
    • 음성과학 / Vol. 6 / pp. 197-217 / 1999
  • In a noisy class, the acoustic-phonetic features of the teacher's speech and the perceptual features of the learners change compared with a quiet environment. Acoustic analyses were carried out on a set of French monosyllables consisting of 17 consonants and three vowels, /a, e, i/, produced by one male speaker talking in quiet and in 50, 60, and 70 dB SPL of masking noise presented over headphones. The results of the acoustic analyses showed consistent differences in the energy and formant center frequency amplitude of consonants and vowels, the $F_1$ frequency of vowels, and the duration of voiceless stops, suggesting an increase in vocal effort. The perceptual experiments, in which 18 female undergraduate students learning French served as subjects, were conducted in quiet and in 50 and 60 dB of masking noise. The identification scores for consonants were higher in Lombard speech than in normal speech, suggesting that the speaker's vocal effort helps to overcome the masking effect of noise. With increasing noise level, the perceptual responses to the French consonants tended to become more complex, and the subjective reaction scores for the noise, rated with vocabulary representing 'unpleasant' sensations, tended to be higher. From the viewpoint of L2 (second language) acquisition, the influence of L1 (first language) on L2 observed in the perceptual results supports the interference theory.


CELP보코더에서 Line Spectrum Frequency를 이용한 고속 피치검색 (A New Fast Pitch Search Algorithm using Line Spectrum Frequency in the CELP Vocoder)

  • 배명진;손상목;유하영;변경진
    • 한국음향학회지 / Vol. 15, No. 2 / pp. 90-94 / 1996
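  • Code-excited linear prediction (CELP) speech coders perform well even at low bit rates of 4.8 kbps and below. The drawback of CELP-type coders is that they require a large amount of computation. In this paper, we propose a new pitch search method that reduces complexity while maintaining the speech quality of the CELP vocoder. The method finds a preliminary pitch using the first formant obtained in the formant filter stage of the CELP vocoder and performs the pitch search only within the preliminary pitch range. Applying the proposed method to the CELP vocoder reduced complexity by about 64% compared with the conventional method.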

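
The idea of restricting the pitch search to a window around a preliminary estimate can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the abstract does not say how the preliminary pitch is derived from the first formant, so `preliminary_lag` is simply taken as a given input, and a plain normalized autocorrelation stands in for the closed-loop search of a real CELP coder. All function names, the window half-width, and the test signal are assumptions made for this example.

```python
import math

def normalized_correlation(signal, lag):
    """Correlation coefficient between the signal and its copy delayed by `lag`."""
    num = den = cur = 0.0
    for n in range(lag, len(signal)):
        num += signal[n] * signal[n - lag]
        den += signal[n - lag] ** 2
        cur += signal[n] ** 2
    if den == 0.0 or cur == 0.0:
        return 0.0
    return num / math.sqrt(den * cur)

def best_lag(signal, lag_min, lag_max):
    """Exhaustive search: the lag in [lag_min, lag_max] with the highest correlation."""
    return max(range(lag_min, lag_max + 1),
               key=lambda lag: normalized_correlation(signal, lag))

def restricted_pitch_search(signal, preliminary_lag, half_width, lag_min, lag_max):
    """Search only lags within `half_width` of a preliminary estimate (clamped)."""
    lo = max(lag_min, preliminary_lag - half_width)
    hi = min(lag_max, preliminary_lag + half_width)
    if lo > hi:
        raise ValueError("preliminary lag window lies outside the allowed lag range")
    return best_lag(signal, lo, hi)

# Synthetic voiced-like frame with a period of 50 samples (assumed test signal).
frame = [math.sin(2 * math.pi * n / 50) + 0.5 * math.sin(4 * math.pi * n / 50)
         for n in range(320)]
full = best_lag(frame, 20, 90)                        # evaluates 71 candidate lags
fast = restricted_pitch_search(frame, 48, 5, 20, 90)  # evaluates 11 candidate lags
```

In this toy setting both searches select the same lag, while the restricted search evaluates far fewer candidates. The saving in a real coder depends on how narrow and how reliable the preliminary range is.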

비강 공명이 한국어 모음에 미치는 음향학적 영향 (Effect of the Nasal Cavity Resonance on the Acoustic Characteristics of Korean Vowels)

  • 성명훈;오승하;강명구;고태용;김광현;김진영
    • 대한후두음성언어의학회지 / Vol. 4, No. 1 / pp. 24-32 / 1991
  • Cleft palate or velopharyngeal incompetence causes many disorders and disabilities affecting speech transmission, including distortion, substitution, and the nasalization of vowels. Nasalized vowels are produced primarily by lowering the velum, which opens a side passage for the air flow through the nasal cavity. These abnormal movements give rise to complex modifications of the physical properties of the sound and of the sound spectrum. The authors employed the Sonagraph® as a sound analyzer in order to ascertain the features that characterize the nasalization of vowels. Twenty healthy Korean adult male volunteers were analyzed under artificial conditions of anterior and posterior nasal obstruction and of velopharyngeal incompetence (VPI). The results were as follows: 1) Fundamental frequency was not changed by nasal obstruction or velopharyngeal incompetence. 2) There was no significant difference in formant intensity between normal and nasal vowels. 3) In VPI, a decrease in the frequency of $F_2$ was observed in the /e/ and /i/ vowels (p<0.001). 4) In VPI, $F_2$ was frequently missing in the /o/ and /u/ vowels. 5) In the consonant spectra of VPI, the 'release burst' was usually not observed.


음성합성시스템을 위한 음색제어규칙 연구 (A Study on Voice Color Control Rules for Speech Synthesis System)

  • 김진영;엄기완
    • 음성과학 / Vol. 2 / pp. 25-44 / 1997
  • When listening to the various speech synthesis systems developed and used in our country, we find that although the quality of these systems has improved, they lack naturalness. Moreover, since the voice color of these systems is limited to a single recorded speech DB, another speech DB must be recorded to create a different voice color. 'Voice color' is an abstract concept that characterizes voice personality, so speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for a text-to-speech system that produces natural and varied voice types in synthetic speech. In order to find such rules from natural speech, the glottal source parameters and the frequency characteristics of the vocal tract were studied for several voice colors. In this paper, voice colors were catalogued as deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. The LF-model was used as the voice source model, and the formant frequencies, bandwidths, and amplitudes were used for the frequency characteristics of the vocal tract. These acoustic parameters were tested through multiple regression analysis to obtain the general relation between the parameters and the voice colors.


정현파 모델을 이용한 2.4kbps 음성부호화 알고리즘 (2.4kbps Speech Coding Algorithm Using the Sinusoidal Model)

  • 백성기;배건성
    • 한국통신학회논문지 / Vol. 27, No. 3A / pp. 196-204 / 2002
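  • STC (Sinusoidal Transform Coding) is a speech coding method that models the spectral peaks of the speech signal as sinusoids in the frequency domain and synthesizes speech from them. Instead of using every spectral peak, low-bit-rate STC synthesizes speech from the magnitudes of the spectral envelope at the fundamental frequency and its harmonics, together with the corresponding phases. This paper proposes a 2.4 kbps speech coding algorithm based on the sinusoidal model. Pitch information is estimated using the mean squared error between speech synthesized from all spectral peaks and speech synthesized from a selected frequency and its harmonics. Phase information is obtained from the onset time, which indicates the starting point of the excitation pulse, and from the phase of the vocal tract model transfer function. Magnitude information is estimated using the SEEVOC algorithm and linear prediction coefficients. The experiments confirmed that the spectral characteristics of the synthesized speech retain most of the formant information of the original speech and that the phase closely follows that of the original. An informal MOS (Mean Opinion Score) test was conducted to evaluate the quality of the synthesized speech, and a quality of roughly MOS 3.1 or higher was obtained in comparison with 2.0 kbps HVXC.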
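
The harmonic synthesis step described above can be sketched for a single frame as s[n] = Σ_k A_k cos(2π k f0 n / fs + φ_k). This is only an illustration under stated assumptions: in the paper the magnitudes come from the spectral envelope (SEEVOC and linear prediction) and the phases from the onset time and vocal tract model, whereas here `amplitudes` and `phases` are hypothetical inputs supplied by the caller, and the function name, sample rate, and frame length are chosen for the example.

```python
import math

def synthesize_harmonics(f0_hz, amplitudes, phases, sample_rate=8000, num_samples=160):
    """Sum of harmonically related sinusoids for one frame.

    amplitudes[k-1] and phases[k-1] belong to the k-th harmonic of f0_hz.
    """
    if len(amplitudes) != len(phases):
        raise ValueError("amplitudes and phases must have the same length")
    frame = []
    for n in range(num_samples):
        t = n / sample_rate  # time of sample n in seconds
        sample = 0.0
        for k, (a, phi) in enumerate(zip(amplitudes, phases), start=1):
            sample += a * math.cos(2.0 * math.pi * k * f0_hz * t + phi)
        frame.append(sample)
    return frame

# Example: a 100 Hz fundamental plus one harmonic, both with zero phase.
demo = synthesize_harmonics(100.0, [1.0, 0.5], [0.0, 0.0])
```

With a 100 Hz fundamental at 8000 samples per second, the frame repeats every 80 samples, which gives a quick sanity check on the output.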