• 제목/요약/키워드: Formant frequency

검색결과 183건 처리시간 0.027초

Gender difference in speech intelligibility using speech intelligibility tests and acoustic analyses

  • Kwon, Ho-Beom
    • The Journal of Advanced Prosthodontics
    • /
    • 제2권3호
    • /
    • pp.71-76
    • /
    • 2010
  • PURPOSE. The purpose of this study was to compare men with women in terms of speech intelligibility, to investigate the validity of objective acoustic parameters related with speech intelligibility, and to try to set up the standard data for the future study in various field in prosthodontics. MATERIALS AND METHODS. Twenty men and women were served as subjects in the present study. After recording of sample sounds, speech intelligibility tests by three speech pathologists and acoustic analyses were performed. Comparison of the speech intelligibility test scores and acoustic parameters such as fundamental frequency, fundamental frequency range, formant frequency, formant ranges, vowel working space area, and vowel dispersion were done between men and women. In addition, the correlations between the speech intelligibility values and acoustic variables were analyzed. RESULTS. Women showed significantly higher speech intelligibility scores than men and there were significant difference between men and women in most of acoustic parameters used in the present study. However, the correlations between the speech intelligibility scores and acoustic parameters were low. CONCLUSION. Speech intelligibility test and acoustic parameters used in the present study were effective in differentiating male voice from female voice and their values might be used in the future studies related patients involved with maxillofacial prosthodontics. However, further studies are needed on the correlation between speech intelligibility tests and objective acoustic parameters.

외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석 (Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation)

  • 김봉현
    • 한국정보통신학회논문지
    • /
    • 제17권8호
    • /
    • pp.1955-1961
    • /
    • 2013
  • 부비동은 얼굴에서 뼈 속에 존재하는 공기로 가득 찬 빈 공간이다. 그러나 부비동에 지속적으로 염증이 생기고 고름이 차면 축농증으로 발병하여 두통과 무기력증을 호소하고 음성의 변화를 가져온다. 따라서 본 논문에서는 외부 자극을 통해 부비동의 변화를 음성분석 요소로 측정하여 부비동 관련 질환을 예측하는 연구와 전두동, 사골동, 상악동, 접형동으로 구성된 부비동의 영역별 기능을 분석하는 연구를 수행하였다. 이를 위해 부비동 영역에 냉찜질 자극을 시행하고 자극 전과 후의 음성에 대한 포먼트주파수를 측정하여 상호간의 상관성 분석을 통해 외부 자극이 부비동에 미치는 영향을 분석하였다.

편도외 농양 환자의 발화시 조음 및 음성의 변화 (The Acoustic Characteristics of Articulation and Phonation in Peritonsillar Abscess)

  • 최현진;송윤경;여장옥;허세형;진성민
    • 대한후두음성언어의학회지
    • /
    • 제19권2호
    • /
    • pp.133-135
    • /
    • 2008
  • Background and Objectives: The voice changes can occur in peritonsillar abscess and the labeling of this changes as a "muffled voice". The aim of this study was to investigate the changes in acoustic feature of voice before and after treatment in patients with peritonsillar abscess. Materials and Method: 12 patients with peritonsillar abscess were enrolled in the study. Acoustic analysis on sustained Korean vowels /a/, /i/ and /u/ were performed before and after treatment. Results: In patients with peritonsillar abscess, the first formant frequency (F1) and second formant frequency (F2) of /a/ were decreased. There was tendency of articulation of back-low vowel /a/ as back-high vowel /u/. F1 of /i/ and /u/ were increased, while F2 were decreased. There was tendency of articulation of front-high vowel /i/ as back-low vowel /a/. The third, forth, fifth formant frequency (F3, F4, F5) of /a/, /i/ and /u/ were decreased although statistically not significant. Conclusion: The anatomical and functional changes of oropharynx by peritonsillar abscess can cause changes in resonance and speech quality. We suggest that these changes could be the cause of 'muffled voice' in patients of peritonsillar abscess.

  • PDF

직.간접흡연 환경에서의 성대 및 음형대 변화에 대한 음성 분석학적 연구 (A Study on Voice Analytical the Vocal Cord and Formant Change in the Smoking and Secondhand Smoking Environments)

  • 김봉현;조동욱
    • 한국통신학회논문지
    • /
    • 제36권6B호
    • /
    • pp.720-727
    • /
    • 2011
  • 웰빙이 새로운 미래 사회적 이슈로 부각되면서 건강관리 및 유지에 대한 현대인들의 관심이 증대되고 있다. 특히, 흡연에 대한 좋지 않은 인식이 높아지면서 대대적인 금연 운동이 확산되고 있는 실정이다. 흡연은 인체의 호흡기와 순환기 등에 많은 악영향을 미치며 직접적인 흡연뿐만 아니라 간접흡연도 동일한 증상이 유발되는 치명적인 행위로 인식되고 있다. 따라서 본 논문에서는 직접흡연과 간접흡연 환경에서 성대 및 음형대에 미치는 영향을 음성 분석학적 요소 기술의 적용을 통해 비교, 분석하는 연구를 수행하였다. 이를 위해 20대 남성을 대상으로 흡연자와 비흡연자로 피실험자 집단을 구성하고 직 간접흡연 전과 후의 음성을 수집하여 Pitch, Jitter, Shimmer 및 5~8 Formant Frequency를 적용한 실험 결과를 추출, 분석하는 연구를 수행하였다.

고음질 음성합성을 위한 LSP를 이용한 피치검출 성능향상에 관한 연구 (A Study on the Pitch Extraction Improvement Using LSP for the Synthesis of High Speech Quality)

  • 서지호;김종국;배명진
    • 한국음향학회지
    • /
    • 제29권1호
    • /
    • pp.69-75
    • /
    • 2010
  • 본 논문에서는 스펙트럼 신호를 최대한 평탄화시킴으로써 포만트의 영향을 제거하고 고조파 성분을 분리해 내어 이를 피치검출에 사용한다. 스펙트럼 신호로부터 포만트의 영향과 천이진폭의 영향을 제거하기 위해 주파수 대역을 LSP(Line Spectrum Pair)를 기준으로 서브밴드로 나누고 각각의 서브밴드에서 기울기를 취한 후에 역기울기로 스펙트럼을 보상한다. 실험 결과 제안한 방법이 LPC법, Lifter법, Cepstrum법을 이용하여 평탄화시킬 때 보다 평탄화 정도가 좋아짐을 알 수 있다. 또한 제안한 방법 이외에 가장 양호한 성능을 나타낸 LPC법을 이용하여 피치를 구했을 때 제안한 방법의 조오율이 평균 1.30% 감소하였다. 또한 제안한 방법은 잡음을 부가한 음성의 경우에도 낮은 에러율을 보여 배경잡음에 강하다는 것을 알 수 있었다.

음성특성 학습 모델을 이용한 음성인식 시스템의 성능 향상 (Improvement of Speech Recognition System Using the Trained Model of Speech Feature)

  • 송점동
    • 정보학연구
    • /
    • 제3권4호
    • /
    • pp.1-12
    • /
    • 2000
  • 음성은 특성에 따라 고음성분이 강한 음성과 저음성분이 강한 음성으로 구분할 수 있다. 그러나 이제까지 음성인식의 연구에 있어서는 이러한 특성을 고려하지 않고, 인식기를 구성함으로써 상대적으로 낮은 인식률과 인식모델을 구성할 때 많은 데이터를 필요로 하고 있다. 본 논문에서는 화자의 이러한 특성을 포만트 주파수를 이용하여 구분할 수 있는 방법을 제안하고, 화자음성의 고음과 저음특성을 반영하여 인식모델을 구성한 후 인식하는 방법을 제안한다. 한국어에서 가능한 47개의 모노폰을 이용하여 인식모델을 구성하였으며, 여성과 남성 각각 20명의 음성을 이용하여 인식모델을 학습시켰다. 포만트 주파수를 추출하여 구성한 포만트 주파수 테이불과 피치 정보값을 이용하여 음성의 특성을 구분한 후, 음성특성에 따라 학습된 인식모델을 이용하여 인식을 수행하였다. 본 논문에서 제안한 시스템을 이용하여 실험한 결과 기존의 방법보다 인식률이 향상됨을 보였다.

  • PDF

일본인 학습자의 한국어 모음 발음에 대한 연구 (An Acoustic Study of the Pronunciation of Korean Vowels Uttered by Japanese Speakers)

  • 조성문
    • 음성과학
    • /
    • 제11권3호
    • /
    • pp.69-81
    • /
    • 2004
  • The purpose of this experimental study was to investigate characteristics of Korean vowels uttered by Japanese speakers. Eight Korean Vowels were uttered three times by ten male Korean and Japanese, female Korean and Japanese, respectively. Formant Frequencies were measured from sound spectrograms made by the Pitch Works. Results showed that female Japanese speakers uttered Korean vowels more similar to those uttered by Korean native speakers than did male Japanese speakers.. In particular, male Japanese speakers have articulatory problems pronouncing the back vowels(/ㅓ/, /ㅡ/, /ㅜ/). It appears that the width of male speakers' articulatory movements is comparatively narrower than those of female speakers.

  • PDF

A Study on Comparison of Pronunciation Accuracy of Soprano Singers

  • Song, Uk-Jin;Park, Hyungwoo;Bae, Myung-Jin
    • International journal of advanced smart convergence
    • /
    • 제6권2호
    • /
    • pp.59-64
    • /
    • 2017
  • There are three sorts of voices of female vocalists: soprano, mezzo-soprano, and contralto according to the transliteration. Among them, the soprano has the highest vocal range. Since the voice is generated through the human vocal tract based on the voice generation model, it is greatly influenced by the vocal tract. The structure of vocal organs differs from person to person, and the formants characteristic of vocalization differ accordingly. The formant characteristic refers to a characteristic in which a specific frequency band appears distinctly due to resonance occurring in each vocal tract in the vocal process. Formant characteristics include personality that occurs in the throat, jaw, lips, and teeth, as well as phonological properties of phonemes. The first formant is the throat, the second formant is the jaw, the third formant and the fourth formant are caused by the resonance phenomenon in the lips and the teeth. Among them, pronunciation is influenced not only by phonological information but also by jaws, lips and teeth. When the mouth is small or the jaw is stiff when pronouncing, pronunciation becomes unclear. Therefore, the higher the accuracy of the pronunciation characteristics, the more clearly the formant characteristics appear in the grammar spectrum. However, many soprano singers can not open their mouths because their jaws, lips, teeth, and facial muscles are rigid to maintain high tones when singing, which makes the pronunciation unclear and thus the formant characteristics become unclear. In this paper, in order to confirm the accuracy of the pronunciation characteristics of soprano singers, the experimental group was selected as the soprano singers A, B, C, D, E of Korea and analyzed the grammar spectrum and conducted the MOS test for pronunciation recognition. As a result, soprano singer B showed a clear recognition from F1 to F5 and MOS test result showed the highest recognition rate with 4.6 points. Soprano singers A, C, and D appear from F1 to F3, but it was difficult to find formants above 2kHz. Finally, the soprano singer E had difficulty in finding the formant as a whole, and MOS test showed the lowest recognition rate at 2.1 points. Therefore, we confirmed that the soprano singer B, which exhibits the most distinct formant characteristics in the grammar spectrum, has the best pronunciation accuracy.

인공와우 이식 시기에 따른 모음의 음향음성학적 특성 (Acoustic Characteristics of Some Vowels Produced by the CI Children of Various Age Groups)

  • 김고은;고도흥
    • 음성과학
    • /
    • 제14권4호
    • /
    • pp.203-212
    • /
    • 2007
  • This study was to compare some acoustic characteristics of vowels produced by children with cochlear implant (CI) and the children with normal hearing. 20 subjects under ten years old were further classified into two groups (one group of CI children under four years old and the other group of CI children over four years old). For the normal hearing group, 20 subjects are participated in the experiment. Some acoustic parameters including fundamental frequency (F0) and formant frequencies (F1, F2) were measured in the two groups according to the age of cochlear implant operation. For the CI group, three comer vowels (/a/, /i/, /u/) were recorded five times in isolation and analyzed with Multi-Speech (Kay Elemetrics, model 3700), and two independent t-tests on their formant data were conducted using SPSS 11.5. The result showed that the implanted group over four years had a significant difference in F0 and F1 comparing with the implanted group under four years of age as well as the normal hearing group. Those values of the children with the implanted group under four years old were closer to those of the children with the normal hearing. As to the F2, there was no significant difference among implanted groups. However, it was shown that the vowel space for the implanted groups regardless the operation age indicated much smaller than that for the normal hearing children. This acoustic results suggest that CI surgery would be much more effective if it is done under the age of four years old.

  • PDF

성대형태 및 음향발현에서 성악 발성 및 판소리 발성의 비교 연구 (A Comparative Study of Western Singer's Voice and a Pansori Singer's Voice Based on Glottal Image and Acoustic Characteristics)

  • 김선숙
    • 음성과학
    • /
    • 제11권2호
    • /
    • pp.165-177
    • /
    • 2004
  • Western singers voice have been studied in music science since the early 20th century. However, Korean traditional singers voice have not yet been studied scientifically. This study is to find the physiological and acoustic characteristics of Pansori singers voices. Western singers participated for comparative purposes. Ten western singers and ten Pansori singers participated in this study. The subjects spoke and sung seven simple vowels /a, e, i, o, u, c, w/. An analysis of Glottal image was done by Scope View and acoustic characteristics of speech and singing voice were analyzed by CSL. The results are as follows: (1) Glottal gestures of Pansori singers showed asymmetric vocal folds. (2) Singing vowel formants of Pansori singers showed breathiness based on Spectrogram. (3) Music formant of western singers appeared in around 3kHz area, however, Pansori singers formant appeared in low frequency area. Modulation of vibrato showed 6 frequency per sec in case of western singers. Pansori singers showed no deep modulation of vibrato on spectrogram.

  • PDF