• Title/Summary/Keyword: 포먼트 분석 (formant analysis)

The implementation of Korean adult's optimal formant setting by Praat scripting (성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여)

  • Park, Jiyeon;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.11 no.4 / pp.97-108 / 2019
  • An automated Praat script was implemented to measure optimal formant frequencies for adults. Here, an optimal formant analysis is taken to be one in which the deviation of formant frequency across the various combinations of two setting parameters (maximum formant and number of formants) is minimal. To increase the reliability of formant analysis, the LPC order should be set differently depending on gender or vowel type. Praat recommends maximum formant settings of 5,000 Hz for males and 5,500 Hz for females, and 5 as the number of formants for both. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies varied significantly across the adapted scripts, especially for the female data. Among the four kinds of scripts, formant plots and statistical results showed that linear_script and qtone_script were more stable and reliable in formant measurement than the others. While linear_script was designed with a linearly increasing formant step in its for-loop, the increment of the formant step in qtone_script followed a quarter-tone scale (base frequency × common ratio ($\sqrt[24]{2}$)). In the formant settings produced by these two algorithms for the front vowels [i, e], the maximum formant was set higher and the number of formants lower than Praat recommends. The back vowels [o, u], by contrast, had a lower maximum formant and a higher number of formants than the standard setting.
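The difference between the two stepping schemes can be sketched as follows. This is a minimal illustration, not the authors' Praat script: it assumes the step is applied to the maximum-formant setting, and the function names and the frequency ranges in the usage note are hypothetical.

```python
# Sketch of linear vs. quarter-tone stepping for candidate settings.
# Assumption: the stepped quantity is the maximum-formant value in Hz.

def linear_steps(start_hz, stop_hz, step_hz):
    """Candidates that grow by a fixed number of Hz per iteration."""
    if step_hz <= 0:
        raise ValueError("step_hz must be positive")
    values = []
    v = start_hz
    while v <= stop_hz:
        values.append(v)
        v += step_hz
    return values


def quarter_tone_steps(base_hz, stop_hz):
    """Candidates that grow by a fixed ratio of 2**(1/24) per iteration.

    Twenty-four quarter-tone steps double the frequency (one octave).
    """
    if base_hz <= 0:
        raise ValueError("base_hz must be positive")
    values = []
    k = 0
    while True:
        v = base_hz * 2 ** (k / 24)  # base frequency x common ratio ** k
        if v > stop_hz:
            break
        values.append(v)
        k += 1
    return values
```

For example, `linear_steps(4500, 6500, 500)` yields five evenly spaced candidates, while `quarter_tone_steps(4000, 8000)` yields 25 candidates whose spacing widens as frequency rises.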

Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms (스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화)

  • Byeon, Haewon
    • Journal of the Korea Convergence Society / v.10 no.9 / pp.193-198 / 2019
  • This study analyzed changes in vowel formants, voice intensity, and fundamental frequency over 11 months using acoustic spectrogram analysis of women diagnosed with amyotrophic lateral sclerosis (ALS). The test words were the vowels /a, i, u/ and the diphthong words /h + ja + da/, /h + wi + da/, and /h + ɰi + da/. Speech data were collected through a word reading task presented on a monitor using the 'Alvin' program, and the recording environment was set to a sampling rate of 11,000 Hz, giving a Nyquist frequency of 5,500 Hz. The recordings were analyzed with spectrograms to measure vowel formants, voice intensity, and fundamental frequency. The analysis showed that fundamental frequency and intensity decreased as ALS progressed, and that the decrease appeared in the formant slope of the diphthongs rather than in the formants of the vowels. This result suggests that the vowel distortion that accompanies ALS progression is due to the decrease of tongue and jaw co morbidity.
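The two quantities mentioned above can be sketched in a few lines. The Nyquist relation is standard; the slope definition (frequency change divided by transition duration) is an assumption, since the abstract does not state how formant slope was computed, and both function names are hypothetical.

```python
def nyquist_hz(sampling_rate_hz):
    """Highest representable frequency: half the sampling rate."""
    return sampling_rate_hz / 2


def formant_slope_hz_per_s(f_start_hz, f_end_hz, duration_s):
    """Assumed definition: formant change over the transition, in Hz/s."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return (f_end_hz - f_start_hz) / duration_s
```

With the recording setup described, `nyquist_hz(11000)` gives 5500.0. A smaller slope magnitude for the same diphthong would indicate a slower or reduced articulatory transition.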

Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation (외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석)

  • Kim, Bong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.17 no.8 / pp.1955-1961 / 2013
  • The paranasal sinuses are air-filled cavities within the bones of the face. When they become inflamed and fill with pus, sinusitis develops, bringing changes in the voice along with complaints of headache and lethargy. This paper therefore investigates voice analysis parameters for predicting paranasal sinus-related disease by measuring changes in the paranasal sinuses under external stimulation, and studies the function of the frontal, ethmoid, maxillary, and sphenoid sinuses. To this end, cold-pack stimulation was applied to the paranasal sinus area, formant frequencies of the voice were measured before and after stimulation, and correlation analysis was used to examine the effect of the external stimulus on the paranasal sinuses.

Influence of Temporo-mandibular Joint Training Using Physical Therapy on the Vowel Acoustic Characteristics (TM Joint의 물리치료를 통한 훈련이 모음의 음향학적 특성에 미치는 영향)

  • Min, Dong-Gi;Lee, Jae-Hong
    • Journal of the Korea Academia-Industrial cooperation Society / v.12 no.5 / pp.2203-2208 / 2011
  • This study examined changes in the vowel acoustic characteristics of patients with temporomandibular joint disorder. Temporomandibular training using physical therapy was applied to increase the joint's range of motion, and thus the oral resonant cavity involved in vowel articulation, so that a normal vocalization pattern could be maintained. The subjects were 3 male adults in their 20s and 30s who had been diagnosed with temporomandibular joint disorder. After the training program, the first formant frequency (F1), second formant frequency (F2), and fundamental frequency (F0) of the patients increased compared with before training. The results showed changes in F1, which is related to the degree of mouth opening of a vowel, and in F2 and F0, which are related to the front-back position of a vowel, indicating a relationship between the temporomandibular joint, vowels, and vocalization.

Perceptual cues for /o/ and /u/ in Seoul Korean (서울말 /ㅗ/와 /ㅜ/의 지각특성)

  • Byun, Hi-Gyung
    • Phonetics and Speech Sciences / v.12 no.3 / pp.1-14 / 2020
  • Previous studies have confirmed that /o/ and /u/ in Seoul Korean are undergoing a merger in the F1/F2 space, especially for female speakers. Female speakers are reported to use phonation (H1-H2) differences as a substitute for formants to distinguish /o/ from /u/. This study aimed to explore whether H1-H2 values are being used as perceptual cues for /o/-/u/. A perception test was conducted with 35 college students using /o/ and /u/ tokens spoken by 41 females, which overlap considerably in the vowel space. An acoustic analysis of 182 stimuli was also conducted to see whether production and perception correspond. The identification rate was 89% on average, 86% for /o/, and 91% for /u/. The results confirmed that when /o/ and /u/ cannot be distinguished in the F1/F2 space because they are too close, H1-H2 differences contribute significantly to the separation of the two vowels in production. In perception, however, this was not the case. H1-H2 values were not significantly involved in the identification process, and the formants (especially F2) were still the dominant cues. The study also showed that although H1-H2 differences are apparent in females' production, males do not use H1-H2 in their production, and neither females nor males use H1-H2 in their perception. It is presumed that H1-H2 has not yet developed as a perceptual cue for /o/ and /u/.
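H1-H2 is the level difference, in dB, between the first and second harmonics of the voice source spectrum. A minimal sketch, assuming the two harmonic amplitudes have already been measured on a linear scale (the abstract does not describe the measurement procedure, and no correction for formant influence is applied here; the function name is hypothetical):

```python
import math


def h1_minus_h2_db(h1_amplitude, h2_amplitude):
    """Level difference between the first two harmonics, in dB.

    Inputs are linear amplitudes (assumed already extracted from a spectrum).
    """
    if h1_amplitude <= 0 or h2_amplitude <= 0:
        raise ValueError("amplitudes must be positive")
    return 20 * math.log10(h1_amplitude) - 20 * math.log10(h2_amplitude)
```

A larger positive value means the first harmonic dominates, which is commonly associated with breathier phonation.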

Acoustic Analysis for Thermal Environment-related Vocalizations in Laying Hens (산란계의 열환경별 특이음에 대한 음성학적 분석)

  • Jeon, J.H.;Yeon, S.C.;Ha, J.K.;Lee, S.J.;Chang, H.H.
    • Journal of Animal Science and Technology / v.47 no.4 / pp.697-702 / 2005
  • The aim of this study was to divide vocalizations of laying hens (Hy-Line Brown) into general vocalizations (GVs), heat stress-related vocalization (HSV), and cold stress-related vocalizations (CSVs) and to determine whether they can be classified by discriminant function analysis. Thirty laying hens, 65 wk old, were recorded with digital video recorders twice from 10:00 to 14:00 h in each thermal environment (thermoneutral: $22.0{\pm}1.8^{\circ}C$, too hot: $32.0{\pm}2.0^{\circ}C$, too cold: $8.0{\pm}1.9^{\circ}C$) after a 7-day acclimation period. When the laying hens were not being recorded, they were kept in thermoneutral conditions. The GVs, HSV, and CSVs were divided based on the shapes of their spectra and spectrograms, and were identified as 5, 1, and 3 types, respectively. Pitch, intensity, duration, formant 1, formant 2, formant 3, and formant 4 differed significantly among the thermal environment-related vocalizations (P<0.001). The discrimination rate determined by discriminant function analysis was 86.2%. These results suggest that HSV and CSVs are present and may be used as indicators of the thermal environment.

A Design of Kidney Diseases Diagnosis Method Using Formant Frequency Bandwidth Extraction and Analysis (포먼트 주파수 대역폭 추출 및 분석을 이용한 신장 질환 진단 방법의 설계)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences / v.34 no.10B / pp.1062-1069 / 2009
  • Kidney disease is a major social problem, often arising as a sequela of metabolic syndrome caused by obesity. Because kidney abnormalities produce no symptoms at first, taking appropriate action early is most important. With this in mind, this paper proposes a method for diagnosing kidney disease through voice analysis in a way that is unobtrusive, non-restraining, and painless. The overall system is being developed to combine voice analysis with inspection of facial color; this paper designs the method for diagnosing kidney disease based on labial sounds. First, groups of kidney disease patients and healthy people were organized, and the voice information from the experiment was analyzed and compared through morphological and numerical analysis. Second, drawing on the auscultation theory of Oriental medicine and on linguistics and phonetics, the relationship between the kidney and the voice was analyzed to extract kidney-specific elements, which pointed to the first formant frequency. The experiment showed that the first formant frequency bandwidth was wider in the kidney patient group than in the normal group. Finally, the probability of misdiagnosis when diagnosing kidney disease from labial sounds alone was calculated.

How to Express Emotion: Role of Prosody and Voice Quality Parameters (감정 표현 방법: 운율과 음질의 역할)

  • Lee, Sang-Min;Lee, Ho-Joon
    • Journal of the Korea Society of Computer and Information / v.19 no.11 / pp.159-166 / 2014
  • In this paper, we examine the role of emotional acoustic cues, including both prosody and voice quality parameters, in modifying word sense. To extract prosody and voice quality parameters, we used 60 pieces of speech data spoken by six speakers in five different emotional states. We analyzed eight different emotional acoustic cues and used discriminant analysis to find the dominant sequence of acoustic cues. As a result, we found that anger is closely related to intensity level and 2nd formant bandwidth range; joy is related to the position of the 2nd and 3rd formant values and to intensity level; sadness is strongly related only to prosody cues such as intensity level and pitch level; and fear is related to pitch level and to the 2nd formant value with its bandwidth range. These findings can be used as a guideline for fine-tuning an emotional spoken language generation system, because these distinct sequences of acoustic cues reveal the subtle characteristics of each emotional state.

The effect of palatal height on the Korean vowels (구개의 높이가 한국어 모음 발음에 미치는 효과에 관한 연구)

  • Chung, Bo-Yoon;Lim, Young-Jun;Kim, Myung-Joo;Nam, Shin-Eun;Lee, Seung-Pyo;Kwon, Ho-Beom
    • The Journal of Korean Academy of Prosthodontics / v.48 no.1 / pp.69-74 / 2010
  • Purpose: The purpose of this study was to analyze the influence of palatal height on Korean vowels and speech intelligibility in Korean adults and to produce baseline data for future prosthodontic treatment. Material and methods: Forty-one healthy Korean men and women who had no problems with pronunciation, hearing, or communication and no history of airway disease participated in this study. Subjects were classified into H, M, and L groups after clinical determination of palatal height with study casts. Seven Korean vowels were used as sample vowels, and the subjects' clear speech sounds were recorded on a computer using the Multispeech software program. The F1 and F2 values of the 3 groups were obtained and compared. In addition, the vowel working spaces of the 3 groups, defined by the corner vowels /a/, /i/, and /u/, were obtained and their areas were compared. The Kruskal-Wallis test and the Mann-Whitney U test were used as statistical methods, and P < .05 was considered statistically significant. Results: There were no significant differences in formant frequencies among the 3 groups except for the F2 formant frequency between the H and L groups (P = .003). The vowel working spaces of the 3 groups were similar in shape, and no significant differences in their areas were found. Conclusion: Palatal height did not affect formant frequencies in most of the vowels or speech intelligibility. The dynamics of tongue activity seem to compensate for the morphological difference.
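The area of a vowel working space bounded by three corner vowels can be computed with the shoelace formula for a triangle. This is a sketch under the assumption that each corner vowel is represented as an (F1, F2) point in Hz; the abstract does not specify how the areas were calculated, and the function name is hypothetical.

```python
def vowel_triangle_area(a, i, u):
    """Area of the triangle spanned by three (F1, F2) points, in Hz^2.

    Uses the shoelace formula; the order of the corners does not matter.
    """
    (x1, y1), (x2, y2), (x3, y3) = a, i, u
    return abs(x1 * (y2 - y3) + x2 * (y3 - y1) + x3 * (y1 - y2)) / 2
```

Calling the function with each group's mean /a/, /i/, and /u/ formant pairs gives one area per group, and those areas can then be compared statistically.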

A Study of Acoustic Masking Effect from Formant Enhancement in Digital Hearing Aid (디지털 보청기에서의 포먼트 강조에 의한 마스킹 효과 연구)

  • Jeon, Yu-Yong;Kil, Se-Kee;Yoon, Kwang-Sub;Lee, Sang-Min
    • Journal of the Institute of Electronics Engineers of Korea SC / v.45 no.5 / pp.13-20 / 2008
  • Although digital hearing aid algorithms have been developed to compensate for hearing loss and to help hearing-impaired people communicate with others, digital hearing aid users still complain of difficulty hearing speech. One reason could be that the quality of speech delivered through a digital hearing aid is insufficient for understanding because of feedback, residual noise, and similar factors. Another is the masking effect among formants, which lowers sound quality. In this study, we measured the masking characteristics of normal listeners and hearing-impaired listeners with presbyacusis to confirm the masking effect in speech itself. The experiment comprised 5 tests: a pure tone test, a speech reception threshold (SRT) test, a word recognition score (WRS) test, a pure tone masking test, and a speech masking test. In the speech masking test, each speech set contained 25 speech items, and the log likelihood ratio (LLR) was introduced to evaluate the distortion of each item objectively. As a result, speech perception decreased as the amount of formant enhancement increased. The enhanced items within a speech set had statistically similar LLR values, but their speech perception did not. This means that the acoustic masking effect, rather than distortion, influences speech perception. Indeed, frequency analysis of the speech that listeners could not answer correctly showed a level difference of about 35 dB between the first and second formants, similar to the result of the pure tone masking test (normal-hearing subjects: 36.36 dB; hearing-impaired subjects: 32.86 dB). The characteristics of the masking effect differ between normal listeners and hearing-impaired listeners. It is therefore necessary to check the characteristics of the masking effect before a hearing aid is worn and to apply these characteristics to fitting.