• Title/Summary/Keyword: Formant analysis

Search Result 191, Processing Time 0.028 seconds

A study on speech analysis of person with presbycusis (노인성 난청인의 음성특성에 관한 연구)

  • Lee, S.M.;Song, C.G.;Woo, H.C.;Lee, Y.M.;Kim, W.K.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1997 no.11
    • /
    • pp.67-70
    • /
    • 1997
  • In this paper, we evaluated the character of speech of hearing impaired person (HIP) who acquire his hearing loss after the youth. It is usually observed that severe HIP decreased not only speech perception but also vocalization. so there is a need for sensitive and quantitative measures or the assesment of the speech of the HIP to serve both diagnostic and prognosic purposes, 7 HIP and 12 normal hearing person(NHP) were studied with pure tone test and speaking test using word/sentence table which consists of vowel(a:), mono and two syllables and a sentence. we analyzed formant frequency, pitch, sound intensity, speech duration of HIP and NHP speech. According to the results, in the HIP's speech we find that formant frequency was shifted, first-formant prominence was reduced, the dynamic range of sound intensity was decreased, speech duration was prolonged. In the next, we expect the correlation between hearing and speech character of HIP is cleared through analysis of more acoustic parameters and precise selection of HIP group.

  • PDF

Sound parameters for classifying individual sows(Landrace×Yorkshire) during nursing behavior (수유행동시 모돈(랜드레이스×요크셔) 발성음의 개체 판별을 위한 음성 파라미터)

  • Jeon, Jung-Hwan;Chang, Hong-Hee;Ha, Jeung-Key;Kim, Hyeon-Hui;Koo, Ja-Min;Lee, Hyo-Jong;Yeon, Seong-Chan
    • Korean Journal of Veterinary Research
    • /
    • v.43 no.1
    • /
    • pp.165-169
    • /
    • 2003
  • The aim of the present study was to analyse grunts of the sows and to extract parameters from the time and frequency signals in nursing behavior. Five crossbred $Landrace{\times}Yorkshire$ sows were used on day 5 or 6 postpartum. The grunts and the behaviors of the five sows were recorded with five digital camcorders. Three parameter groups [Group I: Formant vector alone, Group II: Formant vector+parameters from time signal, Group III: Formant vector+parameters from time signal-parameters eliminated by stepwise discriminant analysis backward (SDAB)] with parameter vectors extracted from single grunts in the maximum grunting rate period were used for individuality of the sows. The parameter groups were compared by a discriminant function analysis. The classification system adopted in the Group II represented the higher discriniation rate than those in other groups (Group I: 63.3%, Group II: 83.0%, Group III: 80.0%). This study demonstrated that formant, intensity, and pitch were available sound parameters for individuality of the sows during nursing behavior.

An Acoustic Analysis and Perceptual Study of Korean Vowels Produced by Transgenders and Noraml Adults (성전환자와 정상인이 발성한 모음의 음향분석과 지각실험)

  • Jo, Sung-Mi;Jeong, Ok-Ran
    • Speech Sciences
    • /
    • v.10 no.3
    • /
    • pp.145-155
    • /
    • 2003
  • This study compared $F_{0}$ and the first three formants of eight Korean monophthongs produced by nine transgenders (male to female) to those of eighteen normal adults. Voice analysis was done by Praat (version 4.049). A one-way ANOVA with Tukey HSD post hoc tests were performed to determine statistical differences in $F_{0}$ and formant values obtained from transgenders, and normal male and female subjects. Results indicated that there was no significant difference in $F_{1}$ of /u/, /$\Lambda$/, and /o/, $F_{2}$ of /u/, /$\Lambda$/, and /i/ and $F_{3}$ of /u/ among the 3 groups (transgenders, normal males and normal females). However, in the comparison of transgenders vs. males, a significant difference was observed in $F_{0}$ of /o/, and $F_{2}$ of /i/, /a/, /e/, and /${\ae}$/ and $F_{3}$ of /e/. Furthermore, in the comparison of transgenders vs. females, a significant difference was also observed in $F_{0}$ of all vowels, $F_{1}$ of /i/, /$\alpha$/, /e/, /${\ae}$/, and /i/. $F_{2}$ of /i/, and /${\ae}$/, and $F_{3}$ of /i/, /$\alpha$/, /$\Lambda$/, /e/, /${\ae}$/, /i/, and /o/. Also, perceptual judgment of the transgenders' voice came out somewhat correlated strongly with their $F_{0}$ values but not much with the formant values. It was concluded that the transgenders' acoustic parameters are placed in between those of the normal males and females in. terms of fundamental and formant frequency analyses of vowels. Thus, it was assumed that those differences might stem from the transgenders' original big resonating cavities.

  • PDF

A Study on Voice Analytical the Vocal Cord and Formant Change in the Smoking and Secondhand Smoking Environments (직.간접흡연 환경에서의 성대 및 음형대 변화에 대한 음성 분석학적 연구)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.36 no.6B
    • /
    • pp.720-727
    • /
    • 2011
  • Modern people has been increased interest about health care and maintenance as emerging well-being and social issues. In particular, the smoking is not good for the recognition much greater importance is the massive spread of the smoking is low. The smoking has much adverse effects body's respiratory and circulatory organ many and it is recognized as a serious danger to our health the smoking as well as secondhand smoking. In this paper, we were carried out study analysis comparison to apply though voice analytical elements techniques have a influence vocal cords and formants in the environment smoking and secondhand smoking. For this purpose, we organized subjects group smoker and nonsmoker in 20's man and to collect voice of the smoke and Secondhand Smoking before after then we carried out study analysis experimental results Pitch, Jitter, Shimmer, 5~8 Formant Frequency.

A Study on a New Pre-emphasis Method Using the Short-Term Energy Difference of Speech Signal (음성 신호의 다구간 에너지 차를 이용한 새로운 프리엠퍼시스 방법에 관한 연구)

  • Kim, Dong-Jun;Kim, Ju-Lee
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.50 no.12
    • /
    • pp.590-596
    • /
    • 2001
  • The pre-emphasis is an essential process for speech signal processing. Widely used two methods are the typical method using a fixed value near unity and te optimal method using the autocorrelation ratio of the signal. This study proposes a new pre-emphasis method using the short-term energy difference of speech signal, which can effectively compensate the glottal source characteristics and lip radiation characteristics. Using the proposed pre-emphasis, speech analysis, such as spectrum estimation, formant detection, is performed and the results are compared with those of the conventional two pre-emphasis methods. The speech analysis with 5 single vowels showed that the proposed method enhanced the spectral shapes and gave nearly constant formant frequencies and could escape the overlapping of adjacent two formants. comparison with FFT spectra had verified the above results and showed the accuracy of the proposed method. The computational complexity of the proposed method reduced to about 50% of the optimal method.

  • PDF

The Study for Advancing the Performance of Speaker Verification Algorithm Using Individual Voice Information (개별 음향 정보를 이용한 화자 확인 알고리즘 성능향상 연구)

  • Lee, Je-Young;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.253-263
    • /
    • 2002
  • In this paper, we propose new algorithm of speaker recognition which identifies the speaker using the information obtained by the intensive speech feature analysis such as pitch, intensity, duration, and formant, which are crucial parameters of individual voice, for candidates of high percentage of wrong recognition in the existing speaker recognition algorithm. For testing the power of discrimination of individual parameter, DTW (Dynamic Time Warping) is used. We newly set the range of threshold which affects the power of discrimination in speech verification such that the candidates in the new range of threshold are finally discriminated in the next stage of sound parameter analysis. In the speaker verification test by using voice DB which consists of secret words of 25 males and 25 females of 8 kHz 16 bit, the algorithm we propose shows about 1% of performance improvement to the existing algorithm.

  • PDF

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study (성악다들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여-)

  • Ban, Jae-Ho;Kwon, Young-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.31-36
    • /
    • 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In previous study, authors showed that in trained tenors, besides the conventional singer's formant in the region of ,5500Hz, another energy peak was observed in the region of 8,000Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have as yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos and five mezzo sopranos attending the music college, department of vocal music who could reliably produce the head and chest registers were chosen for this study. Each subject was asked to produce an/ah/sound for at least three seconds for the head register sound(tenors ; G4, barions ; E4 sopranos ; F5 and mezzosopranos ; C5) and for the chest register sound (tenors ; C3, baritones ; D3, sopranos ; D4 and Mezzosoprano ; A3). The sound data was analyzed using the Fast Fourier Transform (FFT)-based power spectrum, Long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social sciences(SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400Hz region(p<0.05) and the Similar to the head register sounds, there was a significant increase in energy in the four trained singer group compared with the untrained group in the 2,200-3,100Hz region(p<0.05), the 7,800-8,400Hz region(p<0.05) for the chest register sounds. Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500Hz, a frequency already known as the "singer's formant', in all subjects in the study group. Another region of increased energy was observed around 8,000Hz that had not been noticed previously. The authors believe this region to be the second singer's formant.

  • PDF

A COMPUTER ANALYSIS ON THE KOREAN CONSONANT SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -Dentoalveolar and hard palatal consonant- (구개상의 두께에 따른 한국어 자음의 발음 변화에 관한 컴퓨터 분석 - 치조음, 경구개음-)

  • Woo, Yi-Hyung;Choi, Dae-Kyun;Choi, Boo-Byung;Park, Nam-Soo
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.25 no.1
    • /
    • pp.71-94
    • /
    • 1987
  • This study was carried out to investigate the sound distortion following the alternation of the palatal plate thickness. For this study, 2 healthy male subjects (24-year-old) were selected. Born in Seoul, they both spoke Seoul dialect. First, their sounds of /na(나)/, /da(다)/, /1a(라)/, /ja(자)/, /cha(차)/, /ta(타)/, without inserting plates were recorded, and then the sounds with palatal plates of different thickness were recorded, successively. The plate was fabricated in 3 types, each palatal thickness being 1.0mm, 2.5mm, dentoalveolar portion 2.5mm, other residual portion was 1.0mm, successively. Each type plates named B, C, D-type, in succession. Series of analysis were administered through Computer(16 bit) to analyze the sound distortions. These experiments were analyzed by the LPC (without weighting, pre-weighting, post-weighting) of the consonants, vowels portion, formant frequency of the vowels and word duration of the consonants. The findings led to the following conclusions: 1. There was no correlation of the distortion rate on the 2 informants. 2. Generally, vowels were not affected by the palatal plate thickness in the formant analysis, however, more distortion was detected in the LPC analysis, especially C, D-type plates. 3. Consonants distortion was more evident in the C, D-type plate. 4. The second formant was most disturbed and reduced in the all consonants with insertion of the palatal plate, especially C, D-type plate. 5. Word duration was shortened in the plate inserted(except /ja/, /cha/), especially C, D-type. 6. It was found that dentoalveolar, hard palatal sounds were severely distorted in plate inserted, and they were mainly affected by the dentoalveolar portion thickness. 7. There was correlation between palatal thickness and consonants quality.

  • PDF

A Comparative Study of Western Singer's Voice and a Pansori Singer's Voice Based on Glottal Image and Acoustic Characteristics (성대형태 및 음향발현에서 성악 발성 및 판소리 발성의 비교 연구)

  • Kim, Sun-Sook
    • Speech Sciences
    • /
    • v.11 no.2
    • /
    • pp.165-177
    • /
    • 2004
  • Western singers voice have been studied in music science since the early 20th century. However, Korean traditional singers voice have not yet been studied scientifically. This study is to find the physiological and acoustic characteristics of Pansori singers voices. Western singers participated for comparative purposes. Ten western singers and ten Pansori singers participated in this study. The subjects spoke and sung seven simple vowels /a, e, i, o, u, c, w/. An analysis of Glottal image was done by Scope View and acoustic characteristics of speech and singing voice were analyzed by CSL. The results are as follows: (1) Glottal gestures of Pansori singers showed asymmetric vocal folds. (2) Singing vowel formants of Pansori singers showed breathiness based on Spectrogram. (3) Music formant of western singers appeared in around 3kHz area, however, Pansori singers formant appeared in low frequency area. Modulation of vibrato showed 6 frequency per sec in case of western singers. Pansori singers showed no deep modulation of vibrato on spectrogram.

  • PDF

An Acoustical Comparison of English Tense and Lax Vowels Produced by Korean and American Males (한국인남성과 미국인남성이 발음한 영어 긴장.이완모음의 음향적 비교)

  • Yang, Byung-Gon
    • Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.19-27
    • /
    • 2008
  • Several studies on the pronunciation of English vowels point out that Korean learners have difficulty distinguishing English tense and lax vowel pairs. The acoustic comparisons of those studies are mostly based on the formant measurement at one time point of a given vowel section. However, the English lax vowels usually show dynamic changes across their syllable peaks and subjects' English levels account for various conflicting results. The purposes of this paper are to compare the temporal duration and dynamic formant tracks of English tense and lax vowel pairs produced by five Korean and five American males. The subjects were graduate students of an American state university. Results showed that both the Korean and American males produced the vowels with comparable durations. The duration of the front tense-lax vowel pair was longer than that of the back vowel pair. From the formant track comparisons, the American males produced the tense and lax pairs much more distinctly than the Korean male speakers. The results suggest that the Korean males should pay attention to the F1 and F2 movements, i.e., the jaw and tongue movements, in order to match those of the American males. Further studies are recommended on the auditorily acceptable ranges of F2 variation for the lax vowels.

  • PDF