• Title/Summary/Keyword: Formant analysis

Search Result 191, Processing Time 0.032 seconds

A Study on Monitoring of Liver Function Based on Voice Signal Analysis for u-Health System (u-Health 시스템을 위한 음성신호 분석 기반의 간 기능 모니터링에 관한 연구)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The KIPS Transactions:PartB
    • /
    • v.18B no.6
    • /
    • pp.389-396
    • /
    • 2011
  • There is getting worse to various liver diseases due to change in eating habits, stress, alcohol etc in modern society. Therefore, we proposed methodology to diagnose early for liver disease to study the influence on voice in liver diseases. To this end, we carried out experiment to apply parameter of voice analysis to collect each voice inpatients and patients by treatment of liver diseases patients. Particularly, we carried out experiment to apply element value of pronunciation and the third formant frequency bandwidths about velar sounds associated liver in oriental medicine, then to produce objective index resonance cavity and influence vocalization in liver diseases. In addition, we carried out to study about design of system to monitoring a liver function in u-Health environment based on result by experiment.

AN ACOUSTIC ANALYSIS ON THE PRONUNCIATION OF KOREAN VOWELS IN PATIENT WITH CLASS III MALOCCLUSION (III급 부정교합 환자의 한국어 모음 발음에 관한 음향학적 분석)

  • Kim, Young-Ho;Yoo, Hyun-Ji;Kim, Whi-Young;Hong, Jong-Rak
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • v.35 no.4
    • /
    • pp.221-228
    • /
    • 2009
  • The purpose of the study was to investigate the characteristics of the pronunciation of Korean vowels in patients with class III malocclusion. 11 adult male patients with class III malocclusion(mean ages 22.3 years) and four adult males with normal occlusion(mean ages 26.5 years) were selected for the analysis of eight Korean monophthongs /ㅣ, ㅔ, ㅐ, ㅏ, ㅓ, ㅗ, ㅡ, ㅜ/. The values and relationships of F1, F2 and F3 were derived from the stable section of target vowel in each sentence, and the analysis using formant plots and vowel triangles' distance and area was conducted to find the features of two groups' vowel distributions. Consequently, it was identified that the pronunciation of males patients with class III malocclusion showed high values of F1 in the low vowels, high values of F2 in the back vowels, and remarkably low position of /ㅏ/. The vowel triangle suggested that the triangle areas of male patients with class III malocclusion were shown wider vertically and narrower horizontally than those of males with normal occlusion. These characteristics could reflect the structural features of class III malocclusion such as the prognathic mandible, low tongue position, and advancement of back position of the tongue.

Contrastive Analysis of Mongolian and Korean Monophthongs Based on Acoustic Experiment (음향 실험을 기초로 한 몽골어와 한국어의 단모음 대조분석)

  • Yi, Joong-Jin
    • Phonetics and Speech Sciences
    • /
    • v.2 no.2
    • /
    • pp.3-16
    • /
    • 2010
  • This study aims at setting the hierarchy of difficulty of the 7 Korean monophthongs for Mongolian learners of Korean according to Prator's theory based on the Contrastive Analysis Hypothesis. In addition to that, it will be shown that the difficulties and errors for Mongolian learners of Korean as a second or foreign language proceed directly from this hierarchy of difficulty. This study began by looking at the speeches of 60 Mongolians for Mongolian monophthongs; data were investigated and analyzed into formant frequencies F1 and F2 of each vowel. Then, the 7 Korean monophthongs were compared with the resultant Mongolian formant values and are assigned to 3 levels, 'same', 'similar' or 'different sound'. The findings in assessing the differences of the 8 nearest equivalents of Korean and Mongolian vowels are as follows: First, Korean /a/ and /$\wedge$/ turned out as a 'same sound' with their counterparts, Mongolian /a/ and /ɔ/. Second, Korean /i/, /e/, /o/, /u/ turned out as a 'similar sound' with each their Mongolian counterparts /i/, /e/, /o/, /u/. Third, Korean /ɨ/ which is nearest to Mongolian /i/ in terms of phonetic features seriously differs from it and is thus assigned to 'different sound'. And lastly, Mongolian /$\mho$/ turned out as a 'different sound' with its nearest counterpart, Korean /u/. Based on these findings the hierarchy of difficulty was constructed. Firstly, 4 Korean monophthongs /a/, /$\wedge$/, /i/, /e/ would be Level 0(Transfer); they would be transferred positively from their Mongolian counterparts when Mongolians learn Korean. Secondly, Korean /o/, /u/ would be Level 5(Split); they would require the Mongolian learner to make a new distinction and cause interference in learning the Korean language because Mongolian /o/, /u/ each have 2 similar counterpart sounds; Korean /o, u/, /u, o/. Thirdly, Korean /ɨ/ which is not in the Mongolian vowel system will be Level 4(Overdifferentiation); the new vowel /ɨ/ which bears little similarity to Mongolian /i/, must be learned entirely anew and will cause much difficulty for Mongolian learners in speaking and writing Korean. And lastly, Mongolian /$\mho$/ will be Level 2(Underdifferentiation); it is absent in the Korean language and doesn‘t cause interference in learning Korean as long as Mongolian learners avoid using it.

  • PDF

Study of Emotion in Speech (감정변화에 따른 음성정보 분석에 관한 연구)

  • 장인창;박미경;김태수;박면웅
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2004.10a
    • /
    • pp.1123-1126
    • /
    • 2004
  • Recognizing emotion in speech is required lots of spoken language corpus not only at the different emotional statues, but also in individual languages. In this paper, we focused on the changes speech signals in different emotions. We compared the features of speech information like formant and pitch according to the 4 emotions (normal, happiness, sadness, anger). In Korean, pitch data on monophthongs changed in each emotion. Therefore we suggested the suitable analysis techniques using these features to recognize emotions in Korean.

  • PDF

Speech signal processing in the auditory system (청각 계통에서의 음성신호처리)

  • 이재혁;심재성;백승화;박상희
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1987.10b
    • /
    • pp.680-683
    • /
    • 1987
  • The speech signal processing in the auditory system can be analysized based on two representations : Average discharge rate and Temporal discharge pattern. But the average discharge rate representation is restricted by the narrow dynamic range because of the rate saturation and the two tone suppression phenomena, and the temporal discharge pattern representation needs a sophisticate frequency analysis and synchrony measure. In this paper, a simple representation is proposed : using a model considering the interaction of Cochlear fluid-BM movement and a haircell model, the feature of speech signals (formant frequency and pitch of vowels) is easily estimated in the Average Synchronized Rate.

  • PDF

Comparison of the Dynamic Time Warping Algorithm for Spoken Korean Isolated Digits Recognition (한국어 단독 숫자음 인식을 위한 DTW 알고리즘의 비교)

  • 홍진우;김순협
    • The Journal of the Acoustical Society of Korea
    • /
    • v.3 no.1
    • /
    • pp.25-35
    • /
    • 1984
  • This paper analysis the Dynamic Time Warping algorithms for time normalization of speech pattern and discusses the Dynamic Programming algorithm for spoken Korean isolated digits recognition. In the DP matching, feature vectors of the reference and test pattern are consisted of first three formant frequencies extracted by power spectrum density estimation algorithm of the ARMA model. The major differences in the various DTW algorithms include the global path constrains, the local continuity constraints on the path, and the distance weighting/normalization used to give the overall minimum distance. The performance criterias to evaluate these DP algorithms are memory requirement, speed of implementation, and recognition accuracy.

  • PDF

An Experimental Phonetic Analysis on Japanese Vowels of Japanese Natives (일본인 화자의 일본어 모음에 관한 실험음성학적 분석)

  • Lee Jae-Gang
    • MALSORI
    • /
    • no.33_34
    • /
    • pp.57-69
    • /
    • 1997
  • In this paper, 1 will try to examine the aspects of formants, based on the LPC analysis. In this analysis, five Japanese vowels (a, i, u, e, o) will experience two kinds of experiments: vowels in isolated forms, and vowels in carrier sentences. The analysis results of Japanese vowels of the Japanese natives show a peculiar feature that Japanese vowels form respective vowel groups. Each Japanese vowel makes a statistically significant difference. In the Fl analysis of the vowels grouped by the informant's sex, Japanese vowel (a) shows the greatest standard deviation without regard to the informant's sex. In the F2 analysis of Japanese vowels, each vowel has a statistically significant difference. The fact that the male's [u] shows great standard deviation means that there is a great difference of the frontness of the tongue among the Japanese males in articulating [u]. Isolated vowels and carried vowels show statistically little significance between Fl and F2 frequency values. In another contrastive analysis between the isolated vowel group and the carried vowel group, whether a vowel is articulated in isolation or in a sentence appears to have little effect on its formant frequency.

  • PDF

A Study on the Channel Normalized Pitch Synchronous Cepstrum for Speaker Recognition (채널에 강인한 화자 인식을 위한 채널 정규화 피치 동기 켑스트럼에 관한 연구)

  • 김유진;정재호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.61-74
    • /
    • 2004
  • In this paper, a contort- and speaker-dependent cepstrum extraction method and a channel normalization method for minimizing the loss of speaker characteristics in the cepstrum were proposed for a robust speaker recognition system over the channel. The proposed extraction method creates a cepstrum based on the pitch synchronous analysis using the inherent pitch of the speaker. Therefore, the cepstrum called the 〃pitch synchronous cepstrum〃 (PSC) represents the impulse response of the vocal tract more accurately in voiced speech. And the PSC can compensate for channel distortion because the pitch is more robust in a channel environment than the spectrum of speech. And the proposed channel normalization method, the 〃formant-broadened pitch synchronous CMS〃 (FBPSCMS), applies the Formant-Broadened CMS to the PSC and improves the accuracy of the intraframe processing. We compared the text-independent closed-set speaker identification on 56 females and 112 males using TIMIT and NTIMIT database, respectively. The results show that pitch synchronous km improves the error reduction rate by up to 7.7% in comparison with conventional short-time cepstrum and the error rates of the FBPSCMS are more stable and lower than those of pole-filtered CMS.

Fundamental Acoustic Investigation of Korean Male 5 Monophthongs (한국 남성의 단모음 [아, 에, 이, 오, 우]에 대한 음향음성학적 기반연구)

  • Choi, Yae-Lin
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.6
    • /
    • pp.373-377
    • /
    • 2010
  • Numerous quantitative and qualitative studies have already been published related to English vowels. However, only minimal amounts of studies based on the acoustic analysis of Korean vowels have been accomplished. The purpose of this study is to obtain sufficient quantitative data based on the acoustic aspects of Korean vowels produced by males between the ages of 20s and 30s. A total of 31 males in their 20s and 30s produced the five fundamental vowels /a, e, i, o, u/ by repeating each of them three times in the standard Korean dialect. Such speech productions were recorded with 'Cool edit' and F1, F2, F3, F4 were extracted through the MATLAB acoustic analysis program. Results indicated that the overall patterns of formants were similar to previous studies, except that the formant levels of F1 and F2 of the vowels produced in this study were generally lower than that in previous studies. Future studies need to focus on obtaining vowel data by considering other factors such as age and other speech materials.

The Patterns of Vowel Insertion in Korean Speakers' Production of English C+/l/ and C+/r/ Clusters

  • Kang, Seo-Yoon
    • Phonetics and Speech Sciences
    • /
    • v.4 no.4
    • /
    • pp.3-17
    • /
    • 2012
  • This study examines Korean speakers' production of English consonant clusters, focusing on vowel insertion. An acoustic analysis along with a statistical test was carried out to see what factors are involved in this production. The following factors were considered in the present study: phonetic properties, L1 transfer, and cluster types. Specifically, liquid types were considered to see if they cause any difference depending on C+/l/ or C+/r/ clusters in the onset in terms of vowel insertion patterns. That is, it was examined which Korean speakers produce better, C+/l/ or C+/r/ clusters. Interestingly, the result of the present experiment shows that the correct answer percent was higher in the C+/r/ onset clusters than C+/l/ onset clusters unlike Eckman's (1977) Marked Differential Hypothesis. In other words, the occurrence of the vowel insertion in C+/l/ clusters is higher than C+/r/ onset clusters. This may be attributed to L1 transfer. Furthermore, in the present study, three patterns of vowel insertion in the C+/l/ clusters were identified by implementing an acoustic analysis based on vowel duration and formant: a) vowel insertion with gemination, b) phonological epenthesis, and c) phonetic intrusion. However, phonetic intrusion mainly occurred in the C+/r/ clusters. Data were collected from 54 Korean speakers to see what factors are involved in vowel insertion patterns in the production of English consonant clusters. This study provides evidence for L1 transfer, the duration effect of /l/ in a different context, and three kinds of vowel insertion patterns in conjunction with gestural coordination by age groups.