Search | Korea Science

The Study of Voice Perception with Formant Analysis of Two Myna Bird's Voice Imitation (구관조 음성모방의 음향학적 분석을 통한 음성인식에 대한 고찰)

Lee, Ok-Bun;Jeong, Ok-Ran
- Speech Sciences / v.12 no.2 / pp.121-128 / 2005
This study attempted to determine the acoustic characteristics of myna birds' vocalizations. The sounds of two myna birds imitating the voice of a normal male speaker in his late 20s were sampled and analyzed. The analyses covered the mean values of F1, F2, and F3 and the pitch contours. The results were as follows. First, the mean values of F1, F2, and F3 in the isolated vowels /a/ and /i/ differed significantly between the myna birds' sounds and the human voice; however, there was no apparent difference in the pitch contour of their formants. Second, there was a difference in the pitch contour of their formants in the production of the sentence 'an-nyung-ha-se-yo?' (meaning 'How are you?'): the myna birds' pitch contour was located higher than the human's.

A Study on Comparison of Pronunciation Accuracy of Soprano Singers

Song, Uk-Jin;Park, Hyungwoo;Bae, Myung-Jin
- International journal of advanced smart convergence / v.6 no.2 / pp.59-64 / 2017
Female vocalists' voices fall into three types according to vocal range: soprano, mezzo-soprano, and contralto. Of these, the soprano has the highest range. Because the voice is generated through the human vocal tract, as described by the voice generation model, it is strongly influenced by the vocal tract. The structure of the vocal organs differs from person to person, and the formant characteristics of vocalization differ accordingly. A formant characteristic is one in which a specific frequency band appears distinctly because of resonance occurring in each part of the vocal tract during vocalization. Formant characteristics include individual traits arising in the throat, jaw, lips, and teeth, as well as the phonological properties of phonemes. The first formant is attributed to the throat and the second to the jaw, while the third and fourth formants are caused by resonance in the lips and the teeth. Pronunciation is thus influenced not only by phonological information but also by the jaw, lips, and teeth. When the mouth opening is small or the jaw is stiff, pronunciation becomes unclear. Therefore, the higher the accuracy of pronunciation, the more clearly the formant characteristics appear in the grammar spectrum. However, many soprano singers cannot open their mouths fully because their jaws, lips, teeth, and facial muscles are held rigid to maintain high tones when singing, which makes the pronunciation unclear and the formant characteristics indistinct. In this paper, to assess the pronunciation accuracy of soprano singers, five Korean soprano singers (A, B, C, D, and E) were selected as the experimental group; their grammar spectra were analyzed and a MOS test for pronunciation recognition was conducted. Soprano singer B showed clearly recognizable formants from F1 to F5 and the highest recognition rate in the MOS test, at 4.6 points. Soprano singers A, C, and D showed formants from F1 to F3, but formants above 2 kHz were difficult to find. Finally, for soprano singer E the formants were difficult to find overall, and the MOS test showed the lowest recognition rate, at 2.1 points. We therefore confirmed that soprano singer B, who exhibits the most distinct formant characteristics in the grammar spectrum, has the best pronunciation accuracy.
https://doi.org/10.7236/IJASC.2017.6.2.59

Experimental Study on the Korean Monophthongs by Vietnamese Advanced Korean Learners. (베트남인 고급 학습자의 한국어 단모음에 대한 실험음성학적 연구)

Jang, Hyejin
- Korean Linguistics / v.80 / pp.211-234 / 2018
This study investigates the acoustic properties of Korean and Vietnamese monophthongs produced by Vietnamese advanced learners of Korean and discusses their realization of Korean monophthongs in comparison with Korean speakers. The Vietnamese advanced learners do not distinguish between /e/ and /ɛ/, as is also the case for Koreans, and they pronounce Korean /e(ɛ)/ close to the /e/ of their native language. Previous studies report many errors for /ʌ/; however, the /ʌ/ of the Vietnamese advanced learners is realized similarly to the /ʌ/ spoken by Koreans. The learners pronounce /ɯ/ at the back of the tongue, whereas Koreans pronounce it centrally. For /o/ and /u/, no significant difference is found for the Vietnamese advanced learners. Koreans pronounce /ɯ/ and /u/ relatively far forward, but this is not observed in the Vietnamese advanced learners.
https://doi.org/10.20405/kl.2018.08.80.211

The Acoustic Characteristics of Articulation and Phonation in Peritonsillar Abscess (편도외 농양 환자의 발화시 조음 및 음성의 변화)

Choi, Hyun-Jin;Song, Yun-Kyung;Yeo, Jang-Ok;Huh, Se-Hyung;Jin, Sung-Min
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.19 no.2 / pp.133-135 / 2008
Background and Objectives: Voice changes can occur in peritonsillar abscess, and these changes are commonly labeled a "muffled voice". The aim of this study was to investigate the changes in the acoustic features of the voice before and after treatment in patients with peritonsillar abscess. Materials and Method: Twelve patients with peritonsillar abscess were enrolled in the study. Acoustic analysis of the sustained Korean vowels /a/, /i/, and /u/ was performed before and after treatment. Results: In patients with peritonsillar abscess, the first formant frequency (F1) and second formant frequency (F2) of /a/ were decreased; the back-low vowel /a/ tended to be articulated like the back-high vowel /u/. F1 of /i/ and /u/ was increased, while F2 was decreased; the front-high vowel /i/ tended to be articulated like the back-low vowel /a/. The third, fourth, and fifth formant frequencies (F3, F4, F5) of /a/, /i/, and /u/ were decreased, although not to a statistically significant degree. Conclusion: The anatomical and functional changes of the oropharynx caused by peritonsillar abscess can alter resonance and speech quality. We suggest that these changes could be the cause of the "muffled voice" in patients with peritonsillar abscess.

Perceptual Dimensions of Korean Vowel: A Link between Perception and Production (한국어 모음의 지각적 차원 -지각과 산출간의 연동-)

Choi, Yang-Gyu
- Speech Sciences / v.8 no.2 / pp.181-191 / 2001
The acoustic quality of a vowel is known to be determined mostly by the frequencies of the first formant (F1) and the second formant (F2). This study examined the perceptual (or psychological) dimensions of vowel perception and discussed the relationships among the perceptual dimensions, the acoustic dimensions (F1 and F2), and the articulatory gestures of vowels. An experiment using the multi-dimensional scaling (MDS) technique was performed to identify the perceptual dimensions of Korean vowel perception. In the experiment, 8 speakers of standard Seoul Korean performed a similarity rating task on 10 synthesized Korean vowels, and a two-dimensional MDS solution was obtained from the similarity rating scores. The results showed that the two perceptual dimensions, D1 and D2, correlated strongly with F2 and F1 (r = -.895 and .878, respectively) and were therefore interpreted as 'vowel advancement' and 'vowel height', respectively. The relationship between the perceptual dimensions of vowels and the articulatory positions of the tongue suggested that perception may be directly linked to production. Problems for further research were discussed in the final section.
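The correlation step reported above (a perceptual dimension against a formant frequency) can be illustrated with a plain Pearson coefficient. This is only a sketch: the function name `pearson_r` and the five data points are hypothetical and are not taken from the paper, whose MDS coordinates and stimulus formants are not given in the abstract.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)

# Hypothetical MDS coordinates on D1 and hypothetical F2 values (Hz) for five vowels.
d1_coords = [-2, -1, 0, 1, 2]
f2_values = [2300, 2000, 1400, 1200, 700]

r = pearson_r(d1_coords, f2_values)
print(f"r(D1, F2) = {r:.3f}")  # strongly negative for this made-up data
```

A strongly negative r, as with D1 and F2 in the abstract, means that vowels with higher F2 (more fronted) sit at the low end of that perceptual dimension.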

Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants (한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석)

Choi, Soon-Ai;Seong, Cheol-Jae
- MALSORI / no.60 / pp.67-84 / 2006
The purpose of this study is to find acoustic parameters in the frequency domain that distinguish the Korean nasals /m, n, ŋ/ from one another. The new parameters are devised on the basis of the LTAS (Long Term Average Spectrum). The maximum peak amplitude and the corresponding formant frequency are measured in the low and high frequency ranges, respectively. The frequency of the spectral valley and its energy level are also obtained within a specific frequency range of the spectrum. Spectral slope, total energy in a specific frequency range, and statistics of the spectral energy distribution such as centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across the nasals are summarized as follows: 1) in syllable-initial position, the total energy from 1,500 to 2,200 Hz (zeroENG); 2) in syllable-final position, the peak amplitude of the first formant (peak1_a), the formant frequency with the maximum peak amplitude from 4,000 to 8,000 Hz (peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz (peak2_a), and the total energy from 1,500 to 2,200 Hz (zeroENG).
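As a rough illustration of two of the parameter families named above, the sketch below computes the total energy in a frequency band and the power-weighted centroid, skewness, and kurtosis of a spectrum. The function names and the three-bin spectrum are hypothetical, and the definitions used (power-weighted moments, non-excess kurtosis) are an assumption; the abstract does not state the paper's exact formulas.

```python
def band_energy(freqs, power, lo, hi):
    """Sum of spectral power over bins whose frequency lies in [lo, hi] Hz."""
    return sum(p for f, p in zip(freqs, power) if lo <= f <= hi)

def spectral_moments(freqs, power):
    """Power-weighted centroid (Hz), skewness, and non-excess kurtosis."""
    total = sum(power)
    centroid = sum(f * p for f, p in zip(freqs, power)) / total
    # Central moments of frequency, weighted by power.
    m2 = sum((f - centroid) ** 2 * p for f, p in zip(freqs, power)) / total
    m3 = sum((f - centroid) ** 3 * p for f, p in zip(freqs, power)) / total
    m4 = sum((f - centroid) ** 4 * p for f, p in zip(freqs, power)) / total
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2
    return centroid, skewness, kurtosis

# Hypothetical three-bin spectrum, symmetric around 2,000 Hz.
freqs = [1000.0, 2000.0, 3000.0]
power = [1.0, 2.0, 1.0]

zero_eng = band_energy(freqs, power, 1500, 2200)  # band taken from the abstract
centroid, skewness, kurtosis = spectral_moments(freqs, power)
print(zero_eng, centroid, skewness, kurtosis)
```

In practice the inputs would be the bins of a long-term average spectrum rather than three hand-picked values; the symmetric toy spectrum is used only because its centroid and zero skewness are easy to verify by hand.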

Acoustic Comparisons of Vowel and Plosive Productions between the Normal and the Hearing-Impaired Children (청각장애아동과 건청아동의 모음 및 파열음 산출의 음향음성학적 특성 비교)

Oh, Y.J.;Zhi, M.Z.;Kim, Y.T.
- Speech Sciences / v.7 no.2 / pp.51-70 / 2000
Twenty normal-hearing and 20 severe-to-profound hearing-impaired subjects participated in the present study. The two groups were matched by chronological age. Each subject recorded the three vowels /i/, /a/, and /u/ and nine V-C(plosive)-V (hereafter, VCV) disyllables, /epe/, /ep'e/, /epʰe/, /ete/, /et'e/, /etʰe/, /eke/, /ek'e/, and /ekʰe/, five times each. The formant frequencies F1, F2, and F3 were measured for the three vowels, and six measures were made for the nine disyllables. The six measures were (1) the total duration of the disyllable, (2) the duration of the first vowel, (3) the duration of the closure period, (4) the ratio of the first vowel duration to the sum of the first vowel duration and the closure period of the consonant, (5) the duration of the aspiration, and (6) the duration of the second vowel. The results show that the three formants and each of the measures differed significantly between the two groups of subjects.

A Comparative Study of Relative Distances among English Front Vowels Produced by Korean and American Speakers (한국인과 미국인이 발화한 영어전설모음의 상대적 거리 비교)

Yang, Byunggon
- Phonetics and Speech Sciences / v.5 no.4 / pp.99-107 / 2013
The purpose of this study is to examine the relative distances among English front vowels in a message produced by 47 Korean and American speakers, in order to improve instruction in English vowel pronunciation for Korean learners of English. A Praat script was developed to collect the first and second formant values (F1 and F2) of eight words in each sound file, which was recorded from an internet speech archive. The Euclidean distances were then measured for three vowel pairs: [i-ɛ], [i-ɪ], and [ɛ-æ]. The first vowel pair, [i-ɛ], was set as the reference against which the relative distances of the other two vowel pairs were measured in percent, so that vowel sounds could be compared among speakers with different vocal tract lengths. The results show that the F1 values of the front vowels produced by the Korean and American speakers increased from the high front vowel to the low front vowel, with differences among the groups. The Korean speakers generally produced the front vowels with smaller jaw openings than the American speakers did. Second, the relative distance of the high front vowel pair [i-ɪ] differed significantly between the Korean and American speakers, while that of the low front vowel pair [ɛ-æ] did not. Finally, the Korean speakers at the higher proficiency level produced front vowels with higher F1 values than those at the lower proficiency level. The author concluded that Korean speakers should produce the high front vowels distinctively by securing sufficient relative distance between the formant values. Further studies would be desirable to examine how strongly the Korean speakers' English proficiency correlates with the relative distance of target words of comparable productions.
https://doi.org/10.13064/KSSS.2013.5.4.099
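The relative-distance measure described in this abstract can be sketched as follows: compute the Euclidean distance between two vowels in the F1-F2 plane, then express the [i-ɪ] and [ɛ-æ] distances as a percentage of the [i-ɛ] reference distance. The function names and the formant values are hypothetical; the study itself used a Praat script, and whether it measured distances in Hz or on another scale is not stated in the abstract.

```python
import math

def euclidean(v1, v2):
    """Distance between two vowels given as (F1, F2) pairs in Hz."""
    return math.hypot(v1[0] - v2[0], v1[1] - v2[1])

def relative_distances(vowels):
    """Distances of [i-ɪ] and [ɛ-æ] as percentages of the [i-ɛ] reference."""
    reference = euclidean(vowels["i"], vowels["ɛ"])
    return {
        "i-ɪ": 100.0 * euclidean(vowels["i"], vowels["ɪ"]) / reference,
        "ɛ-æ": 100.0 * euclidean(vowels["ɛ"], vowels["æ"]) / reference,
    }

# Hypothetical (F1, F2) values in Hz for one speaker, chosen only for illustration.
speaker = {
    "i": (300, 2300),
    "ɪ": (420, 2140),
    "ɛ": (600, 1900),
    "æ": (780, 1660),
}

result = relative_distances(speaker)
print(result)
```

Dividing by each speaker's own [i-ɛ] distance is what makes the figures comparable across speakers with different vocal tract lengths, since a uniformly larger or smaller vowel space scales all three distances together.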

Comparative Study on the Acoustic Characteristics of the Korean Vowel /a/ before and after LMS (후두미세수술 전후 /아/의 음향적 특성 비교)

Hwang, Yeon-Sin;Seong, Cheol-Jae
- MALSORI / no.67 / pp.33-60 / 2008
The aim of this study is to show the differences in acoustic parameters between a pathological voice /a/ caused by a vocal polyp and a normal voice /a/ produced after LMS (Laryngeal Microscopic Surgery). It was expected that the two kinds of voice could be analyzed more effectively in terms of HNR in specific frequency bands than across all frequency bands. For this study, the voices of 10 patients were recorded before and after LMS and then processed in terms of four acoustic parameters. It was found that (a) frequency bands of 500 Hz in the range of 1,000 Hz to 4,000 Hz were very useful for obtaining HNR values; (b) frequency bands in the range of 1,248 Hz to 5,500 Hz on a log scale were very useful for obtaining HNR values; (c) F0 dropped after LMS, but not significantly; and (d) the bandwidth of the second formant (B2) decreased significantly after LMS, while that of the first formant (B1) decreased after LMS, but not significantly.

Characteristics of 2 to 4 year old Korean children's production of monophthongs and diphthongs (만 2-4세 한국 아동의 단모음과 이중모음 산출 특징)

Song, Inmi;Seong, Cheoljae
- Phonetics and Speech Sciences / v.10 no.1 / pp.65-74 / 2018
The purpose of this study is to investigate age-specific features of the production of monophthongs and diphthongs by children aged 2;1 to 4;1, through both auditory-perceptual analysis and acoustic analysis. The test material included {vowel + 'da'} items consisting of 7 monophthongs and 10 diphthongs, and meaningful words beginning with vowels. The percentage of correct vowels was used for the perceptual analysis, and Praat (5.2.12) was used for the acoustic analysis of variables related to monophthongs and diphthongs. The results of this study are as follows. First, the perceptual analysis showed that children aged 2;1 to 2;8 differed significantly in accuracy on both monophthongs and diphthongs from those aged 2;9 to 3;4 and those aged 3;5 to 4;1. Second, the acoustic analysis showed that the formants (F1 and F2) of monophthongs generally tended to decrease as age increased. In terms of the F2 differentiation slope and the regression slope, which were diphthong-related variables, the group aged 3;5 to 4;1 showed a large overall slope change.
https://doi.org/10.13064/KSSS.2018.10.1.065
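One plausible reading of the "regression slope" variable mentioned in this abstract is the least-squares slope of F2 over time across a diphthong. The sketch below computes that slope; the function name, the sampling times, and the F2 track are hypothetical, and the paper's actual definition (and its separate "differentiation slope") is not given in the abstract.

```python
def regression_slope(times, values):
    """Ordinary least-squares slope of values against times."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    numerator = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    denominator = sum((t - mean_t) ** 2 for t in times)
    return numerator / denominator

# Hypothetical F2 track (Hz) sampled every 50 ms across a falling diphthong glide.
times_s = [0.00, 0.05, 0.10, 0.15, 0.20]
f2_hz = [2200.0, 1900.0, 1600.0, 1300.0, 1000.0]

slope = regression_slope(times_s, f2_hz)
print(f"F2 slope: {slope:.1f} Hz/s")
```

Under this reading, a steeper slope (larger in absolute value) corresponds to a faster F2 transition between the two vowel targets of the diphthong.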
