DOI QR코드

DOI QR Code

A study on the voiceless plosives from the English and Korean spontaneous speech corpus

영어와 한국어 자연발화 음성 코퍼스에서의 무성 파열음 연구

  • Yoon, Kyuchul (Department of English Language & Literature, Yeungnam University)
  • 윤규철 (영남대학교 영어영문학과)
  • Received : 2019.10.16
  • Accepted : 2019.11.30
  • Published : 2019.12.31

Abstract

The purpose of this work was to examine the factors affecting the identities of the voiceless plosives, i.e. English [p, t, k] and Korean [ph, th, kh], from the spontaneous speech corpora. The factors were automatically extracted by a Praat script and the percent correctness of the discriminant analyses was incrementally assessed by increasing the number of factors used in predicting the identities of the plosives. The factors included the spectral moments and tilts of the plosive release bursts, the post-burst aspirations and the vowel onsets, the durations such as the closure durations and the voice onset times (VOTs), the locations within words and utterances and the identities of the following vowels. The results showed that as the number of factors increased up to five, so did the percent correctness of the analyses, resulting in 74.6% for English and 66.4% for Korean. However, the optimal number of factors for the maximum percent correctness was four, i.e. the spectral moments and tilts of the release bursts and the following vowels, the closure durations and the VOTs. This suggests that the identities of the voiceless plosives are mostly determined by their internal and vowel onset cues.

본 논문의 목적은 자연발화 음성 코퍼스를 대상으로 영어 무성 파열음 [p, t, k]과 한국어 격음 파열음 [ph, th, kh]의 조음위치 결정에 영향을 미치는 요인들을 살펴보는 것이다. 프랏 스크립트를 이용하여 요인들은 자동 추출하였고, 판별분석을 통해 요인의 수를 점차 증가시켜가면서 무성 파열음의 예측 정확도를 계산하였다. 분석에 사용된 요인들은 개방파열, 파열 후 기식음과 모음 시작 부분의 운동량과 스펙트럼 기울기, 폐쇄구간과 VOT, 단어와 발화 내 위치, 마지막으로 직후 모음의 종류 등이었다. 분석 결과에 따르면, 요인의 수가 다섯 개까지 증가하는 경우 예측정확도가 최대로 증가하여 영어는 74.6%, 한국어는 66.4%를 나타내었다. 그러나 사실상의 최대값에 도달하는 데는 네 개의 요인으로도 충분하였고, 이들은 개방파열과 직후 모음의 운동량과 스펙트럼 기울기, 폐쇄구간과 VOT였다. 이는 무성파열음의 조음위치가 자신의 내부 요인들과 직후 모음의 영향을 동시에 받는다는 것을 의미한다고 볼 수 있다.

Keywords

References

  1. Alwan, A. (1989). Perceptual cues for place of articulation for the voiced pharyngeal and uvular consonants. The Journal of the Acoustical Society of America, 86(2), 549-556. https://doi.org/10.1121/1.398234
  2. Auzou, P., Ozsancak, C., Morris, R. J., Jan, M., Eustache, F., & Hannequin, D. (2000). Voice onset time in aphasia, apraxia of speech and dysarthria: A review. Clinical Linguistics and Phonetics, 14(2), 131-150. https://doi.org/10.1080/026992000298878
  3. Blumstein, S. E., & Stevens, K. N. (1979). Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants. The Journal of the Acoustical Society of America, 66(4), 1001-1017. https://doi.org/10.1121/1.383319
  4. Boersma, P. (2002). Praat, a system for doing phonetics by computer. Glot International, 5(9/10), 341-345.
  5. Byrd, D. (1993). 54,000 American stops. UCLA Working Papers in Phonetics, 83, 97-115.
  6. Bonneau, A., Djezzar, L., & Laprie, Y. (1996). Perception of the place of articulation of French stop bursts. The Journal of the Acoustical Society of America, 100(1), 555-564. https://doi.org/10.1121/1.415866
  7. Crystal T. H., & House, A. S. (1988). The duration of American- English stop consonants: An overview. Journal of Phonetics, 16(3), 285-294. https://doi.org/10.1016/S0095-4470(19)30503-0
  8. Delattre, P. C., Liberman, A. M., & Cooper, F. S. (1955). Acoustic loci and transitional cues for consonants. The Journal of the Acoustical Society of America, 27(4), 769-773. https://doi.org/10.1121/1.1908024
  9. Forrest, K., Weismer, G., Milenkovic, P., & Dougall, R. N. (1988). Statistical analysis of word-initial voiceless obstruents: Preliminary data. The Journal of the Acoustical Society of America, 84(1), 115-123. https://doi.org/10.1121/1.396977
  10. Halle, M., Hughes, G. W., & Radley, J.-P. A. (1957). Acoustic properties of stop consonants. The Journal of the Acoustical Society of America, 29, 107-116. https://doi.org/10.1121/1.1908634
  11. Hwang, S., & Yoon, K. (2017). A study on the release burst spectra of the voiceless plosives from the English and Korean spontaneous speech corpus. Phonetics and Speech Sciences, 9(4), 27-34. https://doi.org/10.13064/KSSS.2017.9.1.027
  12. Kent, R. D., & Read, C. (2002). The acoustic analysis of speech (2nd ed.). Albany, NY: Singular Thomson Learning.
  13. Lee, Y., & Yoon, K. (2016). A study on the voice onset times of the Seoul corpus males in their twenties. Phonetics and Speech Sciences, 8(4), 1-8. https://doi.org/10.13064/KSSS.2016.8.4.001
  14. Pae, J., Shin, J., & Ko, D. H. (1999). Some acoustical aspects of Korean stops in various utterance positions: Focusing on their temporal characteristics. Korean Journal of Speech Sciences, 5(2), 139-159.
  15. Pitt, M. A., Dilley, L., Johnson, K., Kiesling, S., Raymond, W., Hume, E., & Fosler-Lussier, E. (2007). Buckeye Corpus of Conversational Speech (2nd release). Columbus, OH: Department of Psychology, Ohio State University. Retrieved from http://www.buckeyecorpus.osu.edu
  16. RStudio Team. (2015). RStudio: Integrated development for R [Computer software]. Boston, MA: RStudio. Retrieved from http://www.rstudio.com/ on March 31, 2016.
  17. Shin, J. (1997). Consonantal production and coarticulation in Korean (Doctoral dissertation). University of London, London, UK.
  18. Smits, R., ten Bosch, L., & Collier, R. (1996). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. I. Perception experiment. The Journal of the Acoustical Society of America, 100(6), 3582-3864.
  19. Stevens, K. N., & Blumstein, S. E. (1975). Quantal aspects of consonant production and perception: A study of retroflex stop consonants. Journal of Phonetics, 3(4), 215-233. https://doi.org/10.1016/S0095-4470(19)31431-7
  20. Stevens, K. N., & Blumstein, S. E. (1978). Invariant cues for place of articulation in stop consonants. The Journal of the Acoustical Society of America, 64(5), 1358-1368. https://doi.org/10.1121/1.382102
  21. Winitz, H., Scheib, M. E., & Reeds, J. A. (1972). Identification of stops and vowels for the burst portion of /p, t, k/ isolated from conversational speech. The Journal of the Acoustical Society of America, 51, 1309-1317. https://doi.org/10.1121/1.1912976
  22. Yao, Y. (2007). Closure duration and VOT of word-initial voiceless plosives in English spontaneous connected speech. UC Berkeley Phonology Lab Annual Report, 2007, 183-225.
  23. Yun, W., Yoon, K., Park, S., Lee, J., Cho, S., Kang, D., Byun, K., Hahn, H., & Kim, J. (2015). The Korean corpus of spontaneous speech. Phonetics and Speech Sciences, 7(2), 103-109. https://doi.org/10.13064/KSSS.2015.7.2.103
  24. Zue, V. W. (1976). Acoustic characteristics of stop consonants: A controlled study (Doctoral dissertation). Massachusetts Institute of Technology, Cambridge, MA.