DOI QR코드

DOI QR Code

Qualitative Classification of Voice Quality of Normal Speech and Derivation of its Correlation with Speech Features

정상 음성의 목소리 특성의 정성적 분류와 음성 특징과의 상관관계 도출

  • Received : 2014.02.05
  • Accepted : 2014.03.11
  • Published : 2014.03.31

Abstract

In this paper voice quality of normal speech is qualitatively classified by five components of breathy, creaky, rough, nasal, and thin/thick voice. To determine whether a correlation exists between a subjective measure of voice and an objective measure of voice, each voice is perceptually evaluated using the 1/2/3 scale by speech processing specialists and acoustically analyzed using speech analysis tools such as the Praat, MDVP, and VoiceSauce. The speech parameters include features related to speech source and vocal tract filter. Statistical analysis uses a two-independent-samples non-parametric test. Experimental results show that statistical analysis identified a significant correlation between the speech feature parameters and the components of voice quality.

Keywords

References

  1. Fex, S. (1992). Perceptual evaluation. Journal of Voice, 6(2), 155-158. https://doi.org/10.1016/S0892-1997(05)80130-4
  2. Sohn, J. H. (2008). GRBAS and voice handicap index. Journal of the Korean Society of Logopedics and Phoniatrics, 19(2), 89-95. (손진호 (2008). GRBAS 음성평가와 음성장애지수, 대한음성 언어의학회지, 19(2), 89-95.)
  3. Pyo, H. Y., Choi, S. H., Lim, S. E., Sim, H. S., Choi, H. S., Kim, K. M. (1999). The correlation between GRBAS scales and MDVP parameters on the pathologic voices of the patients with vocal polyps. Journal of the Korean Society of Logopedics and Phoniatrics, 10(2), 154-163. (표화영, 최성희, 임성은, 심현섭, 최홍식, 김광문 (1999). 성대 폴립 환자를 대상으로 한 GRBAS 척도와 MDVP 측정치 간 의 상관관계 연구. 대한음성언어의학회지, 10(2), 154-163.)
  4. Han, S. M., Kim, S. B., Kim, J. Y., Kwon, C. H. (2011). A preliminary study on correlation between voice characteristics and speech features. Phonetics and Speech Sciences, 3(4), 85-91. (한성만, 김상범, 김종열, 권철홍 (2011). 목소리 특성의 주관 적 평가와 음성 특징과의 상관관계 기초연구. 한국음성학회, 말소리와 음성과학, 3(4), 85-91.)
  5. Kwon, C. H., Song, S. K., Kim, J. Y., Kim, K. H., Jang, J. S. (2012). Extraction of speech features for emotion recognition. Phonetics and Speech Sciences, 4(2), 73-78. (권철홍, 송승규, 김종열, 김근호, 장준수 (2012). 감정 인식을 위한 음성 특징 도출. 한국음성학회, 말소리와 음성과학, 4(2), 73-78.) https://doi.org/10.13064/KSSS.2012.4.2.073
  6. Hwang, Y. S., Seong, C. J. (2008). A comparative study on the acoustic characteristics of the Korean vowel /a/ before and after LMS. Malsori, 67, 33-60. (황연신, 성철재 (2008). 후두미세수술 전후 /아/의 음향적 특성 비교. 대한음성학회, 말소리, 67, 33-60.)
  7. Hillenbrand, J. & Houde, R. A. (1996). Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. Journal of Speech and Hearing Research, 39, 311-321. https://doi.org/10.1044/jshr.3902.311
  8. Park, H. S. (2007). An acoustic study of phonation types in vowels following consonant clusters in Korean. Malsori, 64, 53-76. (박한상 (2007). 한국어 자음군의 후행모음에 나타난 발성유 형의 음향음성학적 연구. 대한음성학회, 말소, 64, 53-76.)
  9. Praat. (2014). Phonetic Sciences, Univ. of Amsterdam, http://www.fon.hum.uva.nl/praat/.
  10. MDVP. (2014). KayPentax, http://www.kayelemetrics.com.
  11. VoiceSauce. (2014). A program for voice analysis, UCLA, http://www.seas.ucla.edu/spapl/voicesauce/.
  12. Seo, E. H. (2010). Statistical analysis using SPSS 18.0. Free Academy, 403-415. (서의훈 (2010). SPSS 18.0을 이용한 통계분석, 자유아카데미, 403-415.)
  13. IBM SPSS Statistics. (2014). SPSS Korea, http://www.spss.co.kr.
  14. Bickley, C. (1982). Acoustic analysis and perception of breathy vowels. Speech Communication Group Working Papers I, 73-83.
  15. Klatt, D. H. & Klatt, L. C. (1990). Analysis, synthesis, and perception of voice quality variations among female and male speakers. Journal of the Acoustical Society of America, 87(2), 820-857. https://doi.org/10.1121/1.398894
  16. Mathew, M. M. & Bhat, J. S. (2009). Soft phonation index - a sensitive parameter?. Indian Journal of Otolaryngology Head Neck Surg., 61, 127-130. https://doi.org/10.1007/s12070-009-0050-4
  17. Hillenbrand, J., Cleveland, R. A., Erickson, R. L. (1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research, 37, 769-778. https://doi.org/10.1044/jshr.3704.769
  18. Kerstens, J., Ruys, E., Zwarts, J. (1996-2001). Lexicon of Linguistics. Utrecht Institute of Linguistics, Utrecht Univ.
  19. Dekrom, G. (1995). Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. Journal of Speech Hearing Res., 38, 794-811. https://doi.org/10.1044/jshr.3804.794

Cited by

  1. Measuring Correlation between Mental Fatigues and Speech Features vol.6, pp.2, 2014, https://doi.org/10.13064/KSSS.2014.6.2.003