DOI QR코드

DOI QR Code

Spectral and Cepstral Analyses of Esophageal Speakers

식도발성화자 음성의 spectral & cepstral 분석

  • 심희정 (한림대학교 대학원 언어병리청각학과) ;
  • 장효령 (한림대학교 대학원 언어병리청각학과) ;
  • 신희백 (한림대학교 대학원 언어병리청각학과) ;
  • 고도흥 (한림대학교 언어청각학부)
  • Received : 2014.04.28
  • Accepted : 2014.05.28
  • Published : 2014.06.30

Abstract

The purpose of this study was to analyze spectral versus cepstral measurements in esophageal speakers. The comparison between the measurements in thirteen male esophageal speakers was compared with the control group of thirteen normal speakers using the sustained vowel /a/. The main results can be summarized as below: (a) the CPP and L/H ratio of the esophageal group were significantly lower than those of the control group (b) the CPP was significantly correlated with the spectral parameters such as jitter, shimmer, NHR and VTI, and (c) the ROC analysis showed that the threshold of 10.25dB for the CPP achieved a good classification for esophageal speakers, with 100% perfect sensitivity and specificity. Thus, it was known that cepstral-based acoustic measures such as CPP, may be more reliable predictors than other spectral-based acoustic measures such as jitter and shimmer. And it was found that cepstral-based acoustic measures were effective in distinguishing esophageal voice quality from normal voice quality. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation with laryngectomees.

Keywords

References

  1. Website, http://www.cancer.go.kr/, September 6, 2013.
  2. Jang, H. R., Shim, H. J., & Ko, D. H. (2013). Acoustic Characteristics of Patients with Total Laryngectomees via Voice Rehabilitation Techniques. Phonetics and Speech Sciences, 5(4), 25-32. (장효령, 심희정, 고도흥 (2013). 후두적출술 환자의 발성법에 따른 음향학적 특성. 말소리와 음성과학, 5(4), 25-32.) https://doi.org/10.13064/KSSS.2013.5.4.025
  3. Debruyne, F., Delaere, P., Wouters, J., & Uwents, P. (1994). Acoustic analysis of tracheo-oesophageal versus oesophageal speech. Journal of Laryngology and Otology, 108, 325-328.
  4. Most, T., Tobin, Y., & Mimran, R. C. (2000). Acoustical and perceptual characteristics of esophageal tracheoesophageal speech production. Journal of Communication Disorder, 33(2), 165-181. https://doi.org/10.1016/S0021-9924(99)00030-1
  5. Choi, S. H., Choi, H. S., Kim, H. S., Lee, S. E., & Pyo, H. Y. (2003). Analysis of acoustic characteristics of vowel and consonants production study on speech proficiency in esophageal speech. Speech Sciences, 10(3), 7-27. (최성희, 최홍식, 김한수, 임성은, 이성은, 표화영 (2003). 식도발성의 숙련 정도에 따른 모음의 음향학적 특징과 자음 산출에 대한 연구. 음성과학, 10(3), 7-27.)
  6. Kang, S. K. Aerodynamic analysis of esophageal speech of the laryngectomees. Ph.D. dissertation Taegu University. (강수균 (2002). 후두적출자의 식도발성시 기류역학적 연구, 대구대학교 박사학위 논문.)
  7. Pyo, H., Choi, H., Lim, S., & Choi, S. (1999). The comparison of the acoustic and aerodynamic characteristics of provox voice and esophageal voice produced by the same laryngectomee. Speech Sciences, 5(1), 121-139. (표화영, 최홍식, 임성은, 최성희. (1999). 동일 후적자가 산출하는 기관식도 발성과 식도 발성에 대한 음향학적 및 공기역학적 특성 비교. 음성과학, 5(1), 121-139.)
  8. Blood, G. W. (1984). Fundamental frequency and intensity measurements in laryngeal and alaryngeal speakers. Journal of Communication Disorders, 17, 319-324. https://doi.org/10.1016/0021-9924(84)90034-0
  9. Stajner-Katusic, S., Horga, D., Musura, M., & Globlek, D. (2006). Voice and speech after laryngectomy. Clinical Linguistics & Phonetics, 20(2/3), 195-203. https://doi.org/10.1080/02699200400026975
  10. Maccallum, J., Cai, L., Zhou, L., Zhang, Y., & Jiang, J. (2007). Acoustic analysis of aperiodic voice: perturbation and nonlinear dynamic properties in esophageal phonation. Journal of Voice, 23(3), 283-290.
  11. Moers, C., Mobius, B., Rosanowski, F., Noth, E., Eysholdt, U., & Haderlein, T. (2012). Vowel- and text-based cepstral analysis of chronic hoarseness. Journal of Voice, 26(4), 416-424. https://doi.org/10.1016/j.jvoice.2011.05.001
  12. Noll, A. M. (1967). Cepstrum pitch determination. Journal of the Acoustic Society of America, 41, 293-309. https://doi.org/10.1121/1.1910339
  13. Watts, C. R., & Awan, S. N. (2011). Use of spectral/cepstral analyses for differentiating normal from hypofunctional voices in sustained vowel and continuous speech contexts. Journal of Speech, Language, and Hearing Research, 54(6), 1525-1537. https://doi.org/10.1044/1092-4388(2011/10-0209)
  14. Lowell, S. Y., Kelley, R. T., Awan, S. N., Colton, R. H., & Chan, N. H. (2012). Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality. Annals of Otology, Rhinology and Laryngology, 121(8), 539-548. https://doi.org/10.1177/000348941212100808
  15. Heman-Ackah, T., Michael, D., & Goding, G. (2002). The relationship between cepstral peak prominence and selected parameters of dysphonia, Journal of Voice, 16(1), 20-27. https://doi.org/10.1016/S0892-1997(02)00067-X
  16. Kumar, B., Bhat, J., & Prasad, N. (2009). Cepstral analysis of voice in persons with vocal nudules. Journal of Voice, 24(6), 651-653.
  17. Awan, S., Roy, N., & Dromey, C. (2009). Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model. Clinical linguistics & phonetics, 23(11), 825-841. https://doi.org/10.3109/02699200903242988
  18. Hillenbrand, J., Cleveland, R., & Erickson, R. (1994). Acoustic Correlates of Breathy Vocal Quality. Journal of Speech, Language, and Hearing Research, 37, 769-778. https://doi.org/10.1044/jshr.3704.769
  19. Seo, I., & Seong, C. (2013). Voice quality of dysarthric speakers in connected speech. Phonetics and Speech Sciences, 5(4), 33-41. (서인효, 성철재 (2013). 연결발화에서 마비말화자의 음질 특성. 말소리와 음성과학, 5(4), 33-41.) https://doi.org/10.13064/KSSS.2013.5.4.033
  20. Wolfe, V, Fitch J, & Cornell R. (1995). Acoustic prediction of severity in commonly occurring voice problems. Journal of Speech, Language & Hearing Research, 38, 273-279. https://doi.org/10.1044/jshr.3802.273
  21. Shim, H., Jang, H., & Ko, D. (2013). Speech Intelligibility and Vowel Space Characteristics of Alaryngeal Speech. Phonetics and Speech Sciences, 5(4), 17-24. (심희정, 장효령, 고도흥 (2013). 무후두음성의 말 명료도와 모음 공간 특성. 말소리와 음성과학, 5(4), 17-24.) https://doi.org/10.13064/KSSS.2013.5.4.017
  22. Timo, L. (2009). Long term average spectrum in screening of voice quality in speech: untrained male university students, Journal of Voice, Vol. 23, No. 6, 671-676. https://doi.org/10.1016/j.jvoice.2008.03.008
  23. Kent, R. D., & Read, C. (2002). Acoustic Analysis of speech(2nd ed). Singular Thomson Learning.
  24. Hillenbrand, J., & Houde, R. (1996). Acoustic correlates of breathy vocal quality dysphonic voices and continuous speech. Journal of Speech, Language, and Hearing Research, 39, 311-321. https://doi.org/10.1044/jshr.3902.311
  25. Awan, S., & Roy, N. (2009). Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. Journal of Speech, Language and Hearing Research, 52(2), 482-499. https://doi.org/10.1044/1092-4388(2008/08-0034)
  26. Lee, M. (2011). A Study of Acoustic Measurement in Connected Speech with Dysphonia. Phonetics and Speech Sciences, 3(4), 109-115. (이명순 (2011). 음성장애 연속구어의 음향학적 분석. 말소리와 음성과학, 3(4), 109-115.)

Cited by

  1. The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing vol.6, pp.3, 2014, https://doi.org/10.13064/KSSS.2014.6.3.063
  2. Usefulness of Cepstral Acoustic Index for Estimating Objective Dysphonia Severity vol.22, pp.3, 2017, https://doi.org/10.12963/csd.17411