DOI QR코드

DOI QR Code

The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing

음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성

  • 최성희 (대구가톨릭대학교 언어청각치료학과) ;
  • 최철희 (대구가톨릭대학교 언어청각치료학과)
  • Received : 2014.08.18
  • Accepted : 2014.09.07
  • Published : 2014.09.30

Abstract

The current study assessed the utility of acoustic analyses the most commonly used in routine clinical voice assessment including perturbation, nonlinear dynamic analysis, and Spectral/Cepstrum analysis based on signal typing of dysphonic voices and investigated their applicability of clinical acoustic analysis methods. A total of 70 dysphonic voice samples were classified with signal typing using narrowband spectrogram. Traditional parameters of %jitter, %shimmer, and signal-to-noise ratio were calculated for the signals using TF32 and correlation dimension(D2) of nonlinear dynamic parameter and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV(Analysis of Dysphonia in Speech and VoiceTM). Auditory perceptual analysis was performed by two blinded speech-language pathologists with GRBAS. The results showed that nearly periodic Type 1 signals were all functional dysphonia and Type 4 signals were comprised of neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant signal typing-related differences were found in all acoustic and auditory-perceptual measures. SNR, CPP, L/H ratio values for Type 4 were significantly lower than those of other voice signals and significant higher %jitter, %shimmer were observed in Type 4 voice signals(p<.001). Additionally, with increase of signal type, D2 values significantly increased and more complex and nonlinear patterns were represented. Nevertheless, voice signals with highly noise component associated with breathiness were not able to obtain D2. In particular, CPP, was highly sensitive with voice quality 'G', 'R', 'B' than any other acoustic measures. Thus, Spectral and cepstral analyses may be applied for more severe dysphonic voices such as Type 4 signals and CPP can be more accurate and predictive acoustic marker in measuring voice quality and severity in dysphonia.

Keywords

References

  1. Awan, S.N., & Roy, N. (2005). Acoustic prediction of voice type in women with functional dysphonia. Journal of Voice, 19(2), 268-82. https://doi.org/10.1016/j.jvoice.2004.03.005
  2. Awan, S.N., Roy, N., & Dromey, C. (2009). Estimating dysphonia severity in continuous speech: application of a multi-parameter spectral/cepstral model. Clinical Linguistics & Phonetics, 23(11), 825-841. https://doi.org/10.3109/02699200903242988
  3. Awan, S.N., Roy, N., Jette, M.E., Meltzner, G.S., & Hillman, R.E. (2010). Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparison with auditory-perceptual judgments from the CAPE-V. Clinical Linguistics & Phonetics, 24(9), 742-758. https://doi.org/10.3109/02699206.2010.492446
  4. Balasubramanium, R. K., Bhat, J. S., Fahim, I. S., & Raju, I.R. (2011). Cepstral analysis of voice in unilateral adductor vocal fold palsy. Journal of Voice, 25(3), 326-329. https://doi.org/10.1016/j.jvoice.2009.12.010
  5. Carding, P.N., Steen, I.N., Webb, A., MacKenzie, K., Deary, I.J., & Wilson, J.A. (2004). The reliability and sensitivity to change of acoustic measures of voice quality. Clinical Otolaryngology and allied Sciences, 29(5), 538-44. https://doi.org/10.1111/j.1365-2273.2004.00846.x
  6. Choi, S. H., Zhang, Y., Jiang, J.J., Bless, D.M., & Welham, N.V. (2012). Nonlinear dynamic-based analysis of severe dysphonia in patients with vocal fold scar and sulcus vocalis. Journal of Voice, 26(5), 566-576. https://doi.org/10.1016/j.jvoice.2011.09.006
  7. Heman-Ackah, Y.D., Heuer, R.J., Michael, D.D., Ostrowski, R., Horman, M., Baroody, M.M., & Sataloff, R.T. (2003). Cepstral peak prominence: a more reliable measure of dysphonia. The Annals of Otology, Rhinology, and Laryngology, 112(4), 324-333. https://doi.org/10.1177/000348940311200406
  8. Jiang, J.J., Zhang, Y., & McGilligan, C. (2006). Chaos in voice, from modeling to measurement. Journal of Voice, 20, 2-17. https://doi.org/10.1016/j.jvoice.2005.01.001
  9. Kang, Y., & Seong, C. (2012). A cepstral analysis of breathy voice with vocal fold paralysis. Journal of the Korean Society of Speech Sciences 4(2), 89-94. (강영애.성철재(2012). 성대마비로 인한 기식 음성에 대한 Cepstral 분석. 말소리와 음성과학, 4(2), 89-94). https://doi.org/10.13064/KSSS.2012.4.2.089
  10. Maryn, Y., Roy, N., De Bodt M., Van Cauwenberge, P., & Corthals, P. (2009). Acoustic measurement of overall voice quality: a meta-analysis. Journal of the Acoustical Society of America. 126(5), 2619-2634. https://doi.org/10.1121/1.3224706
  11. Radish, K. B., Bhat, J. S., & Prasad, N. (2010). Cepstral analysis of voice in persons with vocal nodules. Journal of Voice, 24(6), 651-653. https://doi.org/10.1016/j.jvoice.2009.07.008
  12. Seo, I., & Seong, C. (2013). Voice quality of dysarthric speakers in connected speech. Journal of the Korean Society of Speech Sciences, 5(4), 33-41. (서인효.성철재(2013). 연결발화에서 마비말화자의 음질 특성. 말소리와 음성과학, 5(4), 33-41.) https://doi.org/10.13064/KSSS.2013.5.4.033
  13. Shim, H. J., Jang, H. R., Shin, H. B., & Ko, D.H. (2014). Spectral and Cepstral analyses of Esophageal Speakers. Journal of the Korean Society of Speech Sciences, 6(2), 47-54. (심희정, 장효령, 신희백, 고도흥(2014). 식도발성화자 음성의 spectral & cepstrum 분석. 말소리와 음성과학, 6(2), 47-54.) https://doi.org/10.13064/KSSS.2014.6.2.047
  14. Sprecher, A., Olszewski, A., Jiang, J.J., & Zhang, Y. (2010). Updating signal typing in voice: addition of type 4 signals. Journal of Acoustical Society of America, 127(6), 3710-16. https://doi.org/10.1121/1.3397477
  15. Titze, I.R. (1995). Workshop on acoustic voice analysis: Summary statement. National Center for Voice and Speech; Iowa City, IA.
  16. Yang, A.C., Wang, S.J., Lai, K.L., Tsai, C.F., Yang, C.H., Hwang, J.P., Lo, M.T., Huang, N.E., Peng, C.K., & Fuh, J.L. (2013). Cognitive and neuropsychiatric correlates of EEG dynamic complexity in patients with Alzheimer's disease. Progress in neuro-psychopharmacology, 47, 52-61. https://doi.org/10.1016/j.pnpbp.2013.07.022
  17. Zhang, Y., & Jiang, J.J. (2008). Acoustic analyses of sustained and running voices from patients with laryngeal pathologies. Journal of Voice, 22, 1-9. https://doi.org/10.1016/j.jvoice.2006.08.003

Cited by

  1. Usefulness of Cepstral Acoustic Index for Estimating Objective Dysphonia Severity vol.22, pp.3, 2017, https://doi.org/10.12963/csd.17411
  2. A Comparison of Cepstral and Spectral Measures according to Measurement Position in a Reading Passage vol.22, pp.4, 2017, https://doi.org/10.12963/csd.17433
  3. Acoustic and Auditory-Perceptual Evaluation as Predictor of Voice Recovery after Laryngeal Microsurgery in Patients with Vocal Polyp vol.61, pp.7, 2018, https://doi.org/10.3342/kjorl-hns.2018.00024
  4. Predicting Normal and Pathological Voice using a Cepstral Based Acoustic Index in Sustained Vowels versus Connected Speech vol.23, pp.4, 2018, https://doi.org/10.12963/csd.18550