DOI QR코드

DOI QR Code

A comparison of CPP analysis among breathiness ranks

기식 등급에 따른 CPP (Cepstral Peak Prominence) 분석 비교

  • Received : 2014.12.07
  • Accepted : 2015.03.18
  • Published : 2015.03.31

Abstract

The aim of this study is to synthesize pathological breathy voice and to make a cepstral peak prominence (CPP) table following breathiness ranks by cepstral analysis to supplement reliability of the perceptual auditory judgment task. KlattGrid synthesizer included in Praat was used. Synthesis parameters consist of two groups, i.e., constants and variables. Constant parameters are pitch, amplitude, flutter, open phase, oral formant and bandwidth. Variable parameters are breathiness (BR), aspiration amplitude (AH), and spectral tilt (TL). Five hundred sixty samples of synthetic breathy vowel /a/ for male were created. Three raters participated in ranking of the breathiness. 217 were proved to be inadequate samples from perceptual judgment and cepstral analysis. Finally, 343 samples were selected. These CPP values and other related parameters from cepstral analysis are classified under four breathiness ranks (B0~B3). The mean and standard deviation of CPP is $16.10{\pm}1.15$ dB(B0), $13.68{\pm}1.34$ dB(B1), $10.97{\pm}1.41$ dB(B2), and $3.03{\pm}4.07$ dB(B3). The value of CPP decreases toward the severe group of breathiness because there is a lot of noise and a small quantity of harmonics.

Keywords

References

  1. Awan S. N., & Roy N. (2006). Toward the development of an objective index of dysphonia severity: a four factor acoustic model. Clinical Linguistic & Phonetics. 20, 1, 35-49. https://doi.org/10.1080/02699200400008353
  2. Awan S. N., & Roy N. (2009). Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. Journal of Speech, Language & Hearing Research. 52, 2, 482-499.
  3. Bangayan P., Long C., Alwan A., Kreiman J., & Gerratt B. (1997). Analysis by synthesis of pathological voices using the Klatt synthesizer. Speech communication. 22, 4, 343-368.
  4. Choi S. H.(2013). Speech-language pathologists' voice assessment and voice therapy practices: a survey for standard clinical guideline and evidence-based practice. Communication Sciences & Disorders. 18, 4, 473-485. (최성희(2013). 음성언어재활사의 음성평가와 음성치료의 임상실제: 표준 임상지침과 증거기반중재를 위한 설문조사. 언어청각장애연구. 18, 4, 473-485.)
  5. Eadie T. L., & Baylor C. R.(2006). The effect of perceptual training on inexperienced listener's judgments of dysphonic voice. Journal of Voice. 20, 4, 527-544. https://doi.org/10.1016/j.jvoice.2005.08.007
  6. Gerratt B. R., & Kreiman, J.(2001). Measuring vocal quality with speech synthesis. Journal of the Acoustical Society of America. 110, 5, 2560-2566. https://doi.org/10.1121/1.1409969
  7. Huang D. Z.(1999). Relationship between acoustic measures of voice and judgments of voice quality. http://www.drspeech.com/Paper.html#Relationship
  8. Hillenbrand J., Cleveland R. A., & Erickson R. L.(1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research. 37, 769-778. https://doi.org/10.1044/jshr.3704.769
  9. Kang Y. A., Yoon K. C., & Kim J. O.(2012a). An aerodynamic and acoustic analysis of the breathy voice of the patients with thyroidectomy. Journal of the Korean Society of speech Science. 4, 2, 95-104. (강영애, 윤규철, 김재옥(2012a). 갑상선 수술 후 성대마비 환자의 기식 음성에 대한 공기역학적 및 음향적 분석. 말소리와 음성과학. 4, 2, 95-104.)
  10. Kang Y. A. & Seong C. J.(2012b). A Cepstral analysis of breathy voice with vocal fold paralysis. Journal of the Korean Society of speech Science. 4, 2, 89-94. (강영애, 성철재(2012b). 성대마비로 인한 기식 음성에 대한 Cepstral 분석. 말소리와 음성과학. 4, 2, 89-94.) https://doi.org/10.13064/KSSS.2012.4.2.089
  11. Klatt D. H., & Klatt L. C.(1990). Analysis, synthesis, and perception of voice quality variations among female and male talkers. Journal of the Acoustical Society of America. 87, 820-857. https://doi.org/10.1121/1.398894
  12. Kent R. D., & Read C.(2002). Acoustic analysis fo speech, 2nd ed., United States: Singular.
  13. Kent R. D.(1996). Hearing and believing: Some limits to the auditory-perceptual assessment of speech and voice disorders. Journal of Speech Language Pathology of America. 5, 7-23. https://doi.org/10.1044/1058-0360.0503.07
  14. Ma E. & Yiu E.(2006). Multiparametric evaluation of dysphonic severity. Journal of Voice. 20, 3, 380-390. https://doi.org/10.1016/j.jvoice.2005.04.007
  15. Mehta D. & Quatieri, T. F.(2005). Synthesis analysis, and pitch modification of the breathy vowel. 2005 IEEE workshop on applications of signal processing to audio and acoustics.
  16. Shrivastav R., Camacho A., Patel S., & Eddins D. A.(2011). A model for the prediction of breathiness in vowels. Journal of the Acoustical Society of America. 129, 3, 1605-1615. https://doi.org/10.1121/1.3543993
  17. Weenink D.(2009). The KlattGrid speech synthesizer, Proceedings of Interspeech2009. 10, 2059-2062, Brighton, United Kingdom, September 6-10.

Cited by

  1. A Comparison of Cepstral and Spectral Measures according to Measurement Position in a Reading Passage vol.22, pp.4, 2017, https://doi.org/10.12963/csd.17433