DOI QR코드

DOI QR Code

Statistical analysis on long-term change of jitter component on continuous speech signal

음성신호의 Jitter 성분의 장시간 변화에 관한 통계적 분석

  • Jo, Cheolwoo (School of Electrical, Electronics and Control Engineering, Changwon National University)
  • 조철우 (창원대학교 전기전자제어공학부)
  • Received : 2020.10.18
  • Accepted : 2020.12.17
  • Published : 2020.12.31

Abstract

In this study, a method for measuring the jitter component in continuous speech is presented. In the conventional jitter measurement method, pitch variabilities are commonly measured from the sustained vowels. In the case of continuous speech, such as a spoken sentence, distortion occurs with the existing measurement method owing to the influence of prosody information according to the sentence. Therefore, we propose a method to reduce the pitch fluctuations of prosody information in continuous speech. To remove this pitch fluctuation component, a curve representing the fluctuation is obtained via polynomial interpolation for the pitch track in the analysis interval, and the shift is removed according to the curve. Subsequently, the variability of the pitch frequency is obtained by a method of measuring jitter from the trajectory of the pitch from which the shift is removed. To measure the effects of the proposed method, parameter values before and after the operations are compared using samples from the Kay Pentax MEEI database. The statistical analysis of the experimental results showed that jitter components from the continuous speech can be measured effectively by proposed method and the values are comparable to the parameters of sustained vowel from the same speaker.

본 연구에서는 연속음성에서의 장시간 jitter 성분 측정 방법에 대해 고찰하였다. 기존의 jitter측정방법으로는 지속 발성한 모음을 대상으로 변동성을 측정하는 방법을 주로 사용하여왔다. 문장음성 등 연속음성의 경우는 문장에 따른 운율정보의 영향으로 기존의 측정법으로는 왜곡이 발생하게 된다. 이에 연속 발성에 대해 운율정보의 피치 변동을 상쇄시키는 방법을 제안하고자 한다. 피치 변동을 제거하는 방법으로는 분석구간내에서의 피치 변동을 다항식 보간법에 의해 변동 경향을 대표하는 곡선을 구하고 그 곡선을 기준으로 변이를 제거하였다. 이후 변이가 제거된 피치의 궤적으로부터 jitter를 측정하는 방법을 적용하여 피치 주파수의 변동성을 측정하고 기존의 지속모음에 의한 측정 방법과 비교하였다. 제안한 방법의 효용성 측정을 위해 Kay Pentax MEEI DB의 음성 표본을 사용하였다. 통계분석 결과 제안된 방법에 의해 연속음성으로부터 측정한 jitter 값은 동일 화자의 지속모음으로부터 측정한 파라미터 값과 유사한 변동성을 보여 주었다.

Keywords

References

  1. Fourcin, A. (2009). Aspects of voice irregularity measurement in connected speech. Folia Phoniatrica et Logopaedica, 61(3), 126-136. https://doi.org/10.1159/000219948
  2. Gerratt, B. R., Kreiman, J., & Garellek, M. (2016). Comparing measures of voice quality from sustained phonation and continuous speech. Journal of Spech, Language, and Hearing Research, 59(5), 994-1001. https://doi.org/10.1044/2016_JSLHR-S-15-0307
  3. Kisenwether, J. S, & Prosek, R. A. (2017). Long-term and short-term period and amplitude perturbation measurements: Are they all needed? Annals of Otolaryngology and Rhinology, 4(4), 1172.
  4. MacCallum, J.K., Zhang, Y., Jiang, J.J. (2011). Vowel selection and its effects on perturbation and nonlinear dynamic measures. Folia Phoniatrica et Logopaedica, 63(2), 88-97. https://doi.org/10.1159/000319786
  5. Manfredi, C., Giordano, A., Schoentgen, J., Fraj, S., Bocchi, L., & Bejonchkere, P. H. (2011). Perturbation measurements in highly irregular voice signals: Performances/validity of analysis software tools. Biomedical Signal Processing and Control, 7, 409-416. https://doi.org/10.1016/j.bspc.2011.06.004
  6. Massachusetts Eye and Ear Infirmary. (1994). Voice disorders database (Version 1.03 CD-ROM). Lincoln Park, NJ: Kay Elemetrics.
  7. Schoentgen, J. (1989). Jitter in sustained vowels and isolated sentences produced by dysphonic speakers. Speech Communication, 8(1), 61-79. https://doi.org/10.1016/0167-6393(89)90068-X
  8. Schoentgen, J., & Guchteneere, R. (1995). Time series analysis of Jitter. Journal of Phonetics, 23(1), 189-201. https://doi.org/10.1016/S0095-4470(95)80042-5
  9. Teixeira, J.P., & Golcalves, A. (2014). Accuracy of jitter and shimmer measurements. Procedia Technology, 16, 1190-1199. https://doi.org/10.1016/j.protcy.2014.10.134
  10. Vasilakis, M., & Stylianou, Y. (2009). Voice pathology detection based on short-term jitter estimations in running speech. Folia Phoniatrica et Logopaedica, 61(3), 153-170. https://doi.org/10.1159/000219951