Research on Construction of the Korean Speech Corpus in Patient with Velopharyngeal Insufficiency

  • Lee, Ji-Eun (Department of Otorhinolaryngology, Chosun University College of Medicine) ;
  • Kim, Wook-Eun (Department of Biomedical Engineering, Seoul National University College of Medicine) ;
  • Kim, Kwang Hyun (Department of Otorhinolaryngology, Seoul National University College of Medicine) ;
  • Sung, Myung-Whun (Department of Otorhinolaryngology, Seoul National University College of Medicine) ;
  • Kwon, Tack-Kyun (Department of Otorhinolaryngology, Seoul National University College of Medicine)
  • Received : 2012.06.15
  • Accepted : 2012.07.26
  • Published : 2012.08.25

Abstract

Background and Objectives: We aimed to develop a Korean version of the velopharyngeal insufficiency (VPI) speech corpus system.

Subjects and Method: After developing a 3-channel simultaneous speech recording device capable of recording nasal, oral, and normal compound speech separately, we collected voice data from VPI patients older than 10 years, with or without a history of surgery or prior speech therapy. These data were compared with those of a control group in which VPI was simulated by inserting a 3-French Nelaton tube through both nostrils into the nasopharynx and pulling the soft palate anteriorly to varying degrees. Three transcribers took part: a speech therapist transcribed each voice file into text, a second transcriber graded speech intelligibility and severity, and a third tagged the types and onset times of misarticulation. The database was composed of three main tables covering (1) speaker demographics, (2) the condition of the recording system, and (3) transcripts. All of these were interfaced with the Praat voice analysis program, which enables the user to extract the exact transcribed phrases for analysis.

Results: In the simulated VPI group, the nasalance score increased with the severity of VPI. In addition, we could verify the vocal energy that characterizes hypernasality and compensation in the nasal, oral, and compound sounds spoken by VPI patients, as opposed to that which characterizes the normal control group.

Conclusion: With the Korean version of the VPI speech corpus system, patients' common difficulties and speech tendencies in articulation can be objectively evaluated. By comparing these data with those of normal voices, the mispronunciation and dysarticulation of patients with VPI can be corrected.
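The three-table layout described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual schema, so every table name, column name, and value range below is a hypothetical assumption chosen for illustration, as is the use of SQLite. The `offset_s` column in particular is an added assumption, since the abstract mentions onset times only.

```python
import sqlite3

# Hypothetical schema: all table and column names are illustrative
# assumptions, not the authors' actual database design.
SCHEMA = """
CREATE TABLE speaker (
    speaker_id         INTEGER PRIMARY KEY,
    age                INTEGER,
    sex                TEXT,
    grp                TEXT CHECK (grp IN ('patient', 'simulated_control')),
    had_surgery        INTEGER,  -- 0 or 1
    had_speech_therapy INTEGER   -- 0 or 1
);
CREATE TABLE recording (
    recording_id   INTEGER PRIMARY KEY,
    speaker_id     INTEGER NOT NULL REFERENCES speaker(speaker_id),
    channel        TEXT CHECK (channel IN ('nasal', 'oral', 'compound')),
    sample_rate_hz INTEGER,
    wav_path       TEXT
);
CREATE TABLE transcript (
    transcript_id        INTEGER PRIMARY KEY,
    recording_id         INTEGER NOT NULL REFERENCES recording(recording_id),
    phrase               TEXT,     -- text entered by the first transcriber
    intelligibility      INTEGER,  -- grade from the second transcriber
    severity             INTEGER,  -- grade from the second transcriber
    misarticulation_type TEXT,     -- tag from the third transcriber
    onset_s              REAL,     -- tagged onset time in seconds
    offset_s             REAL      -- assumed end time, not in the abstract
);
"""


def build_corpus_db(path=":memory:"):
    """Create an empty corpus database with the three illustrative tables."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def find_phrase_segments(conn, phrase):
    """Return (wav_path, channel, onset_s, offset_s) for each match of a phrase."""
    query = """
        SELECT r.wav_path, r.channel, t.onset_s, t.offset_s
        FROM transcript AS t
        JOIN recording AS r ON r.recording_id = t.recording_id
        WHERE t.phrase = ?
        ORDER BY r.wav_path, t.onset_s
    """
    return conn.execute(query, (phrase,)).fetchall()
```

Under these assumptions, the file path and time stamps returned by `find_phrase_segments` are the kind of information an analysis tool such as Praat would need in order to cut out the matching stretch of audio; how the authors actually linked their database to Praat is not stated in the abstract.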

Acknowledgement

This study was supported by a grant from the Ministry of Health and Welfare (No. 2010-0020859).