국어 낭독체 발화의 운율경계 예측

Prediction of Break Indices in Korean Read Speech

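  • 김효숙 (Speech Engineering Research Institute, (주)언어과학)
  • 김정원 (Speech Engineering Research Institute, (주)언어과학)
  • 김선주 (Speech Engineering Research Institute, (주)언어과학)
  • 김선철 (Speech Engineering Research Institute, (주)언어과학)
  • 김삼진 (Speech Engineering Research Institute, (주)언어과학)
  • 권철홍 (Department of Information and Communication Engineering, Daejeon University)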
  • Published: 2002.06.01

Abstract

This study models Korean prosodic phrasing using the CART (classification and regression tree) method. Our data are limited to Korean read speech. We used 400 sentences drawn from editorials, essays, novels, and news scripts. A professional radio actress read the 400 sentences, which took about two hours. We used the K-ToBI transcription system. For technical reasons, the original break indices 1 and 2 were merged into AP (accentual phrase), so unlike the original K-ToBI we use three break indices: Zero, AP, and IP (intonational phrase). The linguistic information selected for this study is as follows: the number of syllables in an ‘Eojeol’, the location of the ‘Eojeol’ in the sentence, and the part of speech (POS) of adjacent ‘Eojeol’s. We trained a CART tree using this information as input variables. The average accuracy of predicting NonIP (Zero and AP) and IP was 90.4% on the training data and 88.5% on the test data. The average prediction accuracy for Zero and AP was 79.7% on the training data and 78.7% on the test data.
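
The feature set and the two-way NonIP/IP task described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy sentence, its POS tags, its break labels, the feature names, and the use of Gini impurity as the split criterion (the usual choice for CART classification) are all assumptions made for illustration. It builds one feature record per Eojeol boundary, collapses Zero and AP into NonIP, and searches for the single best binary split, which is one step of growing a CART tree.

```python
from collections import Counter

def boundary_features(eojeols, pos_tags):
    """One feature record per Eojeol (hypothetical feature names)."""
    n = len(eojeols)
    rows = []
    for i, (word, pos) in enumerate(zip(eojeols, pos_tags)):
        rows.append({
            # Assumes pure Hangul text: one character is one syllable.
            "n_syllables": len(word),
            # 0-based location of the Eojeol in the sentence.
            "position": i,
            # POS of this Eojeol and of the following (adjacent) one.
            "pos": pos,
            "next_pos": pos_tags[i + 1] if i + 1 < n else "END",
        })
    return rows

def collapse(label):
    """Merge Zero and AP into NonIP for the two-way NonIP/IP task."""
    return "IP" if label == "IP" else "NonIP"

def gini(labels):
    """Gini impurity of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return 1.0 - sum((c / total) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature, value, weighted_gini) of the best binary split.

    Numeric features are split as `x <= value`, categorical ones as
    `x == value`. Ties keep the first split found.
    """
    best = None
    for feat in rows[0]:
        for value in sorted({r[feat] for r in rows}):
            if isinstance(value, (int, float)):
                goes_left = [r[feat] <= value for r in rows]
            else:
                goes_left = [r[feat] == value for r in rows]
            left = [y for y, g in zip(labels, goes_left) if g]
            right = [y for y, g in zip(labels, goes_left) if not g]
            if not left or not right:
                continue  # not a real split
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[2]:
                best = (feat, value, score)
    return best

# Toy example: sentence, POS tags, and break labels are invented.
eojeols = ["나는", "어제", "학교에", "갔다"]
pos_tags = ["N", "ADV", "N", "V"]
breaks = ["AP", "Zero", "AP", "IP"]

rows = boundary_features(eojeols, pos_tags)
two_class = [collapse(b) for b in breaks]
split = best_split(rows, two_class)
print(split)
```

On real data the same search would be applied recursively to each side of the chosen split, and the Zero versus AP decision would be evaluated separately on the boundaries predicted as NonIP.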

Keywords