Scalable High-quality Speech Reconstruction in Distributed Speech Recognition Environments

  • Yoon, Jae-Sam (Department of Information and Communications, Gwangju Institute of Science and Technology) ;
  • Kim, Hong-Kook (Department of Information and Communications, Gwangju Institute of Science and Technology) ;
  • Kang, Byung-Ok (Electronics and Telecommunications Research Institute)
  • Published : 2007.07.11

Abstract

In this paper, we propose a scalable, high-quality speech reconstruction method for distributed speech recognition (DSR). It is difficult to reconstruct high-quality speech at the DSR server from mel-frequency cepstral coefficients (MFCCs) alone. Depending on the bit-rate available to the DSR system, additional information associated with speech coding can be sent to the DSR server, with the bit-rate varying from 4.8 kbit/s to 11.4 kbit/s. Experimental results show that, at 11.4 kbit/s, the quality of the speech reproduced by the proposed method is comparable to that of ITU-T G.729 under both ideal-channel and frame-error channel conditions, while DSR performance is maintained at the level of wireline speech recognition.

Keywords