A Study on APC-MPC in 8kbps of Convergence System

  • Lee, See-Woo (Dept. of Information and Telecommunication Engineering, Sangmyung University)
  • Received : 2015.04.21
  • Reviewed : 2015.07.20
  • Published : 2015.07.28

Abstract

In multi-pulse coding (MPC) that uses separate voiced and unvoiced excitation sources, the reconstructed voiced waveform is distorted. This distortion is caused by normalization of the synthesized voiced speech waveform during restoration. To solve this problem, this paper proposes APC-MPC, which compensates the amplitude and position of the multi-pulses in each pitch interval so as to reduce distortion of the synthesized waveform. APC-MPC was implemented as a coding system, and its SNRseg was evaluated under the 8 kbps coding condition of a convergence system. The resulting SNRseg of APC-MPC was 14.3 dB for male voice and 13.9 dB for female voice. The method is expected to be applicable to speech coding schemes that use low-bit-rate excitation sources, such as those in cellular phones and smartphones.
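The abstract reports results as SNRseg (segmental SNR) but does not give the formula or the frame settings used. The following is a minimal sketch of the common definition, the mean of per-frame SNRs in dB. The function name `snr_seg`, the 160-sample frame length (20 ms at an assumed 8 kHz sampling rate), and the rule of skipping silent frames are illustrative assumptions, not details taken from the paper.

```python
import math


def snr_seg(reference, decoded, frame_len=160, eps=1e-12):
    """Segmental SNR in dB: the average of per-frame SNRs.

    reference, decoded: equal-length sequences of samples.
    frame_len: samples per frame (assumed value, not from the paper).
    Frames with (near-)zero reference energy are skipped, and a
    trailing partial frame is ignored.
    """
    if len(reference) != len(decoded):
        raise ValueError("signals must have the same length")

    frame_snrs = []
    for start in range(0, len(reference) - frame_len + 1, frame_len):
        ref = reference[start:start + frame_len]
        dec = decoded[start:start + frame_len]
        signal_energy = sum(x * x for x in ref)
        noise_energy = sum((x - y) ** 2 for x, y in zip(ref, dec))
        if signal_energy <= eps:
            continue  # silent frame: its SNR is undefined, so skip it
        # Floor the noise energy so a perfect frame does not divide by zero.
        ratio = signal_energy / max(noise_energy, eps)
        frame_snrs.append(10.0 * math.log10(ratio))

    if not frame_snrs:
        raise ValueError("no non-silent full frame to evaluate")
    return sum(frame_snrs) / len(frame_snrs)
```

Passing the original speech as `reference` and the coder output as `decoded` yields one figure per utterance. Some definitions also clamp each per-frame SNR to a fixed range before averaging; this sketch does not, so its values may differ from figures computed with clamping.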
