DOI QR코드

DOI QR Code

A Study on ACFBD-MPC in 8kbps

8kbps에 있어서 ACFBD-MPC에 관한 연구

  • Lee, See-Woo (Information and Telecommunication, SANGMYUNG University)
  • 이시우 (상명대학교 정보통신공학과)
  • Received : 2016.04.26
  • Accepted : 2016.07.07
  • Published : 2016.07.31

Abstract

Recently, the use of signal compression methods to improve the efficiency of wireless networks have increased. In particular, the MPC system was used in the pitch extraction method and the excitation source of voiced and unvoiced to reduce the bit rate. In general, the MPC system using an excitation source of voiced and unvoiced would result in a distortion of the synthesis speech waveform in the case of voiced and unvoiced consonants in a frame. This is caused by normalization of the synthesis speech waveform in the process of restoring the multi-pulses of the representation segment. This paper presents an ACFBD-MPC (Amplitude Compensation Frequency Band Division-Multi Pulse Coding) using amplitude compensation in a multi-pulses each pitch interval and specific frequency to reduce the distortion of the synthesis speech waveform. The experiments were performed with 16 sentences of male and female voices. The voice signal was A/D converted to 10kHz 12bit. In addition, the ACFBD-MPC system was realized and the SNR of the ACFBD-MPC estimated in the coding condition of 8kbps. As a result, the SNR of ACFBD-MPC was 13.6dB for the female voice and 14.2dB for the male voice. The ACFBD-MPC improved the male and female voice by 1 dB and 0.9 dB, respectively, compared to the traditional MPC. This method is expected to be used for cellular telephones and smartphones using the excitation source with a low bit rate.

Acknowledgement

Supported by : 상명대학교

References

  1. Selma Ozaydm, Buyurman Baykal:"Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates",Speech Communication 41, pp.381-392, 2003. DOI: http://dx.doi.org/10.1016/S0167-6393(03)00009-8 https://doi.org/10.1016/S0167-6393(03)00009-8
  2. K.Krishna, V.L.N.Murty, K.R.Ramakrishnan:"Vector quantization of excitation gains in speech coding", Signal Processing 81, pp. 203-209, 2001. DOI: http://dx.doi.org/10.1016/S0165-1684(00)00200-0 https://doi.org/10.1016/S0165-1684(00)00200-0
  3. Ghaemmaghami,S., Sridharan,S.:"Very low rate speech coding using temporal decomposition," IEEE Electron. Lett.35(6), pp. 456-457, 1999. DOI: http://dx.doi.org/10.1049/el:19990316 https://doi.org/10.1049/el:19990316
  4. LeBlanc,W.P,Bhattacharya,B.,Mahmoud,S.A.:"Efficient search and design procedures for robust multi stage vector quantization of LPC parameters for 4kbps speech coding," IEEE Trans. Speech Audio Process, pp. 373-385, 1993. DOI: http://dx.doi.org/10.1109/89.242483 https://doi.org/10.1109/89.242483
  5. McCree,A.V.,Barnwell, T.P.: "A mixed excitation LPC vocoder model for low bit rate speech coding", IEEE Trans. Speech Audio Process, pp. 242-250, 1995. DOI: http://dx.doi.org/10.1109/89.397089 https://doi.org/10.1109/89.397089
  6. L. Hodgson, M.E. Jernigan, B.L. Wills: "Nonlinear Multiplicative Cepstral Analysis for Pitch Extraction in Speech," IEEE, S4b.11, 1990. DOI: http://dx.doi.org/10.1109/ICASSP.1990.115616
  7. Lawrence R,Rabiner,Michael J.Cheng, Aarone. Rosenberg, Carol A.McGonegal: "A Comparative Performance Study of Several Pitch Detection Algorithms," IEEE, Vol. ASSP-24, 1976. DOI: http://dx.doi.org/10.1109/TASSP.1976.1162846 https://doi.org/10.1109/TASSP.1976.1162846
  8. Chong Kwan Un, Shin-Chien Yang: "A Pitch Extraction Algorithm Based on LPC Inverse Filtering and AMDF," IEEE, Vol. ASSP-39, 1991. DOI: http://dx.doi.org/10.1109/TASSP.1977.1163005 https://doi.org/10.1109/TASSP.1977.1163005