Multi Mode Harmonic Transform Coding for Speech and Music

  • Kim, Jonghark (Dept. of Radio Engineering, Chungbuk National University)
  • Shin, Jae-Hyun (Dept. of Radio Engineering, Chungbuk National University)
  • Lee, Insung (Dept. of Radio Engineering, Chungbuk National University)
  • Published: 2003.09.01

Abstract

A multi-mode harmonic transform coding (MMHTC) scheme for speech and music signals is proposed. It is structured as a linear prediction model driven by harmonic and transform-based excitation. The proposed coder also uses harmonic prediction and an improved quantizer for the excitation signal. To quantize the excitation of music signals efficiently, the modulated lapped transform (MLT) is introduced. The coder thus combines time-domain (linear prediction) and frequency-domain techniques to achieve the best perceptual quality. At a bit rate of 4 kbps, the proposed coder showed better speech quality than the 8 kbps QCELP coder.
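
As an illustration of the MLT mentioned above, the following is a minimal sketch of the textbook orthogonal MLT with a sine window, not the coder described in the paper. The block size `M = 8`, the test signal, and all function names are assumptions chosen for the example; the direct O(M²) evaluation is used only for readability.

```python
import math

def mlt_basis(M):
    """Return the M x 2M MLT basis (sine window, orthogonal scaling)."""
    scale = math.sqrt(2.0 / M)
    basis = []
    for k in range(M):
        row = []
        for n in range(2 * M):
            window = math.sin((n + 0.5) * math.pi / (2 * M))
            phase = (n + (M + 1) / 2.0) * (k + 0.5) * math.pi / M
            row.append(window * scale * math.cos(phase))
        basis.append(row)
    return basis

def mlt_forward(block, basis):
    """Map one block of 2M samples to M transform coefficients."""
    return [sum(p * x for p, x in zip(row, block)) for row in basis]

def mlt_inverse(coeffs, basis):
    """Map M coefficients back to 2M windowed, time-aliased samples."""
    two_m = len(basis[0])
    return [sum(c * row[n] for c, row in zip(coeffs, basis))
            for n in range(two_m)]

def analyze_synthesize(signal, M):
    """Transform overlapping blocks (hop M) and overlap-add the inverses."""
    basis = mlt_basis(M)
    out = [0.0] * len(signal)
    for start in range(0, len(signal) - 2 * M + 1, M):
        coeffs = mlt_forward(signal[start:start + 2 * M], basis)
        for n, y in enumerate(mlt_inverse(coeffs, basis)):
            out[start + n] += y
    return out

# Hypothetical test signal and block size, chosen only for illustration.
M = 8
signal = [math.sin(0.3 * n) + 0.5 * math.cos(1.1 * n) for n in range(4 * M)]
rebuilt = analyze_synthesize(signal, M)

# Only samples covered by two overlapping blocks are reconstructed exactly.
max_err = max(abs(a - b) for a, b in zip(signal[M:3 * M], rebuilt[M:3 * M]))
```

Each block of 2M samples yields only M coefficients, so the transform is critically sampled; the aliasing in each inverse block cancels when adjacent blocks are overlap-added. In a coder, quantization would be applied to the coefficients between `mlt_forward` and `mlt_inverse`, a step this sketch leaves out.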
