DOI QR코드

DOI QR Code

A Study of BWE-Prediction-Based Split-Band Coding Scheme

BWE 예측기반 대역분할 부호화기에 대한 연구

  • Published : 2008.08.31

Abstract

In this paper, we discuss a method for efficiently coding the high-band signal in the split-band coding approach where an input signal is divided into two bands and then each band may be encoded separately. Generally, and especially through the research on the artificial bandwidth extension (BWE), it is well known that there is a correlation between the two bands to some degree. Therefore, some coding gain could be achieved by utilizing the correlation. In the BWE-prediction-based coding approach, using a simple linear BWE function may not yield optimal results because the correlation has a non-linear characteristic. In this paper, we investigate the new coding scheme more in details. A few representative BWE functions including linear and non-linear ones are investigated and compared to find a suitable one for the coding purpose. In addition, it is also discussed whether there are some additional gains in combining the BWE coder with the predictive vector quantizer which exploits the temporal correlation.

본 논문에서는 입력신호를 하위대역 (low-band)과 상위대역 (high-band)으로 나누어 각 대역을 개별적으로 부호화하는 대역분할 부호화 (split-band coding) 방식에 있어서, 상인대역 신호를 효율적으로 부호화하는 방법에 대해 다룬다. 일반적으로 그리고 특히, 그 동안 대역폭 확장법 (Bandwidth Extension, BWE)에 관한 연구를 통하여 두 대역 사이에 일정 정도의 상관관계가 존재한다는 사실이 밝혀져 있다. 따라서 두 대역간에 예측 부호화 기법을 도입함으로써 부호화 효율을 향상시킬 수 있다. BWE 예측기반 부호화 기법과 관련하여, 단순히 선형 BWE 함수를 이용하는 것은 두 대역간의 관계가 비선형성을 가지고 있으므로 최적의 결과를 얹기 어렵다. 따라서 비선형 BWE 함수를 포함한 다양한 예측 함수들의 성능비교를 통하여 가장 적절한 예측기를 선택하고자 하는 노력이 필요하다. 본 논문에서는 몇몇 대표적인 BWE 함수를 이용한 주파수 대역간 예측 부호화 방법에 대해 살펴 보고 각각의 성능을 평가한다. 또한 BWE 예측기반 부호화기를 (주파수)공간상의 중복제거 기술로 볼 때, 시간적 중복 제거 기술 즉, 예측 벡터 양자화기 (predictive vector quantizer)와의 결합이 부호화 효율향상에 상승효과가 있는지에 대해서도 검토한다.

Keywords

References

  1. P. Jax,"Bandwidth extension for speech,"in Audio Bandwidth Extension, E. Larsen and R. M. Aarts (Ed.), (John Wiley & Sons, 2004), Chap.6, pp.171-235
  2. 3GPP TS 26.290, Audio codec processing functions; Extended Adaptive Multi-Rate Wideband (AMR-WB+) codec; Transcoding functions, (June 2004)
  3. 3GPP TS 26.404, General audio codec audio processing functions; Enhanced aacPlus general audio codec; Enhanced aacPlus encoder SBR part, (Sept. 2004)
  4. M. Dietz, L. Liljeryd, K. Kjorling, and O. Kunz, "Spectral Band Replication, a novel approach in audio coding," 112th AES Convention, Preprint 5553, May 2002
  5. B. Geiser and P. Vary, "Backwards compatible wideband telephony in mobile networks: CELP watermarking and bandwidth extension," ICASSP 4, 533-536, April 2007
  6. M. Nilsson, S. V. Andersen, and W. B. Kleijn, "On the mutual information between frequency bands in speech," ICASSP 3, 1327-1330, June 2000
  7. M. Nilsson, H. Gustafsson, S. V. Andersen, and W. B. Kleijn, "Gaussian mixture model based mutual information between frequency bands in speech," ICASSP 1, 525-528, May 2002
  8. P. Jax and P. Vary, "Feature selection for improved bandwidth extension of speech signals," ICASSP 1, 697-700, May 2004
  9. J. S. Garofolo, L. F. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA-TIMIT: Acoustic-Phonetic Continuous Speech Corpus," 1990
  10. 송근배, 김석호, "음성신호의 대역폭 확장을 위한 GMM 방법 및 HMM 방법의 성능평가", 한국음향학회지 27(3), 119-128, 2008
  11. Y. Linde, A. Buzo, and R.M. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun. 28(1), 84-95, 1980 https://doi.org/10.1109/TCOM.1980.1094577
  12. V. Cuperman and A. Gersho, "Vector predictive coding of speech at 16 kbits/s," IEEE Trans. Commun. 33(7), 685-696, July 1985 https://doi.org/10.1109/TCOM.1985.1096372
  13. H. Khalil, K. Rose, and S. L. Regunathan, "The asymptotic closed-loop approach to predictive vector quantizer design with application in video coding," IEEE Trans. Image Processing 10(1), 15-23, Jan. 2001 https://doi.org/10.1109/83.892439
  14. H. Khalil and K. Rose, "Predictive vector quantizer design using deterministic annealing," IEEE Trans. Signal Processing 51(1), 244-254, Jan. 2003 https://doi.org/10.1109/TSP.2002.806582
  15. R. Hagen,"Spectral quantization of cepstral coefficients," ICASSP 1, 509-512, April 1994