DOI QR코드

DOI QR Code

Adaptive TCX Windowing Technology for Unified Structure MPEG-D USAC

  • Lee, Tae-Jin (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Beack, Seung-Kwon (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Kang, Kyeong-Ok (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Kim, Whan-Woo (Department of Electronics Engineering, Chungnam National University)
  • Received : 2011.09.21
  • Accepted : 2011.11.14
  • Published : 2012.06.01

Abstract

The MPEG-D unified speech and audio coding (USAC) standardization process was initiated by MPEG to develop an audio codec that is able to provide consistent quality for mixed speech and music contents. The current USAC reference model structure consists of frequency domain (FD) and linear prediction domain (LPD) core modules and is controlled using a signal classifier tool. In this letter, we propose an LPD single-mode USAC structure using an adaptive widowing-based transform-coded excitation module. We tested our system using official test items for all mono-evaluation modes. The results of the experiment show that the objective and subjective performances of the proposed single-mode USAC system are better than those of the FD/LPD dual-mode USAC system.

Keywords

References

  1. ISO/IEC SC29 WG11 N9519, "Call for Proposals on Unified Speech and Audio Coding," MPEG, Oct. 2007.
  2. ISO/IEC SC29 WG11 N12013, "Study on ISO/IEC 23003- 3:201x/DIS of Unified Speech and Audio Coding," MPEG, Mar. 2011.
  3. ISO/IEC Std. 2003, "Information Technology-Coding of Audio-Visual Objects-Part 3: Audio," ISO/IEC 14496-3.
  4. ISO/IEC Std. 2003, "Bandwidth Extension," ISO/IEC 14496-3, AMD. 1.
  5. Y. Lee et al, "Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding," ETRI J., vol. 31, no. 4, Aug. 2009, pp. 365-375. https://doi.org/10.4218/etrij.09.0108.0557
  6. 3GPP TS 26.290 V6.3.0, "Extended Adaptive Multi-rate-Wideband (AMR-WB+) Codec," 2007.
  7. M. Neuendorf et al., "Unified Speech and Audio Coding Scheme for High Quality at Low Bitrates," ICASSP, 2009.
  8. ISO/IEC SC29 WG11 N9638, "Evaluation Guidelines for Unified Speech and Audio Proposals," MPEG, Jan. 2008.

Cited by

  1. Transform Coding Based on Source Filter Model in the MDCT Domain vol.35, pp.3, 2012, https://doi.org/10.4218/etrij.13.0212.0368
  2. A Low-Delay MDCT/IMDCT vol.35, pp.5, 2012, https://doi.org/10.4218/etrij.13.0212.0559
  3. Single-Mode-Based Unified Speech and Audio Coding by Extending the Linear Prediction Domain Coding Mode vol.39, pp.3, 2012, https://doi.org/10.4218/etrij.17.0116.0397