DOI QR코드

DOI QR Code

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding

  • Beack, Seung-Kwon (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Lee, Tae-Jin (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Kim, Min-Je (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI) ;
  • Kang, Kyeong-Ok (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI)
  • 투고 : 2011.01.07
  • 심사 : 2011.03.30
  • 발행 : 2011.12.31

초록

Object-based audio coding can provide new music applications with interactivity. To efficiently compress a lot of target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to enhance the generating performance of sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. From the experimental results, it was confirmed that the proposed scheme remarkably improves the SNR and sound quality.

키워드

참고문헌

  1. ISO/IEC 23003-2:2010, "Part 2: Spatial Audio Object Coding," International Standard, Oct. 2010.
  2. T. Lee et al., "A Personalized Preset-based Audio System for Interactive Service," AES Convention, Oct. 2006.
  3. ISO/IEC 14496-3:2001, "Parametric Coding for High Quality Audio," Dec. 2003.
  4. ISO/IEC 23003-1:2007, "Part 1: MPEG Surround," International Standard, Jan. 2007.
  5. C. Faller and R. Baumgarte, "Binaural Cue Coding-Part II: Schemes and Application," IEEE Trans. Speech Audio Proc., vol. 11, no. 6, Nov. 2003.
  6. S. Beack et al., "Angle-Based Virtual Source Location Representation for Spatial Audio Coding," ETRI J., vol. 28, no. 2, Apr. 2006, pp. 219-222. https://doi.org/10.4218/etrij.06.0205.0079
  7. 3GPP TS 26.290, Extended Adaptive Multi-Rate-Wideband Codec (AMR-WB+): Transcoding Functions.
  8. ITU-R Recommendation, Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA), ITU, BS. 1543-1, Geneva, 2001.