Improved Phase Synthesis for Parametric Stereo Audio Coding

  • Hyun, Dong-Il (School of Electrical and Electronic Engineering, Yonsei University)
  • Park, Young-Cheol (Computer and Telecommunications Engineering Division, Yonsei University)
  • Youn, Dae Hee (School of Electrical and Electronic Engineering, Yonsei University)
  • Received : 2013.09.29
  • Accepted : 2013.12.02
  • Published : 2013.12.25

Abstract

Parametric stereo (PS) audio coding is a form of spatial audio coding specialized for stereo signals. This paper analyzes a problem caused by the conventional synthesis of interchannel phase differences. The conventional upmix matrix synthesizes phase differences not only on the downmix signal but also on the ambient signal, which violates the signal-model assumption that the ambient signals in the two channels are in anti-phase. The resulting quality degradation is analyzed, in particular for low interchannel correlation. To solve this problem, a new upmix matrix is proposed that synthesizes phase differences only on the downmix signal, that is, on the primary component, so that the signal model is satisfied. The performance of the proposed upmix matrix is verified by subjective listening tests.
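The abstract does not give the upmix matrices themselves, so the following is only a toy sketch of the idea, not the paper's method. It assumes an equal-level channel pair, a downmix signal m and a decorrelated (ambient) signal d mixed with gains (cos α, +sin α) and (cos α, −sin α), and a phase difference split symmetrically between the two channels. All function names and the choice α = ½·arccos(ICC) are illustrative assumptions. The sketch compares the phase relation of the ambient gains when the rotation is applied to both columns of the matrix and when it is applied to the downmix column only.

```python
import cmath
import math


def mixing_gains(icc):
    """Return (downmix gain, ambient gain) for an equal-level channel pair.

    Assumed toy model: alpha = 0.5 * acos(icc). With zero phase difference,
    mixing uncorrelated unit-power m and d with gains (cos a, +sin a) and
    (cos a, -sin a) gives a channel correlation of cos(2a) = icc.
    """
    alpha = 0.5 * math.acos(icc)
    return math.cos(alpha), math.sin(alpha)


def phase_rotations(ipd):
    """Split the phase difference symmetrically (an assumption of this sketch)."""
    return cmath.exp(1j * ipd / 2), cmath.exp(-1j * ipd / 2)


def upmix_conventional(icc, ipd):
    """2x2 upmix in which the rotation multiplies the downmix AND ambient gains."""
    g, a = mixing_gains(icc)
    pl, pr = phase_rotations(ipd)
    return [[pl * g, pl * a],
            [pr * g, -pr * a]]


def upmix_downmix_only(icc, ipd):
    """2x2 upmix in which the rotation multiplies the downmix gains only."""
    g, a = mixing_gains(icc)
    pl, pr = phase_rotations(ipd)
    return [[pl * g, a],
            [pr * g, -a]]


def ambient_phase_difference(h):
    """Phase of the left ambient gain relative to the right one, in [0, 2*pi).

    Requires a nonzero ambient gain, i.e. icc < 1.
    """
    return cmath.phase(h[0][1] / h[1][1]) % (2 * math.pi)


def downmix_phase_difference(h):
    """Phase of the left downmix gain relative to the right one, in (-pi, pi]."""
    return cmath.phase(h[0][0] / h[1][0])


if __name__ == "__main__":
    icc, ipd = 0.2, math.pi / 2  # low correlation, 90-degree phase difference
    for name, build in (("conventional", upmix_conventional),
                        ("downmix-only", upmix_downmix_only)):
        h = build(icc, ipd)
        print(name,
              "downmix:", round(math.degrees(downmix_phase_difference(h)), 1),
              "ambient:", round(math.degrees(ambient_phase_difference(h)), 1))
```

In this toy model both variants reproduce the requested phase difference on the downmix component. The ambient gains stay exactly in anti-phase (180 degrees) only in the downmix-only variant. In the conventional variant they are shifted away from 180 degrees by the synthesized phase difference, and since the ambient gain grows as the correlation drops, the deviation affects a larger share of the output at low interchannel correlation.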
