• Title/Summary/Keyword: Spatial audio coding

Search Result 19, Processing Time 0.097 seconds

Evaluation of Spatial Audio Coding Tools for Multichannel Audio (Spatial Audio Coding 기술의 멀티채널 부호화 성능 비교)

  • Jang Inseon;Seo Jeongil;Mun Hangil;Kang Kyeongok
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.153-156
    • /
    • 2004
  • Spatial Audio Coding (SAC)은 낮은 대역폭에서 다채널/다객체 오디오 신호를 전송하기 위해 제안된 기술이다. 본 논문에서는 MPEG 에서 SAC 기술의 평가 방법으로 채택된 Multi-Stimulus test with Hidden Reference and Anchor (MUSHRA) 실험 절차에 대해서 설명한다. 또한 제 69 차 MPEG 회의에서 제안된 4 개 기관의 SAC 기술에 대한 청취실험을 수행하고 그 결과를 분석한다.

  • PDF

Multi-channel Audio Service in a Terrestrial-DMB System Using VSLI-Based Spatial Audio Coding

  • Seo, Jeong-Il;Moon, Han-Gil;Beack, Seung-Kwon;Kang, Kyeong-Ok;Hong, Jae-Keun
    • ETRI Journal
    • /
    • v.27 no.5
    • /
    • pp.635-638
    • /
    • 2005
  • Spatial audio coding (SAC) is an extremely high compact representation of encoded multi-channel audio material. This paper suggests a multi-channel audio service in the terrestrial digital multimedia broadcasting (T-DMB) system using a novel SAC tool, which is called a virtual source location information (VSLI)-based SAC tool. Intensive experiments are presented to evaluate the validity of the proposed VSLI-based SAC tool, and prototypical systems are also presented to demonstrate the reliability of the proposed multi-channel T-DMB system in real applications.

  • PDF

An efficient multichannel spatial audio coding method based on inter channel correlation (채널상관성에 기반한 효율적인 멀티채널 spatial audio coding 방법)

  • Lee Byonghwa;Beack Seungkwon;Seo Jeongil;Hahn Minsoo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.157-160
    • /
    • 2004
  • Spatial Audio Coding 방법 중 하나인 Binaural Cue Coding 방법은 다채널 다객체 오디오 신호를 모노나 스테레오로 다운 믹스한 신호와 spatial 큐를 전송해 디코더에서 복원하는 기술로 작은 비트 율로 다채널 오디오 신호를 전송 복원해 내는 기술이다. 본 논문은 BCC 코딩 방법에서 채널 상관도를 나타내는 ICC 파라메터에 따라 spatial cue 종류를 달리함으로써 전송되는 부가정보의 비트 율을 줄이는 방법을 제안한다.

  • PDF

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding (멀티채널 Spatial Audio Coding에서의 효율적인 Spatial Cues 사용과 그에 따른 Spectrum 보상방법)

  • Lee, Byong-Hwa;Beack, Seung-Kwon;Seo, Jeong-Gil;Han, Min-Soo
    • MALSORI
    • /
    • no.53
    • /
    • pp.157-169
    • /
    • 2005
  • This paper proposes an efficiently representing method of spatial cues on multichannel spatial audio coding. The Binaural Cue Coding (BCC) method introduced recently represents multichannel audio signals by means of Inter Channel Level Difference (ICLD) or Source Index (SI). We tried to express more efficiently ICLD and SI information based on Inter Channel Correlation in this paper. We adopt different spatial cues according to ICC and propose a compensation method of empty spectrums created by using SI. We performed a MOS test and measuring spectral distortion. The results show that the proposed method can reduce the bitrate of side information without large degradation of the audio quality.

  • PDF

The Development of audio codec using binaural cue coding technologies (Binaural Cue Coding 기술을 이용한 오디오 코덱 구현)

  • Seo Jeongil;Kang Kyeongok;Lee Byonghwa;Hahn Minsoo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.137-140
    • /
    • 2004
  • 낮은 대역폭에서 다채널 다객체 오디오 신호를 전송하기위해 새롭게 제안된 Spatial Audio Coding 기술은 멀티채널 오디오 신호를 다운믹싱하고 나머지 채널은 음향공간상의 위치정보를 나타내는 파라미터들로 압축하여 표현하는 파라메트릭 압축 방식이다. 본 논문에서는 Spatial Audio Coding 기술중의 하나인 BCC 기술을 이용하여 스테레오 오디오 코덱을 구현하고, 주관듣기평가 실험을 통하여 AAC와 비슷한 성능을 나타내면서도 높은 압축율을 얻을 수 있음을 확인하였다.

  • PDF

Angle-Based Virtual Source Location Representation for Spatial Audio Coding

  • Beack, Seung-Kwon;Seo, Jeong-Il;Moon, Han-Gil;Kang, Kyeong-Ok;Hahn, Min-Soo
    • ETRI Journal
    • /
    • v.28 no.2
    • /
    • pp.219-222
    • /
    • 2006
  • Virtual source location information (VSLI) has been newly utilized as a spatial cue for compact representation of multichannel audio. This information is represented as the azimuth of the virtual source vector. The superiority of VSLI is confirmed by comparison of the spectral distances, average bit rates, and subjective assessment with a conventional cue.

  • PDF

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
    • ETRI Journal
    • /
    • v.31 no.4
    • /
    • pp.365-375
    • /
    • 2009
  • In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.

Status of MPEG Audio Standard (MPEG 오디오 표준화 동향)

  • Seo, Jeong-Il;Kang, Kyeong-Ok
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.49-52
    • /
    • 2008
  • This paper briefly introduces the current status of MPEG Audio Subgroup activities for standardizing a new audio coding technologies. Currently MPEG Audio Subgroup focused on spatial audio coding tools for compressing multiple audio objects and unified coding tools for presenting the consistence performance on speech and audio signal at the same time. Also a new MAF (MPEG Application Format) for interactive music was introduced at the 84th MPEG meeting.

  • PDF

Improved Synthesis Method of Negative Inter-channel Correlation Parameter Based on Anti-phase Primary Component (반위상 주요성분에 기반을 둔 개선된 음수 채널간 상관도 파라미터 합성 기법)

  • Hyun, Dong-Il;Lee, Seok-Pil;Park, Young-Cheol;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.6
    • /
    • pp.410-418
    • /
    • 2012
  • Parametric stereo(PS) and MPEG surround(MPS) are major spatial audio coding(SAC) tools. In this paper, the problem of the inter-channel correlation(ICC) synthesis in the conventional SAC is analyzed. Conventional methods assume that ambient components mixed to two output channels are anti-phased, while the primary components are assumed to be in-phased. This assumption can cause excessive ambient mixing for a negative-valued ICC. As a remedy to this problem, we propose a new ICC synthesis method based on an assumption that the primary components are anti-phased each other for a negative ICC. The proposed method is also applied to the approximation which works in practice. The performance of the proposed method was evaluated by computer simulations and the subjective listening tests verified that the proposed method is effective in not only headphones but also loudspeakers playback.

An Efficient Representation Method for ICLD with Robustness to Spectral Distortion

  • Beack, Seung-Kwon;Seo, Jeong-Il;Kang, Kyung-Ok;Hanh, Min-Soo
    • ETRI Journal
    • /
    • v.27 no.3
    • /
    • pp.330-333
    • /
    • 2005
  • The Inter-Channel Level Difference (ICLD) is a cue parameter to estimate spectral information in a binaural cue coding that has been recently in the spotlight as a multichannel audio signal compression technique. Even though the ICLD is an essential parameter, it is generally distorted by quantization. In this paper, a new modified ICLE representation method to minimize the quantization distortion is proposed by adopting a flexible determination of the reference channel and the unidirectional quantization. Our experimental result confirms that the proposed method improves the multichannel audio output quality even with the reduced bit-rate.

  • PDF