Development of a Listener Position Adaptive Real-Time Sound Reproduction System

청취자 위치 적응 실시간 사운드 재생 시스템의 개발

  • 이기승 (건국대학교 정보통신대학 전자공학부) ;
  • 이석필 (전자부품연구원 방송통신융합 연구센터)
  • Received : 2010.08.03
  • Accepted : 2010.10.06
  • Published : 2010.10.31

Abstract

In this paper, a new audio reproduction system was developed in which the cross-talk signals would be reasonably cancelled at an arbitrary listener position. To adaptively remove the cross-talk signals according to the listener's position, a method of tracking the listener position was employed. This was achieved using the two microphones, where the listener direction was estimated using the time-delay between the two signals from the two microphones, respectively. Moreover, room reverberation effects were taken into consideration where linear prediction analysis was involved. To remove the cross-talk signals at the left-and right-ears, the paths between the sources and the ears were represented using the KEMAR head-related transfer functions (HRTFs) which were measured from the artificial dummy head. To evaluate the usefulness of the proposed listener tracking system, the performance of cross-talk cancellation was evaluated at the estimated listener positions. The performance was evaluated in terms of the channel separation ration (CSR), a -10 dB of CSR was experimentally achieved although the listener positions were more or less deviated. A real-time system was implemented using a floating-point digital signal processor (DSP). It was confirmed that the average errors of the listener direction was 5 degree and the subjects indicated that 80 % of the stimuli was perceived as the correct directions.

본 논문에서는 두 개의 스피커를 이용한 청취 환경에서 좌, 우 채널의 간섭 신호를 제거하기 위한 새로운 오디오 시스템을 개발하였다. 간섭 제거는 청취자의 위치에 따라 적응적으로 이루어져야 하기 때문에, 청취 위치를 추적하기 위한 기법이 적용되었다. 청취자 위치 추적은 2개의 마이크로폰을 통하여 이루어지며 채널 간 시간 지연을 이용하여 청취자의 방향을 추정하도록 하였다. 또한 잔향 환경에서의 사용을 고려하여 선형 예측 기법을 이용한 잔향 제거 기법이 적용되었다. 좌,우 채널의 간섭제거를 위한 음원-귀 간의 경로는 KEMAR 머리전달함수를 이용하여 나타내었다. 사용된 청취자 방향 측정 시스템의 유용성을 평가하기 위해 추정된 위치에서 채널 간섭의 성능을 평가하였다. 평가 척도로 채널 분리 비를 사용하였으며, 실험적인 결과, 사용자의 실제 위치와 추정된 위치 간에 다소 차이가 있더라도 -10 dB의 채널 분리비가 얻어짐을 확인 할 수 있었다. 제안된 알고리즘은 부동소수점 디지털 신호처리 프로세서를 이용하여 실시간 구현되었으며 청취자 평균 방향 오차는 5도, 주관적 간섭 제거율은 평균적으로 80 % 얻어짐을 알 수 있었다.

Keywords

References

  1. J. J. Lopez, F. Orduna and A. Gonzalez, "Modeling and measurement of cross-talk cancellation zones for small displacements of the listener in transaural sound reproduction with different loudspeaker arrangements", AES 109th Convention, 2000.
  2. M. R. Bai and C.-C. Lee, "Comprehensive analysis of loudspeaker span effects on crosstalk cancellation in spatial sound reproduction", AES 120th Convention, 2006.
  3. J. Rose , P. Nelson, B. Rafaely and T. Takeuchi, "Sweet spot size of virtual acoustic imaging systems at asymmetric listener locations," J. Acoust. Soc. Am., vol. 112, no. 5, pp. 1992-2002, 2002. https://doi.org/10.1121/1.1510532
  4. C. Kyriakakis, T. Holman, J.-S. Lim, H. Hong and H. Neven, "Signal processing, acoustics, and psychoacoustics for high quality desktop audio," J. Visual Comm. and Image Rep., vol. 8, no. 1, pp. 51-61, 1998.
  5. P. G. Georgiou, A. Mouchtaris, S. I. Roumeliotis and C. Kyriakakis, "Immersive sound rendering using laser-based tracking", AES 109th Convention, 2000.
  6. S. Kim, S. Jang, D. Kong and S. Bang, "Adaptive virtual surround sound rendering method for an arbitrary listening position", AES 30th Int. Conference, 2007.
  7. S. Merchel and S. Groth, "Analysis and implementation of a stereophonic play back system for adjusting the sweet spot to the listener's position", AES 126th Convention, 2009.
  8. N. Sakamoto, T. Gotoh, T. Kogure and M. Shimbo, "Controlling soundimage localization in stereophonic reproduction", J. Audio Eng. Soc., vol. 29, no. 11, pp. 794-799, 1981.
  9. B. Gardner and K. Martin, KEMAR HRTF data, ftp://sound. media.mit.edu/ pub/Data/KEMAR (last viewed 8/21/08), 1994.
  10. M. Jian, A. C. Kot and M. H. Er, "Performance study of time delay estimation in a room enviroment", Proc. IEEE ICASSP, pp. V554-V557, 1998.
  11. C. H. Knapp ad G. C. Carter, "The generalized correlation method for estimation of time delay", IEEE Trans. Acoust., Speech, Singal Process., vol. 24, no. 4, pp. 320-327, 1976. https://doi.org/10.1109/TASSP.1976.1162830
  12. J. R. Hopgood and P. J. W. Rayner, "A probabilistic framework for subband autoregressive model applied to room acoustics", Proc. IEEE Signal Processing Workshop on Statistical Signal Processing, pp. 492-495, 2001.
  13. TMS320C6713 Floating-point digital signal processor, Texas Instruments, Nov., 2002.
  14. TMS320C6000 Programmer's Guide, Texas Instruments, Aug., 2002.
  15. O. Kirkeby, P. A. Nelson and H. Hamada, "Fast deconvolution of multichannel systems usng regularization", IEEE Trans. Speech and Audio Process., vol. 6, no. 2, pp. 189-194, 1998. https://doi.org/10.1109/89.661479