DOI QR코드

DOI QR Code

Audio Fingerprinting Based Spatial Audio Reproduction System

오디오 핑거프린팅기반 입체음향 재현 시스템

  • Ryu, Sang Hyeon (Department of Wireless Communication Engineering, Kwangwoon University) ;
  • Kim, Hyoung-Gook (Department of Wireless Communication Engineering, Kwangwoon University)
  • 류상현 (광운대학교 전파공학과) ;
  • 김형국 (광운대학교 전파공학과)
  • Received : 2013.10.22
  • Accepted : 2013.11.27
  • Published : 2013.12.25

Abstract

This paper proposes a spatial audio reproduction system based on audio fingerprinting that combines the audio fingerprinting and the spatial audio processing. In the proposed system, a salient audio peak pair fingerprint based on modulation spectrum improves the accuracy of the audio fingerprinting system in real noisy environments and spatial audio information as metadata gives a listener a sensation of being listening to the sound in the space, where the sound is actually recorded.

본 논문에서는 오디오 핑거프린팅 방식과 스파셜 오디오 처리 방식을 결합한 오디오 핑거프린팅 기반 입체음향 재현 시스템을 제안한다. 제안된 시스템에서는 변조스펙트럼 기반의 명확한 오디오 정점 핑거프린트를 이용하여 잡음환경에서 오디오 핑거프린팅 시스템의 검색정확도를 향상시켰으며, 메타데이터로 제공되는 스파셜 오디오 정보는 청취자에게 소리가 실제로 녹음된 공간에서 소리를 듣는 것 같은 느낌을 준다.

Keywords

References

  1. P. Cano, E. Batlle, T. Kalker, and J. Haitsma, "A review of algorithms for audio fingerprinting," IEEE Workshop on Multimedia Signal Processing, pp. 169-173, Dec. 2002.
  2. J. Cerquides, "A real time audio fingerprinting system for advertisement tracking and reporting in FM radio," Radioelektronika, 2007. 17th International Conference, pp. 1-4, Apr. 2007.
  3. A. Sinitsyn, "Duplicate song detection using audio fingerprinting for consumer electronics devices," IEEE International Symposium on Consumer Electronics, pp.1-6. Jun. 2006.
  4. J. Haitsma, and T. Kalker, "A highly robust audio fingerprinting system," 3rd International Conference Music Information Retrieval, pp. 107-115, Oct. 2002.
  5. J. I. Seo, and G. O. Gang, "Introduction and Standard Status of High Order Multichannel AudioSystem for Realistic Audio Broadcasting", Electronics and Telecommunications Trends, pp. 49-56, Dec. 2012.
  6. ISO/IEC 23008, Information technology, "High efficiency coding and media delivery in heterogeneous environments," MPEG-H, Jun. 2013.
  7. I. S. Jang, S. K. Beack, J. I. Seo, and D. Y. Jang, "Overview of MPEG Surround," Journal of Broadcast Engineering, pp. 181-190, Jun. 2006.
  8. J. I. Seo, I. S. Jang, and G. O. Gang, "Overview of object based audio technology in MPEG," Korea Society Broadcast Engineers Magazine, pp. 75-87, Jun. 2009.
  9. S. Baluja, and M. Covell, "Beyond "nearduplicates": learning hash codes for efficient similar-image retrieval," 20th International Conference on. Pattern Recognition, Aug. 2010.
  10. A. Wang, "An industrial strength audio search algorithm," 4th International Conference Music Information Retrieval, pp. 7-13, Oct. 2003.
  11. H.-G. Kim, K. W. Um, J. Y. Kim, Y. Y. She, and X. Zhu, "Method, Medium, and system for music retrieval using modulation spectrum," US Patent, US 7818167 B2, Oct. 2010.
  12. X. Pan, X. Yu, J. Deng, W. Yang, and H. Wang, "Audio fingerprinting based on local energy centroid," IET International Communication Conference on Wireless Mobile and Computing, pp. 351-354, Nov. 2011.
  13. X. Anguera, A. Garzon, and T. Adamek, "MASK: robust local feature for audio fingerprinting," International Conference on Multimedia and Expo, pp. 455-460, Jul. 2012.
  14. M. Malekesmaeili, and R. K. Ward, "A novel local audio fingerprinting algorithm," IEEE 14th International Workshop on Multimedia Signal Processing, pp. 136-140, Sep. 2012.