Design of a Database for Evaluating Core 3D Stereophonic Audio Algorithms

An Architecture for 3D Audio Core Algorithm Evaluation DB

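  • Hwang Jae-min (Department of Computer and Information Engineering, Inha University) ;
  • Kim Jeong-hyeok (Department of Computer and Information Engineering, Inha University) ;
  • Kang Sang-gil (Department of Computer and Information Engineering, Inha University)
  • Received : 2014.06.02
  • Reviewed : 2014.06.15
  • Published : 2014.06.30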

Abstract

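The audio industry continues to grow as a premium industry, and 3D audio systems are an active area of research. However, work on audio databases, algorithms, evaluation, and metadata schemes has so far proceeded separately. If audio algorithms could be evaluated and their results stored within a single system, research on 3D audio would benefit. This paper therefore proposes a database architecture for systematically evaluating immersive 3D audio algorithms and defines an XML metadata scheme for implementing the database system. We present a new audio evaluation DB together with a design for implementing it systematically.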

This paper proposes an architecture for a 3D audio core algorithm evaluation database system. As 3D audio systems spread through multimedia devices, developing them requires a system for evaluating their core algorithms. Conventional evaluation systems have two problems: researchers must learn how to use each evaluation system, and audio sources are generally not indexed, which makes them inefficient to search and use. To solve these problems, we design a database architecture that uses a database management system to evaluate core algorithms automatically. We also define an XML metadata scheme that describes the audio sources stored in the database. This approach makes searching for audio sources and using the audio database more efficient.
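
As an illustration of how XML metadata can make stored audio sources searchable, the following minimal sketch parses metadata records and indexes them in a relational table. It is not the scheme defined in the paper: the element names (`audioSource`, `title`, `channels`, `sampleRate`, `targetAlgorithm`), the table layout, and the function names are all assumptions made for this example, and SQLite merely stands in for whatever database management system is used.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical metadata records. The element and attribute names are
# illustrative assumptions, not the scheme defined in the paper.
SAMPLE_XML = """
<audioSources>
  <audioSource id="src-001">
    <title>Speech, two talkers</title>
    <channels>2</channels>
    <sampleRate>48000</sampleRate>
    <targetAlgorithm>source-separation</targetAlgorithm>
  </audioSource>
  <audioSource id="src-002">
    <title>Orchestra, 5.1 mix</title>
    <channels>6</channels>
    <sampleRate>48000</sampleRate>
    <targetAlgorithm>crosstalk-cancellation</targetAlgorithm>
  </audioSource>
  <audioSource id="src-003">
    <title>Speech with music</title>
    <channels>2</channels>
    <sampleRate>44100</sampleRate>
    <targetAlgorithm>source-separation</targetAlgorithm>
  </audioSource>
</audioSources>
"""


def parse_sources(xml_text):
    """Turn each <audioSource> element into a tuple ready for insertion."""
    root = ET.fromstring(xml_text.strip())
    rows = []
    for node in root.findall("audioSource"):
        rows.append((
            node.get("id"),
            node.findtext("title"),
            int(node.findtext("channels")),
            int(node.findtext("sampleRate")),
            node.findtext("targetAlgorithm"),
        ))
    return rows


def build_index(rows):
    """Load the parsed metadata into an in-memory table with a search index."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE audio_source ("
        "id TEXT PRIMARY KEY, title TEXT, channels INTEGER, "
        "sample_rate INTEGER, target_algorithm TEXT)"
    )
    # Index the column that an evaluation run would filter on.
    conn.execute(
        "CREATE INDEX idx_target ON audio_source (target_algorithm)"
    )
    conn.executemany("INSERT INTO audio_source VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    return conn


def find_by_algorithm(conn, algorithm):
    """Return the ids of all sources tagged for the given algorithm type."""
    cur = conn.execute(
        "SELECT id FROM audio_source WHERE target_algorithm = ? ORDER BY id",
        (algorithm,),
    )
    return [row[0] for row in cur.fetchall()]


if __name__ == "__main__":
    conn = build_index(parse_sources(SAMPLE_XML))
    print(find_by_algorithm(conn, "source-separation"))
```

With the metadata indexed this way, an evaluation run could select its test sources with a single query instead of a manual search through audio files.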
