Audio Fingerprint Based on Combining Binary Fingerprints

Jang, Dal-Won;Lee, Seok-Pil;

doi:10.5909/JBE.2012.17.4.659

Journal of Broadcast Engineering (방송공학회논문지)

Volume 17 Issue 4
/
Pages.659-669
/
2012
/
1226-7953(pISSN)
/
2287-9137(eISSN)

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

DOI QR Code

Audio Fingerprint Based on Combining Binary Fingerprints

이진 핑거프린트의 결합에 의한 강인한 오디오 핑거프린트

Jang, Dal-Won (Digital Media Research Center, KETI) ;
Lee, Seok-Pil (Digital Media Research Center, KETI)

장달원 (전자부품연구원 디지털 미디어 연구센터) ;
이석필 (전자부품연구원 디지털 미디어 연구센터)

Received : 2012.05.11
Accepted : 2012.07.12
Published : 2012.07.30

https://doi.org/10.5909/JBE.2012.17.4.659 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

This paper proposes the method to extract a binary audio fingerprint by combining several base binary fingerprints. Based on majority voting of base fingerprints, which are designed by mimicking the fingerprint used in Philips fingerprinting system, the proposed fingerprint is determined. In the matching part, the base fingerprints are extracted from the query, and distance is computed using the sum of them. In the experiments, the proposed fingerprint outperforms the base binary fingerprints. The method can be used for enhancing the existing binary fingerprint or for designing a new fingerprint.

이 논문에서는 이진 핑거프린트의 결합을 이용해 새로운 이진 오디오 핑거프린트를 만드는 방법을 제안하다. 필립스 핑거프린팅 시스템을 활용하여, 그 시스템에서 활용한과 비슷한 특성을 가질 것이라 예상되는 기본 이진 핑거프린트를 여러 개 추출하고, 기본 이진 핑거프린트들의 투표로 하나의 이진 오디오 핑거프린트를 결정한다. 정합단에서는 이진 핑거프린트를 이용하는 것이 아니라, 기본 이진 핑거프린트들의 합을 이용하여 거리를 계산한다. 실험을 통해서 제안하는 방법으로 만들어진 핑거프린트가 그것의 기초가 되는 기본 이진 핑거프린트들보다 향상된 성능을 보임을 확인할 수 있었다. 이 방법을 이용해서 기존의 이진 핑거프린트의 성능을 강화하거나 새로운 이진 핑거프린트를 만들 수 있을 것이라 기대된다.

Keywords

References

P. Cano, E. Batlle, T. Kalker, and J. Haitsma, "A review of audio fingerprinting," Journal of VLSI signal processing, Vol. 41, No. 3, pp. 271-284, 2005 https://doi.org/10.1007/s11265-005-4151-3
J. Haitsma and T. Kalker, "A highly robust audio fingerprinting system," in Proc. Int. Conf. Music Information Retrieval, Paris, France, 2002, pp. 107-115.
C. Burges, J. Plat, and S. Jana, "Distortion discriminant analysis for audio fingerprinting," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 165-174, May 2003. https://doi.org/10.1109/TSA.2003.811538
J. S. Seo, M. Jin, S. Lee, D. Jang, S. Lee, and C. D. Yoo, "Audio fingerprinting based on normalized spectral subband moments," IEEE Signal Process. Lett., vol. 13, no. 4, pp. 209-212, Apr. 2006. https://doi.org/10.1109/LSP.2005.863678
Y.Ke, D. Hoiem, and R. Sukthankar, "Computer vision for music identification," in Proc. CVPR, 2005, vol. 1, pp. 597-604.
K. Covell and S. Baluja, "Known-audio detection using waveprint: Spectrogram fingerprinting by wavelet hashing," in Proc. ICASSP, Hawaii, pp. 237-240., 2007
S. Kim and C. D.Yoo, "Boosted binary audio fingerprint based on spectral subband moments," in Proc. ICASSP, Hawaii, pp. 241-244, 2007
D. Jang, C. D.Yoo, S. Lee, S. Kim, and T. Kalker, "Pairwise boosted audio fingerprint," IEEE Trans. on Information Forensics and Security, Vol. 4, Iss. 4, pp 995-1004, 2009 https://doi.org/10.1109/TIFS.2009.2034452
M.S. Park, H.R. Kim, and S.H. Yang, "Frequency-temporal filtering for a robust audio fingerprinting scheme in real-noise environments," ETRI Journal, vol. 28, no. 4, pp. 509-512, Aug. 2006. https://doi.org/10.4218/etrij.06.0205.0135
Y. Liu, K. Cho, H. S. Yun, J. W. Shin, and N. S. Kim, "DCT based multiple hashing technique for robust audio fingerprinting," in Proc. ICASSP, 2009
D. Jang, C. D. Yoo and T. Kalker, "Distance metric learning for content identification," IEEE Trans. on Information Forensics and Security, Vol. 5, Iss. 4, pp. 932-944, 2010. https://doi.org/10.1109/TIFS.2010.2064769
M. Jin and C. D. Yoo, "Quantum hashing for multimedia," IEEE Trans. on Information Forensics and Security, Vol. 4, Iss. 4, pp. 982-994, 2009 https://doi.org/10.1109/TIFS.2009.2033221
P. J. O. Doets, "Distortion estimation in compressed music using only audio fingerprint," IEEE Trans. on Audio, Speech, and Language Processing, Vol. 16, Iss. 2, pp. 302-317, 2008 https://doi.org/10.1109/TASL.2007.911716
R. Schapire and Y. Singer, "Improved boosting algorithms using confidence-rated predictions," Machine Learning, vol. 37, no. 3, pp. 297-336, 1999. https://doi.org/10.1023/A:1007614523901
C. M. Bishop, Pattern recognition and machine learning, Springer, 2006

Journal of Broadcast Engineering (방송공학회논문지)

Audio Fingerprint Based on Combining Binary Fingerprints

이진 핑거프린트의 결합에 의한 강인한 오디오 핑거프린트

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)