Search | Korea Science

Kwon, Jin-Man; Ko, Il-Ju; Jang, Dae-Sik
- Journal of the Korea Society of Computer and Information / v.14 no.12 / pp.75-87 / 2009
This paper presents a method for creating audio fingerprints and matching them against audio data, so that music can be identified from the characteristics of its audio signal. The method applies the Difference of Gaussians (DoG), generally used in image recognition, to the audio data in order to extract the points where the music changes sharply, and uses those points as fingerprint locations. The resulting fingerprint is insensitive to changes in the sound, and the same fingerprint locations as in the original can be extracted from only a portion of the music data. By reducing the fingerprint data and computation, the system is more efficient than previous systems that operate in the frequency domain. The method can be used to identify copyrighted music distributed on the Internet or to provide users with music metadata.
https://doi.org/10.9708/jksci.2009.14.12.075
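
The abstract above does not say which signal the DoG is applied to or how the scales are chosen. The following is only a minimal sketch, assuming a 1-D amplitude envelope as input; the function names, the two sigma values, and the threshold are hypothetical and are not taken from the paper. It subtracts a coarse Gaussian smoothing from a fine one and keeps the local maxima of the absolute difference as candidate fingerprint locations.

```python
import math

def gaussian_kernel(sigma):
    """Return a normalized 1-D Gaussian kernel truncated at 3 sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-(i * i) / (2 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth(signal, sigma):
    """Convolve the signal with a Gaussian kernel, clamping indices at the edges."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * signal[j]
        out.append(acc)
    return out

def dog_peaks(envelope, sigma_small=1.0, sigma_large=2.0, threshold=0.05):
    """Return indices where |fine - coarse| is a local maximum above threshold.

    The sigma values and the threshold are illustrative assumptions.
    """
    fine = smooth(envelope, sigma_small)
    coarse = smooth(envelope, sigma_large)
    dog = [abs(a - b) for a, b in zip(fine, coarse)]
    return [i for i in range(1, len(dog) - 1)
            if dog[i] > threshold and dog[i] >= dog[i - 1] and dog[i] > dog[i + 1]]

# Toy envelope: silence followed by an abrupt onset at frame 20.
envelope = [0.0] * 20 + [1.0] * 20
peaks = dog_peaks(envelope)
print(peaks)  # candidate locations cluster around the onset
```

In this toy input the flat regions produce a DoG response near zero, so only the frames around the abrupt change survive the threshold, which matches the idea of anchoring fingerprints at sharply changing points.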

Seo, Jin-Soo; Lee, Seung-Jae
- The Journal of the Acoustical Society of Korea / v.27 no.5 / pp.239-243 / 2008
This paper proposes a new audio identification method based on a combination of the instantaneous and dynamic spectral features of the audio spectrum. In particular, we propose spectro-temporal subband centroids, which are easy to compute and effectively summarize the instantaneous and dynamic spectral variations. Experimental results demonstrate that identification performance can be greatly improved by combining the spectral and the temporal subband centroids.
https://doi.org/10.7776/ASK.2008.27.5.239
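
The abstract above does not give the centroid formula, so the following is a sketch of the common definition of a spectral subband centroid: the power-weighted mean bin index within each band. The band edges, the fallback for an empty band, and the function name are assumptions made for illustration. A temporal centroid could be computed the same way by weighting frame indices instead of bin indices.

```python
def subband_centroids(power, band_edges):
    """Return the power-weighted mean bin index of each subband [lo, hi).

    power: power spectrum of one frame (one value per frequency bin).
    band_edges: increasing bin indices delimiting the subbands (assumed layout).
    """
    centroids = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        total = sum(power[lo:hi])
        if total == 0:
            # Empty band: fall back to the band center (an illustrative choice).
            centroids.append((lo + hi - 1) / 2)
        else:
            centroids.append(sum(k * power[k] for k in range(lo, hi)) / total)
    return centroids

# Toy frame: the first band has all its power in bin 2,
# the second band has equal power in bins 4 to 7.
power = [0, 0, 4, 0, 1, 1, 1, 1]
edges = [0, 4, 8]
centroids = subband_centroids(power, edges)
print(centroids)  # [2.0, 5.5]
```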

Kim, Young-Jun; Lee, Sung-Min; Lee, Jean-Ho; Kim, Tai-Yun
- Proceedings of the Korea Information Processing Society Conference / 2000.10a / pp.153-156 / 2000
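The rapid development and spread of communication technologies such as the Internet, together with advances in multimedia data compression technologies such as MPEG and JPEG, have made it possible to digitize various media, including documents, photographs, video, and audio data, and to transmit them at high speed. As anyone can now easily store, access, and use such multimedia content, ownership of copyrighted works has become a major issue. This paper reviews methods for protecting the copyright of multimedia data and examines the characteristics of each technique. It also classifies the relationships among the techniques and their taxonomy, and compares and analyzes watermarking, which is effective for copyright protection, with steganography and fingerprinting.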

Jeong, Byeong-Jun; Kim, Dae-Jin
- The Journal of the Korea Contents Association / v.12 no.5 / pp.1-9 / 2012
The popularity of digital music and smartphones has led to the development of noise-robust, real-time audio fingerprinting systems of various kinds. In particular, Multiple Hashing (MLH) is a fingerprinting algorithm that is robust to noise and has an elaborate structure. In this paper, we propose a filter engine based on MLH to achieve better performance. In this approach, we build an energy-intensive filter to improve the accuracy of Q/R from the music database and a statistical filter to remove continuity and redundancy. The energy-intensive filter exploits the property of the Discrete Cosine Transform (DCT) of concentrating energy in the low-order bits, and the statistical filter uses the correlation among the retrieved fingerprints' information. Experimental results show that the proposed algorithm, which combines energy and statistical filtering, performs better in noisy environments. The proposed filter engine is more robust to noise than Philips Robust Hash (PRH) and more compact than MLH.
https://doi.org/10.5392/JKCA.2012.12.05.001
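
The energy-concentrating property of the DCT that the abstract above relies on can be illustrated with a small sketch. This is not the paper's filter engine: the orthonormal DCT-II below, the test signal, and the choice of keeping four coefficients are assumptions made only to show that a smooth signal places most of its energy in the lowest-order DCT terms.

```python
import math

def dct2_orthonormal(x):
    """Orthonormal DCT-II, so the sum of squared coefficients equals the signal energy."""
    n = len(x)
    coeffs = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        acc = sum(x[t] * math.cos(math.pi * (t + 0.5) * k / n) for t in range(n))
        coeffs.append(scale * acc)
    return coeffs

def low_order_energy_fraction(x, keep):
    """Fraction of the total energy held by the first `keep` DCT coefficients."""
    energy = [c * c for c in dct2_orthonormal(x)]
    return sum(energy[:keep]) / sum(energy)

# Smooth toy signal: a slow ramp over 16 samples (hypothetical data).
signal = [1.0 + 0.1 * t for t in range(16)]
fraction = low_order_energy_fraction(signal, keep=4)
print(round(fraction, 4))
```

Because almost all of the energy sits in a handful of coefficients, a filter that keeps only the low-order terms discards little of the signal, which is one plausible reason for building a compact fingerprint on top of the DCT.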

Kim, Hyoung-Gook
- The Journal of The Korea Institute of Intelligent Transport Systems / v.6 no.2 / pp.95-106 / 2007
This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by the narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In the implementation, the tempo feature is extracted directly from the modified discrete cosine transform coefficients, which are the output of a partial MP3 (MPEG-1 Layer 3) decoder. Different features are then extracted from the amplitudes of the modulation spectrum and applied to different music information retrieval tasks. The logarithmic-scale modulation frequency coefficients are used in automatic music emotion classification and music genre classification, and the classification precision of both systems improves significantly. The bit vectors derived from the adaptive modulation spectrum are used in an audio fingerprinting task, where they are shown to achieve high robustness. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.
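
The idea of reading tempo from a modulation spectrum can be sketched as follows. The paper works on MDCT coefficients from a partial MP3 decoder; this sketch instead assumes a synthetic subband envelope and a plain DFT over time, and every name and parameter in it is hypothetical. The strongest non-DC modulation frequency is converted to beats per minute.

```python
import cmath
import math

def modulation_spectrum(envelope):
    """Magnitude DFT (bins 0..N/2) of the mean-removed envelope."""
    n = len(envelope)
    mean = sum(envelope) / n
    centered = [v - mean for v in envelope]
    return [abs(sum(centered[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n)))
            for k in range(n // 2 + 1)]

def dominant_tempo_bpm(envelope, frame_rate_hz):
    """Convert the strongest non-DC modulation bin to beats per minute."""
    spectrum = modulation_spectrum(envelope)
    k = max(range(1, len(spectrum)), key=lambda i: spectrum[i])
    return 60.0 * k * frame_rate_hz / len(envelope)

# Synthetic envelope: 4 seconds at 50 frames per second, pulsing at 2 Hz.
frame_rate = 50.0
envelope = [1.0 + math.cos(2 * math.pi * 2.0 * t / frame_rate) for t in range(200)]
bpm = dominant_tempo_bpm(envelope, frame_rate)
print(bpm)  # 120.0
```

A 2 Hz modulation corresponds to two beats per second, so the sketch reports 120 beats per minute for this input.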