Automatic Music Summarization Using Similarity Measure Based on Multi-Level Vector Quantization

다중레벨 벡터양자화 기반의 유사도를 이용한 자동 음악요약

  • Kim, Sung-Tak (The School of Engineering at Information and Communications University) ;
  • Kim, Sang-Ho (The School of Engineering at Information and Communications University) ;
  • Kim, Hoi-Rin (The School of Engineering at Information and Communications University)
  • Published : 2007.06.30

Abstract

Music summarization refers to a technique which automatically extracts the most important and representative segments in music content. In this paper, we propose and evaluate a technique which provides the repeated part in music content as music summary. For extracting a repeated segment in music content, the proposed algorithm uses the weighted sum of similarity measures based on multi-level vector quantization for fixed-length summary or optimal-length summary. For similarity measures, count-based similarity measure and distance-based similarity measure are proposed. The number of the same codeword and the Mahalanobis distance of features which have same codeword at the same position in segments are used for count-based and distance-based similarity measure, respectively. Fixed-length music summary is evaluated by measuring the overlapping ratio between hand-made repeated parts and automatically generated ones. Optimal-length music summary is evaluated by calculating how much automatically generated music summary includes repeated parts of the music content. From experiments we observed that optimal-length summary could capture the repeated parts in music content more effectively in terms of summary length than fixed-length summary.

Keywords

References

  1. L, Mani, M, T, Mabury, Advances in Automatic Text Summarization, (MIT Press, 1999)
  2. Y, Gong and X, Liu, 'Summarizing video by minimizing visual content redundancies,' in Proc, IEEE International Conference on Multimedia and Expo, 788-791, Tokyo, Japan, 2001
  3. I, Yahiaoui, B, Merialdo and B, Huet, 'Generating summaries of muli-episode video,' in Proc, IEEE International Conference on Multimedia and Expo, 792-795, Tokyo, Japan, 2001
  4. B, Logan and S, Chu, 'Music summarization using key phrases,' in Proc, IEEE International Conference on Audio, Speech and Signal Processing, 749-752, 2000
  5. C, Xu, Y, Zhu, and a, Tian, 'Automatic music summarization based on temporal, spectral and cepstral feature,' in Proc, IEEE International Conference on Multimedia and Expo, 117-120, 2002
  6. G, Peeter, A, L, Burthe, and X, Rodet, 'Toward automatic music audio summary generation from signal analysis,' in Proc, International Symposium on Music Information Retrieval, 2002
  7. C, Xu, X, Shao, N, C, Maddage, M, S. Kankanhalli, and a, Tian, 'Automatically summarize musical audio using adaptive clustering,' in Proc IEEE International Conference on Multimedia and Expo, 2063-2066, 2004
  8. X, Shao, N, C, Maddage, C, Xu, and M, S, Kankanhalli, 'Automatic music summarization based on music structure analysis,' in Proc, ICASSP, 2, 1169-1172, 2005
  9. X. Shao, C, Xu, Y, Wang, and M, S, Kankanhalli, 'Automatic music summarization in compressed domain,' in Proc, ICASSP, 4,261-264, 2004
  10. X, Shao, C, Xu, and M, S, Kankanhalli, 'A new approach to automatic music video summarization,' in Proc, ICIP, 625-628, 2004
  11. C,Xu, N, C, Maddage, and X, Shao, 'Automatic music classification and summarization,' IEEE Tans, Speech and Audio Process, 13 (3) May 2005
  12. Seoyoung Koh, Jeongsik Park, and Yung-hwan Oh, 'Improvement of mp3-based music summarization using linear regression,' in Proc, The KSPS Fall Conference, 55-58, 2005
  13. L,R, Rabiner and B.H, Juang, Fundamentals of Speech Recognition, (Prentice-Hall, 1993)