Study on the Performance of Spectral Contrast MFCC for Musical Genre Classification

스펙트럼 대비 MFCC 특징의 음악 장르 분류 성능 분석

  • 서진수 (강릉원주대학교 전자공학과)
  • Received : 2010.03.25
  • Accepted : 2010.05.13
  • Published : 2010.05.31

Abstract

This paper proposes a novel spectral audio feature, spectral contrast MFCC (SCMFCC), and studies its performance on the musical genre classification. For a successful musical genre classifier, extracting features that allow direct access to the relevant genre-specific information is crucial. In this regard, the features based on the spectral contrast, which represents the relative distribution of the harmonic and non-harmonic components, have received increased attention. The proposed SCMFCC feature utilizes the spectral contrst on the mel-frequency cepstrum and thus conforms the conventional MFCC in a way more relevant for musical genre classification. By performing classification test on the widely used music DB, we compare the performance of the proposed feature with that of the previous ones.

본 논문에서는 새로운 형태의 스펙트럼 특징인 스펙트럼 대비 MFCC (SCMFCC)를 제안하고 음악 장르 분류 성능을 분석하였다. 음악 장르 분류를 위해서는 장르 간의 차이를 두드러지게 할 수 있는 특징을 사용해야 하므로, 음악의 화음 구조 및 강약을 잘 표현하는 스펙트럼 대비 특징들이 관심을 받아왔다. 본 논문에서 제안된 SCMFCC는 멜 켑스트럼 상에서 스펙트럼의 대비를 이용하여 기존의 MFCC를 음악 분류에 적합하도록 변형했다. 널리 사용되고 있는 음악 장르 데이터베이스에서 실험을 수행하여, 제안된 SCMFCC 특징의 음악 장르 분류 성능을 기존의 다른 특징들과 비교하였다.

Keywords

References

  1. M. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, and M. Slaney, "Content-based music information retrieval: Current directions and future challenges," Proceedings of the IEEE, vol. 96, no. 4, 2008.
  2. E. Pampalk, A. Flexer, and G. Widmer, "Improvements of audio-based music similarity and genre classification," in Proc. Int. Conf. on Music Info. Retrieval (ISMIR-05), Sep. 2005.
  3. 이강규, 윤원중, 박규식, "한국 전통음악 (국악)에 대한 자동 장르 분류 시스템 구현," 한국음향학회지, vol. 24, no. 1, pp. 29-37, 2005.
  4. A. Meng, P. Ahrendt, J. Larsen, and L.K. Hansen, "Temporal feature integration for music genre classification," IEEE Trans. Audio, Speech, Language Process., vol. 15, no. 5, July 2007.
  5. R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, Wiley-Interscience, 2001.
  6. N. Scaringella, G. Zoia, and D. Mlynek, "Automatic genre classification of music content: a survey," IEEE Signal Processing Magazine, vol. 23, no. 2, pp. 133-141, Mar. 2006.
  7. Martin F. McKinney, Jeroen Breebaart, "Features for audio and music classification," in Proc. Int. Conf. on Music Info. Retrieval (ISMIR-03), 2003.
  8. J. Bergstra, N. Casagrande, D. Erhan, D. Eck, and B. Kegl, "Aggregate features and ADABOOST for music classification," Machine Learning, vol. 65, no. 2, pp. 473-484, Dec. 2006. https://doi.org/10.1007/s10994-006-9019-7
  9. D. N. Jiang, L. Lu, H. J. Zhang, J. H. Tao, and L. H. Cai, "Music type classification by spectral contrast feature,"in Proc. IEEE ICME 02, vol. 1, pp. 113-116, 2002.
  10. K. West, S. Cox, "Features and classifiers for the automatic classification of musical audio signals," in Proc. Int. Conf. on Music Info. Retrieval (ISMIR-04), 2004.
  11. I. Panagakis, E. Benetos, and C. Kotropoulos, "Music genre classification: A multilinear approach," in Proc. Int. Conf. on Music Info. Retrieval (ISMIR-08), Sep. 2008.
  12. A. Holzapfel and Y. Stylianou, "Musical genre classification using nonnegative matrix factorization-based features," IEEE Trans. Audio, Speech, Language Process., vol.16, no. 2, Feb. 2008.