DOI QR코드

DOI QR Code

Multiclass Music Classification Approach Based on Genre and Emotion

  • Jonghwa Kim (Department of Artificial Intelligence, Cheju Halla University)
  • Received : 2024.05.03
  • Accepted : 2024.05.15
  • Published : 2024.08.31

Abstract

Reliable and fine-grained musical metadata are required for efficient search of rapidly increasing music files. In particular, since the primary motive for listening to music is its emotional effect, diversion, and the memories it awakens, emotion classification along with genre classification of music is crucial. In this paper, as an initial approach towards a "ground-truth" dataset for music emotion and genre classification, we elaborately generated a music corpus through labeling of a large number of ordinary people. In order to verify the suitability of the dataset through the classification results, we extracted features according to MPEG-7 audio standard and applied different machine learning models based on statistics and deep neural network to automatically classify the dataset. By using standard hyperparameter setting, we reached an accuracy of 93% for genre classification and 80% for emotion classification, and believe that our dataset can be used as a meaningful comparative dataset in this research field.

Keywords

References

  1. N. Ndou, R. Ajoodha and A. Jadhav, "Music Genre Classification: A Review of Deep-Learning and Traditional Machine-Learning Approaches," IEEE Intern. IOT, Electronics and Mechatronics Conf., Toronto, pp. 1-6, 2021
  2. K. Kosina, "Music genre recognition," Fach-hochschule Hagenberg, Tech. Rep., 2002.
  3. JS. Gomez-Canon et al. "TROMPA-MER: an open dataset for personalized music emotion recognition." Journal of Intelligent Information System, 60, pp. 549-570, 2023. https://doi.org/10.1007/s10844-022-00746-0
  4. P. N. Justin, Musical Emotions Explained, Oxford University Press, 2019.
  5. J. Kim and E. Andre, "Emotion recognition based on physiological changes in listening music," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 30 (12), pp. 2067-2083, December, 2008 https://doi.org/10.1109/TPAMI.2008.26
  6. K. R. Scherer, "Which emotions can be induced by music? what are the underlying: Mechanisms? and how can we measure them?" Journal of New Music Research, vol. 33, no. 3, pp. 239-251, 2004. https://doi.org/10.1080/0929821042000317822
  7. Y. Chen, Y. Yang, J. Wang and H. Chen, "The AMG1608 dataset for music emotion recognition." In: ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 693-697, 2015
  8. B. L. Sturm, "A Survey of Evaluation in Music Genre Recognition. Adaptive Multimedia Retrieval, 2012
  9. M. Genussov and I. Cohen, "Musical genre classification of audio signals using geometric methods. In: EUSIPCO, IEEE, S. 497-501, 2010
  10. J. Burred and A. Lerch, "A Hierarchical Approach To Automatic Musical Genre Classification." In: in Proc. Of the 6 th Int. Conf. on Digital Audio Effects, S. 8-11, 2003
  11. S. Brecheisen, H. Kriegel, P. Kunath and A. Pryakhin, "Hierarchical Genre Classification for Large Music Collections." In: Proceedings of the ICME 2006, July 9-12, Toronto, Canada, 1385-1388, 2006
  12. C. Lee, J. Shin, K. Yu and J. Su, "Automatic Music Genre Classification using Modulation Spectral Contrast Feature." In: Multimedia and Expo, 2007 July, 2007
  13. T. Li, M. Ogihara and Q. LI, "A Comparative Study on Content-based Music Genre Classification." In: Proceedings of the 26th ACM SIGIR, NY, USA, 2003
  14. T. Li, A. Chan and A. Chun, "Automatic musical pattern feature extraction using convolutional neural network." In Proc. Int. Conf. Data Mining and Applications. 2010
  15. T. Lidy and A. Schindler. "Parallel convolutional neural networks for music genre and mood classification", MIREX2016, 2016.
  16. K. Hevner, "Experimental Studies of the Elements of Expression in Music." XLVII (1936), S. 246-268, 1936
  17. R. R. Thayer, The Biopsychology of Mood and Arousal. Oxford University Press, USA, 1990
  18. D. Han, Y. Kong, J. Han et al. A survey of music emotion recognition. Front. Comput. Sci. 16, 166335, 2022
  19. M. Xu, X. Li, H. Xianyu, J. Tian, F. Meng and W. Chen. Multi-scale Approaches to the MediaEval 2015 "Emotion in Music" Task. MediaEval Workshop, Sept.14-15, 2015
  20. E. Coutinho, G Trigeorgis, S. Zafeirious and B. Schuller. "Automatically estimation emotion in music with deep long-short term memory recurrent neural networks." In:Proceeding of the MediaEval Workshop, Sept.14-15, 2015
  21. JAMENDO, www.jamendo.com.
  22. MUSOPEN, www.musopen.org
  23. E. Allamanche, J. Herre and O. Hellmuth, "MPEG-7 audio low level descriptors for audio identification," Proposal 6832, ISO/IECJTC1/SC29/ WG11(MPEG), 2001
  24. D.N. Jiang, L. Lu, H.J. Zhang, J.H. Tao, and L.H. Cai, "Music Type Classification by Spectral Contrast Features." In: Multimedia and Expo, 2002. ICME '02. Proceedings. 2002
  25. F. Pedregosa et al. "Scikit-learn:Machine learning in python," 2018.
  26. F. Chollet et al., "Keras," https://github.com/fchollet/keras, 2015.