DOI QR코드

DOI QR Code

Multimodal Media Content Classification using Keyword Weighting for Recommendation

추천을 위한 키워드 가중치를 이용한 멀티모달 미디어 콘텐츠 분류

  • Kang, Ji-Soo (Department of Computer Science, Kyonggi University) ;
  • Baek, Ji-Won (Department of Computer Science, Kyonggi University) ;
  • Chung, Kyungyong (Division of Computer Science and Engineering, Kyonggi University)
  • 강지수 (경기대학교 컴퓨터과학과) ;
  • 백지원 (경기대학교 컴퓨터과학과) ;
  • 정경용 (경기대학교 컴퓨터공학부)
  • Received : 2019.02.22
  • Accepted : 2019.05.20
  • Published : 2019.05.28

Abstract

As the mobile market expands, a variety of platforms are available to provide multimodal media content. Multimodal media content contains heterogeneous data, accordingly, user requires much time and effort to select preferred content. Therefore, in this paper we propose multimodal media content classification using keyword weighting for recommendation. The proposed method extracts keyword that best represent contents through keyword weighting in text data of multimodal media contents. Based on the extracted data, genre class with subclass are generated and classify appropriate multimodal media contents. In addition, the user's preference evaluation is performed for personalized recommendation, and multimodal content is recommended based on the result of the user's content preference analysis. The performance evaluation verifies that it is superiority of recommendation results through the accuracy and satisfaction. The recommendation accuracy is 74.62% and the satisfaction rate is 69.1%, because it is recommended considering the user's favorite the keyword as well as the genre.

모바일 시장의 확장과 함께 멀티모달 미디어 콘텐츠의 제공을 위한 플랫폼이 다양해지고 있다. 멀티모달 미디어 콘텐츠에는 이종데이터들이 복합적으로 포함되어 있어 사용자들이 선호 콘텐츠를 선택하기 위해 시간과 노력이 요구된다. 따라서 본 논문에서는 추천을 위한 키워드 가중치를 이용한 멀티모달 미디어 콘텐츠 분류를 제안한다. 제안하는 방법은 멀티모달 미디어 콘텐츠의 텍스트 데이터에서 키워드 가중치를 통해 콘텐츠를 가장 잘 나타내는 키워드를 추출한다. 추출된 키워드를 기반으로 서브클래스를 갖는 장르 클래스를 생성하고 이에 적절한 멀티모달 미디어 콘텐츠를 분류한다. 또한 개인화된 추천을 위해 사용자의 선호도 평가를 진행하여 사용자의 콘텐츠 선호도 분석 결과를 기반으로 멀티모달 콘텐츠를 추천한다. 성능평가는 추천 결과의 정확도와 만족도를 통해 우수함을 검증한다. 이는 사용자가 선호하는 장르와 키워드를 모두 고려하여 추천하기 때문에 정확도는 74.62%, 만족도는 69.1%로 높게 나타난다.

Keywords

JKOHBZ_2019_v9n5_1_f0001.png 이미지

Fig. 1. Sub-classes included {Thriller, Action} Class

JKOHBZ_2019_v9n5_1_f0002.png 이미지

Fig. 2. Process of Multimodal Media Content Classification using Keyword Weighting for Recommendation

Table 1. Movie Keyword Extracted through Text Mining

JKOHBZ_2019_v9n5_1_t0001.png 이미지

Table 2. Performance Results of Recommendation Method

JKOHBZ_2019_v9n5_1_t0002.png 이미지

References

  1. J. Park, M. Kim & S. Rho. (2015). A Study for Personalized Multimedia Information Services. The Journal of Society for e-Business Studies, 20(3), 79-87. https://doi.org/10.7838/jsebs.2015.20.3.079
  2. J. H. Kim, D. Lee & K. Chung. (2014). Recommendation based on Context-aware Model for Personalized u-Healthcare Service. Multimedia Tools and Applications, 71(2), 855-872. https://doi.org/10.1007/s11042-011-0920-0
  3. J. Son & S. B. Kim. (2017). Content-based filtering for recommendation systems using multiattribute networks. Expert Systems with Applications, 89, 404-412. https://doi.org/10.1016/j.eswa.2017.08.008
  4. Y. Wu & A. L. P. Chen. (2000). Index structures of user profiles for efficient Web page filtering services. Proceedings 20th IEEE International Conference on Distributed Computing Systems, 644-651. DOI : 10.1109/ICDCS.2000.840981
  5. M. Balabanovic & Y. Shoham. (1997). Fab : content-based, collaborative recommendation. Communications of the ACM, 40(3), 66-72. https://doi.org/10.1145/245108.245124
  6. A. Aizawa. (2003). An information-theoretic perspective of tf-idf measures. Information Processing &Management, 39(1), 45-65. https://doi.org/10.1016/S0306-4573(02)00021-3
  7. S. Choi, J. Jeon, B. Subrata & O. Kwon. (2015). An Efficient Estimation of Place Brand Image Power Based on Text Mining Technology. Journal of Intelligence and Information Systems, 21(2), 113-129. https://doi.org/10.13088/jiis.2015.21.2.113
  8. A. Hotho, A. Nürnberger & G. Paass. (2005). A Brief Survey of Text Mining. LDV Forum, 20, 19-62.
  9. T. Ritchey. (2006). Problem structuring using computer-aided morphological analysis. Journal of the Operational Research Society, 57(7), 792-801. https://doi.org/10.1057/palgrave.jors.2602177
  10. S. S. Kang & Y. T. Kim. (1994). Syllable-based model for the Korean morphology. Association for Computational Linguistics, 1(15), 221-226.
  11. V. Dhanalakshmi, M. Anandkumar, R. U. Rekha, C. Arunkumar, K. P. Soman & S. Rajendran. (2009). Morphological Analyzer for Agglutinative Languages Using Machine Learning Approaches. International Conference on Advances in Recent Technologies in Communication and Computing, 433-435. DOI : 10.1109/ARTCom.2009.184
  12. J. Park, M. Kim & S. Rho. (2015). A Study for Personalized Multimedia Information Services. The Jounal of Society for e-Business Studies, 20(3), 79-87.
  13. GroupLens. (2016). Movielens. https://grouplens.org/
  14. TMDB. (2016). The Movie Database. https://www.themoviedb.org/
  15. P. F. Brown, P. V. Desouza, R. L. Mercer, V. J. D. Pietra & J. C. Lai. (1992). Class-based n-gram models of natural language. Computational linguistics, 18(4), 467-479.
  16. G. Salton & C. Buckley. (1988). Term-weighting approaches in automatic text retrieval. Information processing & management, 24(5), 513-523. https://doi.org/10.1016/0306-4573(88)90021-0
  17. S. Thomaidou & M. Vazirgiannis. (2011). Multiword keyword recommendation system for online advertising. IEEE, 423-427. DOI : 10.1109/ASONAM.2011.70
  18. C. W. Hang & M. P. Singh. (2010). Trust-based recommendation based on graph similarity. In Proceedings of the 13th International Workshop on Trust in Agent Societies, 82.
  19. J. C. Kim & K. Chung. (2019). Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks. KSII Transactions on Internet and Information Systems, 13(4), 2060-2077. https://doi.org/10.3837/tiis.2019.04.018
  20. J. C. Kim & K. Chung. (2019) Mining based Time-Series Sleeping Pattern Analysis for Life Big-data. Wireless Personal Communications, 105(2), 475-489. https://doi.org/10.1007/s11277-018-5983-z