DOI QR코드

DOI QR Code

Content-based Recommendation Based on Social Network for Personalized News Services

개인화된 뉴스 서비스를 위한 소셜 네트워크 기반의 콘텐츠 추천기법

  • Hong, Myung-Duk (Department of Computer and Information Engineering, Inha University) ;
  • Oh, Kyeong-Jin (Department of Computer and Information Engineering, Inha University) ;
  • Ga, Myung-Hyun (Department of Computer and Information Engineering, Inha University) ;
  • Jo, Geun-Sik (Department of Computer and Information Engineering, Inha University)
  • 홍명덕 (인하대학교 컴퓨터정보공학과) ;
  • 오경진 (인하대학교 컴퓨터정보공학과) ;
  • 가명현 (인하대학교 컴퓨터정보공학과) ;
  • 조근식 (인하대학교 컴퓨터정보공학과)
  • Received : 2013.08.16
  • Accepted : 2013.09.17
  • Published : 2013.09.30

Abstract

Over a billion people in the world generate new news minute by minute. People forecasts some news but most news are from unexpected events such as natural disasters, accidents, crimes. People spend much time to watch a huge amount of news delivered from many media because they want to understand what is happening now, to predict what might happen in the near future, and to share and discuss on the news. People make better daily decisions through watching and obtaining useful information from news they saw. However, it is difficult that people choose news suitable to them and obtain useful information from the news because there are so many news media such as portal sites, broadcasters, and most news articles consist of gossipy news and breaking news. User interest changes over time and many people have no interest in outdated news. From this fact, applying users' recent interest to personalized news service is also required in news service. It means that personalized news service should dynamically manage user profiles. In this paper, a content-based news recommendation system is proposed to provide the personalized news service. For a personalized service, user's personal information is requisitely required. Social network service is used to extract user information for personalization service. The proposed system constructs dynamic user profile based on recent user information of Facebook, which is one of social network services. User information contains personal information, recent articles, and Facebook Page information. Facebook Pages are used for businesses, organizations and brands to share their contents and connect with people. Facebook users can add Facebook Page to specify their interest in the Page. The proposed system uses this Page information to create user profile, and to match user preferences to news topics. However, some Pages are not directly matched to news topic because Page deals with individual objects and do not provide topic information suitable to news. Freebase, which is a large collaborative database of well-known people, places, things, is used to match Page to news topic by using hierarchy information of its objects. By using recent Page information and articles of Facebook users, the proposed systems can own dynamic user profile. The generated user profile is used to measure user preferences on news. To generate news profile, news category predefined by news media is used and keywords of news articles are extracted after analysis of news contents including title, category, and scripts. TF-IDF technique, which reflects how important a word is to a document in a corpus, is used to identify keywords of each news article. For user profile and news profile, same format is used to efficiently measure similarity between user preferences and news. The proposed system calculates all similarity values between user profiles and news profiles. Existing methods of similarity calculation in vector space model do not cover synonym, hypernym and hyponym because they only handle given words in vector space model. The proposed system applies WordNet to similarity calculation to overcome the limitation. Top-N news articles, which have high similarity value for a target user, are recommended to the user. To evaluate the proposed news recommendation system, user profiles are generated using Facebook account with participants consent, and we implement a Web crawler to extract news information from PBS, which is non-profit public broadcasting television network in the United States, and construct news profiles. We compare the performance of the proposed method with that of benchmark algorithms. One is a traditional method based on TF-IDF. Another is 6Sub-Vectors method that divides the points to get keywords into six parts. Experimental results demonstrate that the proposed system provide useful news to users by applying user's social network information and WordNet functions, in terms of prediction error of recommended news.

세계에는 수많은 사람들이 살아가고 있고, 사람들의 일상으로부터 매일, 매 시간 단위로 새로운 뉴스가 발생한다. 발생되는 뉴스는 예정된 일과 예상하지 못한 일들을 포함하고 있다. 발생하는 뉴스의 거대한 양과 이를 전달하는 수많은 미디어들로 인해 사람들은 뉴스 콘텐츠를 이용하는데 많은 시간을 소비하게 된다. 하지만 미디어에 시시각각 나타나는 속보와 실시간 이슈의 대부분이 가십 기사로 이루어져 있어 사용자들이 자신의 성향에 맞는 뉴스를 선별하고, 뉴스로부터 정보를 획득하는 것은 쉽지 않은 일이다. 또한 사용자의 관심사가 시간에 따라 변하기 때문에 뉴스 제공에 있어 사용자의 변하는 관심사를 반영하는 것이 요구된다. 본 논문에서는 사용자의 최근 관심사를 기반으로 사용자 선호도에 맞는 뉴스를 제공하기 위한 콘텐츠 기반의 추천 기법 및 시스템을 제안한다. 사용자의 최근 선호도를 파악하기 위하여 소셜 네트워크 서비스인 Facebook 사용자의 정보와 최근 게시글을 이용하여 동적으로 사용자 프로파일을 생성하여 이를 뉴스 서비스에 활용하고, 사용자 선호도에 적합한 뉴스를 추출하기 위해서 뉴스 콘텐츠의 분석을 요구한다. 뉴스 콘텐츠 분석을 위해 미디어에서 제공되는 뉴스의 카테고리를 사용하고, 뉴스 방송원고의 분석 및 주요 키워드 추출을 통해 뉴스 프로파일을 생성한다. 사용자 프로파일과 뉴스 프로파일 간의 유사도 측정을 위해서는 두 프로파일 간 형식의 일치화가 요구되므로 사용자 프로파일을 뉴스 프로파일과 동일한 형태로 생성한다. 사용자가 시스템에 접속하면 시스템은 사용자 프로파일에 명시된 선호도를 기반으로 뉴스 프로파일과의 유사도를 측정하고, 사용자 선호도에 가장 적합한 뉴스들을 제공하게 된다. 또한 사용자에게 제공된 뉴스 프로파일과 다른 뉴스 프로파일들 간에 유사도를 측정하여 유사도가 높은 관련된 뉴스들을 제공하게 된다. 제안한 개인화된 뉴스 서비스의 성능을 평가하기 위해 사용자에게 추천된 뉴스에 대한 사용자 평가와 시스템 예측값의 오차를 기반으로 6Sub-Vectors 벤치마크 알고리즘과 성능 평가를 수행하였고, 실험 결과를 통해 제안한 시스템의 우수성을 입증하였다.

Keywords

References

  1. Abel, F., E. Herder, G. J. Houben, N. Henze, and D. Krause, "Cross-system user modeling and personalization on the Social Web," User Modeling and User-Adapted Interaction, Springer Netherlands, Vol.23, No.2-3(2013), 169-209. https://doi.org/10.1007/s11257-012-9131-2
  2. Bollacker, K., C. Evans, P. Paritosh, T. Sturge, and J. Taylor, "Freebase : a collaboratively created graph database for structuring human knowledge," Proceedings of the 2008 ACM SIGMOD international conference on Management of data, (2008), 1247-1250.
  3. Cunningham, H., "GATE-a general architecture for text engineering," Journal of Computers and the Humanities, Vol.36, No.2(2002), 223-254. https://doi.org/10.1023/A:1014348124664
  4. Coleman, M. and T. L. Liau, "A computer readability formula designed for machine scoring," Journal of Applied Psychology, Vol.60, No.2(1975), 283-284. https://doi.org/10.1037/h0076540
  5. Ek, T., C. Kirkegaard, H. Jonsson and P. Nugues, "Named Entity Recognition for Short Text Messages," Procedia-Social and Behavioral Sciences, Vol.27(2011), 178-187. https://doi.org/10.1016/j.sbspro.2011.10.596
  6. Hong, K., H. Jeon and C. Jeon, "UserProfile-based personalized research paper recommendation system," Proceeding of 8th International Conference on Computing and Networking Technology, (2012), 134-138.
  7. Kompan, M. and M. Bielikova, "Content-Based News Recommendation," Proceedings of International Conference on E-Commerce and Web Technologies, (2010), 61-72.
  8. Lekakos, G. and P. Caravelas, "A hybrid approach for movie recommendation," Multimedia tools and applications, Vol.36. No.1-2(2008), 55-70. https://doi.org/10.1007/s11042-006-0082-7
  9. Li, L., D. Wang, T. Li, D. Know, and B. Padmanabhan, "SCENE : a scalable two-stage personalized news recommendation system," Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, (2011), 125-134.
  10. Liu, J., P. Dolan, and E. R. Pedersen, "Personalized News Recommendation Based on Click Behavior," Proceeding of international conference on Intelligent user interfaces, (2010), 31-40.
  11. Miller, G. A., "WordNet : a lexical database for English," Communications of the ACM, Vol. 38, No.11(1995), 39-41.
  12. Peter, B. and N. C. David, "Preface to the Special Issue on Personalization in Social Web systems," User Modeling and User-Adapted Interaction, Springer Netherlands, Vol.23, No. 2-3(2013), 83-87. https://doi.org/10.1007/s11257-012-9133-0
  13. Powers, S. and M. El-Nawawy, "Al-Jazeera English and global news networks : clash of civilizations or cross-cultural dialogue," Journal of Media, War and Conflict, Vol.2, No.3(2009), 263-284. https://doi.org/10.1177/1750635209345185
  14. Rao, J., A. Jia, Y. Feng, and D. Zhao, "Personalized News Recommendation Using Ontologies Harvested from the Web," Proceedings of International Conference on Web-Age Information Management, (2013), 781-787.
  15. Sean, V., K.-J. Oh, and G.-S. Jo, "A User Profilebased Filtering Method for Information Search in Smart TV Environment," Journal of Intelligence and Information Systems, Vol.18, No.3(2012), 97-117.
  16. Thay, S., I. Ha, and G.-S. Jo, "Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering," Journal of Intelligence and Information Systems, Vol.19, No. 2(2013), 1-20.
  17. You, T., A. N. Rosli, I. Ha, and G.-S. Jo, "Clustering Method based on Genre Interest for Cold-Start Problem in Movie Recommendation," Journal of Intelligence and Information Systems, Vol.19, No.1(2013), 57-77.
  18. Zhang, L., Webpage Personalization and User Profiling, Computational Advertising Workshop at SAMSI, 2012. Available at http://www.samsi.info/sites/default/files/liang_august2012.pdf(Downloaded 23 September, 2013).

Cited by

  1. Ontology-based Course Mentoring System vol.20, pp.2, 2014, https://doi.org/10.13088/jiis.2014.20.2.149
  2. Battle Recommendation Algorithm for Playerunknown’s Battleground Game vol.19, pp.11, 2013, https://doi.org/10.9728/dcs.2018.19.11.2067