Predicting the Performance of Recommender Systems through Social Network Analysis and Artificial Neural Network

사회연결망분석과 인공신경망을 이용한 추천시스템 성능 예측

  • Cho, Yoon-Ho (School of Management Information Systems, Kookmin University) ;
  • Kim, In-Hwan (Department of e-Business, Graduate School of Kookmin University)
  • 조윤호 (국민대학교 경영대학 경영정보학부) ;
  • 김인환 (국민대학교 대학원 e-비즈니스학과)
  • Received : 2010.11.21
  • Accepted : 2010.12.06
  • Published : 2010.12.31

Abstract

The recommender system is one of the possible solutions to assist customers in finding the items they would like to purchase. To date, a variety of recommendation techniques have been developed. One of the most successful recommendation techniques is Collaborative Filtering (CF) that has been used in a number of different applications such as recommending Web pages, movies, music, articles and products. CF identifies customers whose tastes are similar to those of a given customer, and recommends items those customers have liked in the past. Numerous CF algorithms have been developed to increase the performance of recommender systems. Broadly, there are memory-based CF algorithms, model-based CF algorithms, and hybrid CF algorithms which combine CF with content-based techniques or other recommender systems. While many researchers have focused their efforts in improving CF performance, the theoretical justification of CF algorithms is lacking. That is, we do not know many things about how CF is done. Furthermore, the relative performances of CF algorithms are known to be domain and data dependent. It is very time-consuming and expensive to implement and launce a CF recommender system, and also the system unsuited for the given domain provides customers with poor quality recommendations that make them easily annoyed. Therefore, predicting the performances of CF algorithms in advance is practically important and needed. In this study, we propose an efficient approach to predict the performance of CF. Social Network Analysis (SNA) and Artificial Neural Network (ANN) are applied to develop our prediction model. CF can be modeled as a social network in which customers are nodes and purchase relationships between customers are links. SNA facilitates an exploration of the topological properties of the network structure that are implicit in data for CF recommendations. An ANN model is developed through an analysis of network topology, such as network density, inclusiveness, clustering coefficient, network centralization, and Krackhardt's efficiency. While network density, expressed as a proportion of the maximum possible number of links, captures the density of the whole network, the clustering coefficient captures the degree to which the overall network contains localized pockets of dense connectivity. Inclusiveness refers to the number of nodes which are included within the various connected parts of the social network. Centralization reflects the extent to which connections are concentrated in a small number of nodes rather than distributed equally among all nodes. Krackhardt's efficiency characterizes how dense the social network is beyond that barely needed to keep the social group even indirectly connected to one another. We use these social network measures as input variables of the ANN model. As an output variable, we use the recommendation accuracy measured by F1-measure. In order to evaluate the effectiveness of the ANN model, sales transaction data from H department store, one of the well-known department stores in Korea, was used. Total 396 experimental samples were gathered, and we used 40%, 40%, and 20% of them, for training, test, and validation, respectively. The 5-fold cross validation was also conducted to enhance the reliability of our experiments. The input variable measuring process consists of following three steps; analysis of customer similarities, construction of a social network, and analysis of social network patterns. We used Net Miner 3 and UCINET 6.0 for SNA, and Clementine 11.1 for ANN modeling. The experiments reported that the ANN model has 92.61% estimated accuracy and 0.0049 RMSE. Thus, we can know that our prediction model helps decide whether CF is useful for a given application with certain data characteristics.

협업필터링 추천은 다양한 분야에서 활용되고 있지만 트랜잭션 데이터의 성격에 따라 추천 성능에 현저한 차이를 보이고 있다. 기존 연구에서는 이러한 추천 성능의 차이가 나타나는 이유에 대한 설명을 구체적으로 제시하지 못하고 있고 이에 따라 추천 성능의 예측 또한 연구된 바가 없다. 본 연구는 사회네트워크분석과 인공신경망 모형을 이용하여 협업필터링 추천시스템의 성능을 예측하고자 한다. 본 연구의 목적을 달성하기 위해 국내 백화점의 트랜잭션 데이터를 기반으로 형성되는 고객간 사회 네트워크의 구조적 지표를 측정한 후 이를 기반으로 인공신경망 모형을 구축하고 검증한다. 본 연구는 협업필터링 추천 성능을 예측할 수 있는 새로운 모형을 제시하였다는 점에서 그 의의가 있으며 이를 통해 기업들의 협업필터링 추천시스템 도입에 대한 의사결정에 도움을 줄 수 있을 것으로 기대된다.

Keywords

References

  1. 김용학, 사회연결망 분석, 박영사, 2003.
  2. 김재경, 조윤호, 김승태, 김혜경, "모바일 전자상거래 환경에 적합한 개인화된 추천시스템", 경영정보학연구, 15권 3호(2005), 223-241.
  3. 손동원, 사회네트워크 분석, 경문사, 2002.
  4. 박상길, 이해진, 심현진, 이정윤, 오재응, "회귀모형과 신경망모형을 이용한 차량공조시스템의 음질 인덱스 구축", 한국소음진동공학회 춘계학술대회논문집, (2006), 1-6.
  5. 박종학, 조윤호, 김재경, "사회연결망 : 신규고객 추천문제의 새로운 접근법", 지능정보연구, 15권 1호(2009), 123-139.
  6. 안현철, 김경재, 한인구, "다분류 Support Vector Machine을 이용한 한국 기업의 지능형 기업 채권평가모형", 경영학연구, 35권 5호(2006), 1479-1496.
  7. 조영빈, 조윤호, "구매순서를 고려한 개선된 협업필터링 방법론", 지능정보연구, 13권 2호(2007), 69-80.
  8. 조윤호, 방정혜, "신상품 추천을 위한 사회연결망 분석의 활용", 지능정보연구, 15권 4호(2009), 183-200.
  9. Adomavicious, G. and A. Tuzhilin, "Toward the next generation of recommender systems : A survey of the state-of-the-art and possible extensions", IEEE Transactions on Knowledge and Data Engineering, Vol.17, No.6(2005), 734-749. https://doi.org/10.1109/TKDE.2005.99
  10. Amorim, S., J. P. Barthelemy, and C. Ribeiro, "Clustering and clique partitioning : simulated annealing and tabu search approaches", Journal of Classification, Vol.9(1992), 17-41. https://doi.org/10.1007/BF02618466
  11. Bonacich, P., "Power and Centrality : A Family of Measures", American Journal of Sociology, Vol.92(1987), 1170-1182. https://doi.org/10.1086/228631
  12. Breiger, R., S. Boorman, and P. Arabie, "An algorithm for clustering relational data, with applications to social network analysis and comparison with multi-dimensional scaling", Journal of Mathematical Psychology, Vol.12(1975), 328-383. https://doi.org/10.1016/0022-2496(75)90028-0
  13. Bron, C. and Kerbosch, J., "Finding all cliques of an undirected graph", Communication of the ACM, Vol.16(1973), 575-577. https://doi.org/10.1145/362342.362367
  14. Burt, R. S., Structure 4.1 Reference Manual, NY : Columbia University, 1991.
  15. Burt, R. S., Structural Holes:The Social Structure of Competition, Cambridge, MA : Harvard University Press, 1992.
  16. Cho, Y. H. and J. K. Kim, "Application of web usage mining and product taxonomy to collaborative recommendations in e-commerce", Expert Systems with Applications, Vol.26, No.3(2004), 234-246.
  17. Frank, O. and F. Harary, "Cluster Inference by Using Transitivity Indices in Empirical Graphs", Journal of the American Statistical Association, Vol.77, No.380(1982), 835-840. https://doi.org/10.2307/2287315
  18. Freeman, L., "Centrality in Social Networks : Conceptual clarification", Social Networks, Vol.1(1979), 215-239.
  19. Huang, Z. and D. Zeng, "Why Does Collaborative Filtering Work?-Recommendation Model Validation and Selection by Analyzing Random Bipartite Graphs", the Fifteenth Annual Workshop on Information Technologies and Systems, 2005.
  20. Huang, Z., D. Zeng, and H. Chen, "A Comparative Study of Recommendation Algorithms in E-commerce Applications", IEEE Intelligent Systems, Vol.22, No.5(2007), 68-78. https://doi.org/10.1109/MIS.2007.4338497
  21. Human, S. E. and K. G. Provan, "Legitimacy Building in the Evolution of Small-Firm Multilateral Networks : A Comparative Study of Success and Demise", Administrative Science Quarterly, Vol.45, No.2(2000), 327-365. https://doi.org/10.2307/2667074
  22. Krackhardt, D., "Graph Theorical Dimentions of Informal Organizations", In Kathleen Carley and Michael Prietula(eds.), Computational Organizational Theory, Hillsdale, NJ : Lawrence Erilbaum Associations, (1994), 89-111.
  23. Kukkonen, H. O., K. Lyytinen, and Y. J. Yoo, "Social Networks and Information Systems : Ongoing and Future Research Streams", Journal of the Association for Information Systems, Vol.11(2010), 61-68. https://doi.org/10.17705/1jais.00222
  24. Liu, F. and H. J. Lee, "Use of social network information to enhance collaborative filtering performance", Expert Systems with Applications, Vol.37(2010), 4772-4778. https://doi.org/10.1016/j.eswa.2009.12.061
  25. Melville, P., R. J. Mooney, and R. Nagarajan, "Content-boosted Collaborative Filtering", Proceeding SIGIR 2001 Workshop on Recommender Systems, 2001.
  26. Oh, W., J. Choi, and K. Kim, "Coauthorship Dynamics and Knowledge Capital : The Patterns of Cross-disciplinary Collaboration in Information System Research", Journal of Management Information System, Vol.22, No.3(2006), 265-292.
  27. Ryu, Y. U., H. K. Kim, Y. H. Cho, and J. K. Kim, "Peer-oriented content recommendation in a social network", Proceedings of the Sixteenth Workshop on Information Technologies and Systems, (2006), 115-120.
  28. Sarwar, B., G. Karypis, J. A. Konstan, and J. Riedl, "Analysis of recommendation algorithms for e-commerce", Proceedings of ACM E-commerce Conference, (2000), 158-167.
  29. Schank, T. and D. Wagner, "Approximating Clustering Coefficient and Transitivity", JGAA, Vol.9, No.2(2005), 265-275. https://doi.org/10.7155/jgaa.00108
  30. Scott, J., Social Network Analysis : A Handbook, Thousand Oaks, CA : Sage, 2000.
  31. Seidman, S. B. and B. L. Foster, "A note on the potential for genuine cross-fertilization between anthropology and mathematics", Social Networks, Vol.1(1978), 65-72. https://doi.org/10.1016/0378-8733(78)90013-8
  32. Su, X. and T. M. Khoshgoftaar, "A Survey of Collaborative Filtering Techniques", Advances in Artificial Intelligence, Vol.2009(2009).
  33. Wasserman, S. and K. Faust, "Social Network Analysis : Methods and Application", Newyork : Cambridge University Press, 1994.
  34. Watts, D. J., Small worlds, Princeton University Press, Princeton, New Jersey, 1999.
  35. Weare, C., Loges, W. E., Oztas, N., "Email Effects on the Structure of Local Associations : A Social Network Analysis", Social Science Quarterly, Vol.88, No.1(2007), 222-243. https://doi.org/10.1111/j.1540-6237.2007.00455.x