DOI QR코드

DOI QR Code

Extension Method of Association Rules Using Social Network Analysis

사회연결망 분석을 활용한 연관규칙 확장기법

  • Lee, Dongwon (School of Business Administration, College of Social Sciences, Hansung University)
  • Received : 2017.07.31
  • Accepted : 2017.09.20
  • Published : 2017.12.31

Abstract

Recommender systems based on association rule mining significantly contribute to seller's sales by reducing consumers' time to search for products that they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendation if it records insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study was aimed at expanding association rules to include into the list of recommendations those products whose initial trading frequency of transactions is low despite the possibility of high sales. The particular purpose is to predict the strength of the direct connection of two unconnected items through the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as the interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect the two nodes without direct connection. The next step identifies the number of the paths and the shortest among them. These extracts are used as independent variables in the regression analysis to predict future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected for three months from February to April in 2016 from an online commerce company. To reduce the complexity of analytics as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which secures a link needed for constituting a social network. The direction of the link was determined in the order in which the goods were purchased. Except for the last ten days of the data collection period, the social network of associated items was built for the extraction of independent variables. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected for the last ten days. Through experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicts, 269 were confirmed to have been connected. This is 4.4 times more than the average of 61, which can be found without any prediction model. This study is expected to be useful regarding industries whose new products launch quickly with short life cycles, since their exposure time is critical. Also, it can be used to detect diseases that are rarely found in the early stages of medical treatment because of the low incidence of outbreaks. Since the complexity of the social networking analysis is sensitive to the number of nodes and links that make up the network, this study was conducted in a particular category of miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories of classification.

연관 상품 추천은 수많은 상품을 다루는 온라인 상거래에서 소비자의 상품 탐색 시간을 줄여주며 판매자의 매출 증대에 크게 기여한다. 이는 주문과 같은 거래의 빈도를 기반으로 생성되므로, 통계적으로 판매 확률이 높은 상품을 효과적으로 선별할 수 있다. 하지만, 판매 가능성이 높은 경우라도 신상품처럼 판매 초기에 거래 건수가 충분하지 않은 상품은 추천에서 누락될 수 있다. 연관 추천에서 누락된 상품은 이로 인해 노출 기회를 잃게 되고, 이는 거래 건수 감소로 이어져, 또 다시 추천 기회를 잃는 악순환을 겪을 수도 한다. 따라서, 충분한 거래 건수가 쌓이기 전까지 초기 매출은 일정 기간 동안 정체되는 현상을 보이는데, 의류 등과 같이 유행에 민감하거나 계절 변화에 영향을 많이 받는 상품은 이로 인해 매출에 큰 타격을 입을 수도 있다. 본 연구는 이와 같이 거래 초기의 낮은 거래 빈도로 인해 잘 드러나지 않는 상품 간의 잠재적인 연관성을 찾아 추천 기회를 확보할 수 있도록 연관 규칙을 확장하기 위한 목적으로 수행되었다. 두 상품 간에 직접적인 연관성이 나타나지 않더라도 다른 상품을 매개로 두 상품 간의 잠재적 연관성을 예측할 수 있을 것이며, 이런 연관성은 주문에서 나타나는 상품 간 상호작용으로 표현될 수 있으므로, 사회연결망 분석을 활용한 분석을 시도하였다. 사회연결망 분석기법을 통해 각 상품의 속성과 두 상품 간 경로의 특성을 추출하고 회귀분석을 실시하여, 두 상품 간 경로의 최단 거리 및 경로의 개수, 각 상품이 얼마나 많은 상품과 연관성을 갖는지, 두 상품의 분류 카테고리가 어느 정도 일치하는지가 두 상품 간의 잠재적 연관성에 미친다는 것을 확인하였다. 모형의 성능을 평가하기 위해, 일정 기간의 주문 데이터로부터 연결망을 구성하고, 이후 10일 간 생성될 상품 간 연관성을 예측하는 실험을 진행하였다. 실험 결과는 모형을 적용하지 않는 경우보다 제안 모형을 활용할 때 훨씬 많은 연관성을 찾을 수 있음을 보여준다.

Keywords

References

  1. Agrawal, R., T. Imielinski, A. Swami. "Mining association rule between sets of items in large databases," Proc. 1993 ACM SIGMOD international conference on management of data, (1993), 207-216.
  2. Adomavicius, G., A. Tuzhilin. "Context-Aware Recommender Systems. Recommender Systems Handbook, Springer US, (2011), 217-253.
  3. Anand, S.S., A.R. Patrick. "A Data Mining methodology for cross-sales," Knowledge-Based Systems, Vol.10, No.7(1998), 449-461. https://doi.org/10.1016/S0950-7051(98)00035-5
  4. Ansari, A., S. Essegaier, R. Kohli. "Internet recommender systems," Journal of Marketing Research, Vol.37, No.3(2000), 363-375. https://doi.org/10.1509/jmkr.37.3.363.18779
  5. Balabanovic, M., Y. Shoham. "Content-Based, Collaborative, Recommendation," Communications of the ACM, Vol.40, No.3 (1997), 66-72. https://doi.org/10.1145/245108.245124
  6. Bodapati, A.V. "Recommender systems with purchase data," Journal of Marketing Research, Vol.45, No.1(2008), 77-93. https://doi.org/10.1509/jmkr.45.1.77
  7. Chen, Y.L., J.M. Chen, C.W. Tung. "A data mining approach for retail knowledge discovery with consideration of the effect of shelf-space adjacency on sales," Decision Support Systems, Vol.42, No.3(2006), 1503-1520. https://doi.org/10.1016/j.dss.2005.12.004
  8. Choi, S., Hyun, Y., Kim, N. "Improving Performance of Recommendation Systems Using Topic Modeling," Journal of Intelligence and Information Systems, Vol.21, No.3(2015), 101-116. https://doi.org/10.13088/jiis.2015.21.3.101
  9. Choi, S., Kwahk, K.-Y., Ahn, H. "Enhancing Predictive Accuracy of Collaborative Filtering Algorithms using the Network Analysis of Trust Relationship among Users," Journal of Intelligence and Information Systems, Vol.22, No.3(2016), 113-127. https://doi.org/10.13088/jiis.2016.22.3.113
  10. Fleder, D., K. Hosanagar. "Blockbuster culture's next rise or fall: The impact of recommender systems on sales diversity," Management Science, Vol.55, No.5(2009), 697-712. https://doi.org/10.1287/mnsc.1080.0974
  11. Kang, B. S., "A Novel Web Recommendation Method for New Customers Using Structural Holes in Social Networks," Journal of Industrial Economics and Business, Vol.23, No.5(2010), 2371-2385.
  12. Kim, H. K., Choi, I. Y., Ha, K. M., Kim, J. K. "Development of User Based Recommender System using Social Network for u-Healthcare," Journal of Intelligence and Information Systems, Vol.16. No.3(2010), 181-199.
  13. Kim, B. K., S. Lee, S. Bang, J. Kim, and J. H. Lee, "Personalized Recommendation System Using Social Network," Proceedings of the Conference on Intelligent Information Systems, Vol.20, No.1(2010), 48-49.
  14. Kim, J., Lee, S.-W. "The Ontology Based, the Movie Contents Recommendation Scheme, Using Relations of Movie Metadata," Journal of Intelligence and Information Systems, Vol.19, No.3(2013), 25-44. https://doi.org/10.13088/jiis.2013.19.3.025
  15. Kim, K.-J., Kim, B.-G. "Product Recommender System for Online Shopping Malls using Data Mining Techniques," Journal of Intelligence and Information Systems, Vol.11, No.1(2005), 191-205.
  16. Kim, M., and K. J. Kim, "Recommender Systems using Structural Hole and Collaborative Filtering," Journal of Intelligence and Information Systems, Vol.20, No.4(2014), 107-120. https://doi.org/10.13088/jiis.2014.20.4.107
  17. Kim, M. G., and K. J. Kim, " Recommender Systems using SVD with Social Network Information," Journal of Intelligence and Information Systems, Vol.22, No.4(2016), 1-18. https://doi.org/10.13088/JIIS.2016.22.4.001
  18. Kim, S. H., and R. S. Chang, "The Study on the Research Trend of Social Network Analysis and the its Applicability to Information Science," Journal of the Korean Society for Information Management, Vol.27, No.4 (2010), 71-87. https://doi.org/10.3743/KOSIM.2010.27.4.071
  19. Kim, Y., W.N. Street. "An intelligent system for customer targeting: a data mining approach," Decision Support Systems, Vol.37, No.2 (2004), 215-228. https://doi.org/10.1016/S0167-9236(03)00008-3
  20. Konstan, J.A., B.N. Miller, D. Maltz, J.L. Herlocker, L.R. Gordon, J. Riedl. "GroupLens: applying collaborative filtering to Usenet news," Communications of the ACM, Vol.40, No.3(1997), 77-87. https://doi.org/10.1145/245108.245126
  21. Lee, D., S. Park, S. Moon. "Utility-based association rule mining: A marketing solution for cross-selling," Expert Systems with Applications. Vol.40, No.7(2013), 2715-25. https://doi.org/10.1016/j.eswa.2012.11.021
  22. Noh, H., S. Choi, and H. Ahn, "Social Network-based Hybrid Collaborative Filtering using Genetic Algorithms," Journal of Intelligence and Information Systems, Vol.23, No.2(2017), 19-38. https://doi.org/10.13088/JIIS.2017.23.2.019
  23. Park, J. H., Y. H. Cho, and J. K. Kim, "Social Network : A Novel Approach to New Customer Recommendations," Journal of Intelligence and Information Systems, Vol.15, No.1(2009), 123-140.
  24. Shin, C. H., J. W. Lee, H. N. Yang, and I. Y. Choi, "The Research on Recommender for New Customers Using Collaborative Filtering and Social Network Analysis," Journal of Intelligence and Information Systems, Vol.18, No.4(2012), 19-42. https://doi.org/10.13088/JIIS.2012.18.4.019
  25. Yun, Y., and S. Chae, Introduction to Complex Systems, Samsung Economic Research Institute, 2005.
  26. Sohn D., Social Network Analysis, Kyungmoon Publications, 2002.
  27. Y. Kim, Social Network Analysis, Pakyoungsa, 2003.

Cited by

  1. 텍스트마이닝 기법을 활용한 국내 음식관광 연구 동향 분석 vol.35, pp.1, 2020, https://doi.org/10.7318/kjfc/2020.35.1.65
  2. 네트워크 중심성 척도가 추천 성능에 미치는 영향에 대한 연구 vol.27, pp.1, 2021, https://doi.org/10.13088/jiis.2021.27.1.023