DOI QR코드

DOI QR Code

Product Recommender Systems using Multi-Model Ensemble Techniques

다중모형조합기법을 이용한 상품추천시스템

  • 이연정 (동국대학교_서울 일반대학원 경영정보학과) ;
  • 김경재 (동국대학교_서울 경영학부)
  • Received : 2013.06.11
  • Accepted : 2013.06.18
  • Published : 2013.06.30

Abstract

Recent explosive increase of electronic commerce provides many advantageous purchase opportunities to customers. In this situation, customers who do not have enough knowledge about their purchases, may accept product recommendations. Product recommender systems automatically reflect user's preference and provide recommendation list to the users. Thus, product recommender system in online shopping store has been known as one of the most popular tools for one-to-one marketing. However, recommender systems which do not properly reflect user's preference cause user's disappointment and waste of time. In this study, we propose a novel recommender system which uses data mining and multi-model ensemble techniques to enhance the recommendation performance through reflecting the precise user's preference. The research data is collected from the real-world online shopping store, which deals products from famous art galleries and museums in Korea. The data initially contain 5759 transaction data, but finally remain 3167 transaction data after deletion of null data. In this study, we transform the categorical variables into dummy variables and exclude outlier data. The proposed model consists of two steps. The first step predicts customers who have high likelihood to purchase products in the online shopping store. In this step, we first use logistic regression, decision trees, and artificial neural networks to predict customers who have high likelihood to purchase products in each product group. We perform above data mining techniques using SAS E-Miner software. In this study, we partition datasets into two sets as modeling and validation sets for the logistic regression and decision trees. We also partition datasets into three sets as training, test, and validation sets for the artificial neural network model. The validation dataset is equal for the all experiments. Then we composite the results of each predictor using the multi-model ensemble techniques such as bagging and bumping. Bagging is the abbreviation of "Bootstrap Aggregation" and it composite outputs from several machine learning techniques for raising the performance and stability of prediction or classification. This technique is special form of the averaging method. Bumping is the abbreviation of "Bootstrap Umbrella of Model Parameter," and it only considers the model which has the lowest error value. The results show that bumping outperforms bagging and the other predictors except for "Poster" product group. For the "Poster" product group, artificial neural network model performs better than the other models. In the second step, we use the market basket analysis to extract association rules for co-purchased products. We can extract thirty one association rules according to values of Lift, Support, and Confidence measure. We set the minimum transaction frequency to support associations as 5%, maximum number of items in an association as 4, and minimum confidence for rule generation as 10%. This study also excludes the extracted association rules below 1 of lift value. We finally get fifteen association rules by excluding duplicate rules. Among the fifteen association rules, eleven rules contain association between products in "Office Supplies" product group, one rules include the association between "Office Supplies" and "Fashion" product groups, and other three rules contain association between "Office Supplies" and "Home Decoration" product groups. Finally, the proposed product recommender systems provides list of recommendations to the proper customers. We test the usability of the proposed system by using prototype and real-world transaction and profile data. For this end, we construct the prototype system by using the ASP, Java Script and Microsoft Access. In addition, we survey about user satisfaction for the recommended product list from the proposed system and the randomly selected product lists. The participants for the survey are 173 persons who use MSN Messenger, Daum Caf$\acute{e}$, and P2P services. We evaluate the user satisfaction using five-scale Likert measure. This study also performs "Paired Sample T-test" for the results of the survey. The results show that the proposed model outperforms the random selection model with 1% statistical significance level. It means that the users satisfied the recommended product list significantly. The results also show that the proposed system may be useful in real-world online shopping store.

전자상거래의 폭발적 증가는 소비자에게 더 유리한 많은 구매 선택의 기회를 제공한다. 이러한 상황에서 자신의 구매의사결정에 대한 확신이 부족한 소비자들은 의사결정 절차를 간소화하고 효과적인 의사결정을 위해 추천을 받아들인다. 온라인 상점의 상품추천시스템은 일대일 마케팅의 대표적 실현수단으로써의 가치를 인정받고 있다. 그러나 사용자의 기호를 제대로 반영하지 못하는 추천시스템은 사용자의 실망과 시간낭비를 발생시킨다. 본 연구에서는 정확한 사용자의 기호 반영을 통한 추천기법의 정교화를 위해 데이터마이닝과 다중모형조합기법을 이용한 상품추천시스템 모형을 제안하고자 한다. 본 연구에서 제안하는 모형은 크게 두 개의 단계로 이루어져 있으며, 첫 번째 단계에서는 상품군 별 우량고객 선정 규칙을 도출하기 위해서 로지스틱 회귀분석 모형, 의사결정나무 모형, 인공신경망 모형을 구축한 후 다중모형조합기법인 Bagging과 Bumping의 개념을 이용하여 세 가지 모형의 결과를 조합한다. 두 번째 단계에서는 상품군 별 연관관계에 관한 규칙을 추출하기 위하여 장바구니분석을 활용한다. 상기의 두 단계를 통하여 상품군 별로 구매가능성이 높은 우량고객을 선정하여 그 고객에게 관심을 가질만한 같은 상품군 또는 다른 상품군 내의 다른 상품을 추천하게 된다. 제안하는 상품추천시스템은 실제 운영 중인 온라인 상점인 'I아트샵'의 데이터를 이용하여 프로토타입을 구축하였고 실제 소비자에 대한 적용가능성을 확인하였다. 제안하는 모형의 유용성을 검증하기 위하여 제안 상품추천시스템의 추천과 임의 추천을 통한 추천의 결과를 사용자에게 제시하고 제안된 추천에 대한 만족도를 조사한 후 대응표본 T검정을 수행하였으며, 그 결과 사용자의 만족도를 유의하게 향상시키는 것으로 나타났다.

Keywords

References

  1. Adomavicius, G. and A. Tuzhilin, "Toward the next generation of recommender systems : a survey of the state-of-the-art and possible extensions," IEEE Transactions on Knowledge and Data Engineering, Vol.17, No.6(2005), 734-749. https://doi.org/10.1109/TKDE.2005.99
  2. Ahn, H. C., I. Han, and K. Kim, "The Product Recommender System Combining Association Rules and Classification Models : The Case of G Internet Shopping Mall," Information Systems Review, Vol.8, No.1(2006), 181-201.
  3. Breiman, L., "Heuristics of instability in model selection," Technical Report, Statistics Department, University of California at Berkeley, 1994.
  4. Breiman, L., "Bagging predictors," Machine Learning, Vol.24, No.2(1996), 123-140.
  5. Cho, Y. H. and J. K. Kim, "Application of Web usage mining and product taxonomy to collaborative recommendations in e-commerce," Expert Systems with Applications, Vol.26 (2004), 233-246. https://doi.org/10.1016/S0957-4174(03)00138-6
  6. Cho, Y. H., J. K. Kim, and S. H. Kim, "A personalized recommender system based on Web usage mining and decision tree induction," Expert Systems with Applications, Vol.23(2002), 329-342.
  7. Cho, Y. H. and J. H. Bang, "Applying Centrality Analysis to Solve the Cold-Start and Sparsity Problems in Collaborative Filtering," Journal of Intelligence and Information Systems, Vol.17, No.3(2009), 183-199.
  8. Cho, Y. H., S. K. Park, D. H. Ahn, and J. K. Kim, "Collaborative Recommendations using Adjusted Product Hierarchy : Methodology and Evaluation," Journal of the Korean Operations Research and Management Science Society, Vol.29, No.2(2004), 59-75.
  9. Goldberg, D., D. Nichols, B. M. Oki, and D. Terry, "Using collaborative filtering to weave an information tapestry," Communications of the ACM, Vol.35, No.12(1992), 61-70.
  10. Heskes, T., "Balancing between bagging and bumping," Advances in Neural Information Processing Systems, Cambridge, MIT Press, (1996), 466-472.
  11. Kang, B., "Collaborative Filtering System using Self-Organizing Map for Web Personalization," Journal of Intelligence and Information Systems, Vol.9, No.3(2003), 117-135.
  12. Kim, D. and B.-J. Yum, "Collaborative filtering based on iterative principal component analysis," Expert Systems with Applications, Vol.28, No.4(2005), 823-830. https://doi.org/10.1016/j.eswa.2004.12.037
  13. Kim, J. K., Y. H. Cho, W. J. Kim, J. R. Kim, and J. H. Suh, "A personalized recommendation procedure for Internet shopping support," Electronic Commerce Research and Applications, Vol.1(2002a), 301-313. https://doi.org/10.1016/S1567-4223(02)00022-4
  14. Kim, J. K., D. H. Ahn, and Y. H. Cho, "A Personalized Recommender System, WebCF-PT : A Collaborative Filtering using Web Mining and Product Taxonomy," Asia Pacific Journal of Information Systems, Vol.15, No.1( 2005), 63-79.
  15. Kim, J. K., D. H. Ahn, and Y. H. Cho, "Development of a personalized recommendation procedure based on data mining techniques for internet shopping malls," Journal of Intelligence and Information Systems, Vol.9, No.3(2003), 177-191.
  16. Kim, J. K., J. H. Suh, D. H. Ahn, and Y. H. Cho, "A personalized recommendation methodology based on collaborative filtering," Journal of Intelligence and Information Systems, Vol.8, No.2(2002b), 139-157.
  17. Kim, J. W., S. J. Bae, and H. J. Lee, "Sparsity Effect on Collaborative Filtering-based Personalized Recommendation," Asia Pacific Journal of Information Systems, Vol.14, No.2(2004), 131-149. https://doi.org/10.1111/j.1365-2575.2004.00167.x
  18. Kim, K. and H. Ahn, "Collaborative filtering with a user-item matrix reduction technique for recommender systems," International Journal of Electronic Commerce, Vol.16, No.1(2011), 107-128. https://doi.org/10.2753/JEC1086-4415160104
  19. Kim, K. and. B. Kim, "Product Recommender System for Online Shopping Malls using Data Mining Techniques," Journal of Intelligence and Information Systems, Vol.11, No.1(2005), 191-205.
  20. Konstan, j., B. Miller, D. Maltz, J. Herlocker, L. Gordon, and J. Riedl, "GroupLens : Applying Collaborative Filtering to Usenet News," Communication of the ACM, Vol.40(1997), 77-87. https://doi.org/10.1145/245108.245126
  21. Lee, Y. and S. Kwak, "A study on training ensembles of neural networks : a case of stock price prediction," Journal of Intelligence and Information Systems, Vol.5, No.1(1999), 95-101.
  22. Park, J. H., Y. H. Cho, and J. K. Kim, "Social Network : A Novel Approach to New Customer Recommendations," Journal of Intelligence and Information Systems, Vol.15, No.1(2009), 123-140.
  23. Pazzani, M. J., "A framework for collaborative, content- based and demographic filtering," Artificial Intelligence Review, Vol.13, No.5-6(1999), 393-408. https://doi.org/10.1023/A:1006544522159
  24. Resnick, P., N. Iacovou, M. Suchak, and P. Bergstrom, "GroupLens : An open architecture for collaborative filtering of netnews," Proceedings of the ACM Conference on Computer Supported Cooperative Work, (1994), 175-186.
  25. Roh, T. H., K. J. Oh, and I. Han, "The collaborative filtering recommendation based on SOM cluster-indexing CBR," Expert Systems with Applications, Vol.25, No.3(2003), 413-423. https://doi.org/10.1016/S0957-4174(03)00067-8
  26. Sarwar, B. M., G. Karypis, J. A. Konstan, and J. Riedl, "Analysis of recommendation algorithms for e-commerce," Proceedings of Conference on ACM, (2000), 158-167.
  27. Tibshirani, R. and K. Knight, "Model search and inference by bootstrap 'bumping'," Technical Report, University of Toronto, 1995.
  28. Wu, K.-L., C. C. Aggarwal, and P. S. Yu, "Personalization with dynamic profiler," Proceedings of the Third International Workshop on Advanced Issues of E-commerce and Web-based Information Systems, (2001), 12-20.

Cited by

  1. Methodology for Issue-related R&D Keywords Packaging Using Text Mining vol.16, pp.2, 2015, https://doi.org/10.7472/jksii.2015.16.2.57
  2. VOC 기반 연관규칙 마이닝을 이용한 통신선로설비의 장애 예측 vol.11, pp.4, 2015, https://doi.org/10.17662/ksdim.2015.11.4.013
  3. 은행 금융상품에서 프라이빗 뱅커의 전문투자형 사모펀드 추천 의사결정 vol.28, pp.4, 2013, https://doi.org/10.5859/kais.2019.28.4.333
  4. EnPSO: An AutoML Technique for Generating Ensemble Recommender System vol.46, pp.9, 2013, https://doi.org/10.1007/s13369-021-05670-z