The Prediction of Purchase Amount of Customers Using Support Vector Regression with Separated Learning Method

Support Vector Regression에서 분리학습을 이용한 고객의 구매액 예측모형

  • Hong, Tae-Ho (School of Business, Pusan National University) ;
  • Kim, Eun-Mi (School of Business, Pusan National University)
  • Received : 2010.11.24
  • Accepted : 2010.12.08
  • Published : 2010.12.31

Abstract

Data mining has empowered the managers who are charge of the tasks in their company to present personalized and differentiated marketing programs to their customers with the rapid growth of information technology. Most studies on customer' response have focused on predicting whether they would respond or not for their marketing promotion as marketing managers have been eager to identify who would respond to their marketing promotion. So many studies utilizing data mining have tried to resolve the binary decision problems such as bankruptcy prediction, network intrusion detection, and fraud detection in credit card usages. The prediction of customer's response has been studied with similar methods mentioned above because the prediction of customer's response is a kind of dichotomous decision problem. In addition, a number of competitive data mining techniques such as neural networks, SVM(support vector machine), decision trees, logit, and genetic algorithms have been applied to the prediction of customer's response for marketing promotion. The marketing managers also have tried to classify their customers with quantitative measures such as recency, frequency, and monetary acquired from their transaction database. The measures mean that their customers came to purchase in recent or old days, how frequent in a period, and how much they spent once. Using segmented customers we proposed an approach that could enable to differentiate customers in the same rating among the segmented customers. Our approach employed support vector regression to forecast the purchase amount of customers for each customer rating. Our study used the sample that included 41,924 customers extracted from DMEF04 Data Set, who purchased at least once in the last two years. We classified customers from first rating to fifth rating based on the purchase amount after giving a marketing promotion. Here, we divided customers into first rating who has a large amount of purchase and fifth rating who are non-respondents for the promotion. Our proposed model forecasted the purchase amount of the customers in the same rating and the marketing managers could make a differentiated and personalized marketing program for each customer even though they were belong to the same rating. In addition, we proposed more efficient learning method by separating the learning samples. We employed two learning methods to compare the performance of proposed learning method with general learning method for SVRs. LMW (Learning Method using Whole data for purchasing customers) is a general learning method for forecasting the purchase amount of customers. And we proposed a method, LMS (Learning Method using Separated data for classification purchasing customers), that makes four different SVR models for each class of customers. To evaluate the performance of models, we calculated MAE (Mean Absolute Error) and MAPE (Mean Absolute Percent Error) for each model to predict the purchase amount of customers. In LMW, the overall performance was 0.670 MAPE and the best performance showed 0.327 MAPE. Generally, the performances of the proposed LMS model were analyzed as more superior compared to the performance of the LMW model. In LMS, we found that the best performance was 0.275 MAPE. The performance of LMS was higher than LMW in each class of customers. After comparing the performance of our proposed method LMS to LMW, our proposed model had more significant performance for forecasting the purchase amount of customers in each class. In addition, our approach will be useful for marketing managers when they need to customers for their promotion. Even if customers were belonging to same class, marketing managers could offer customers a differentiated and personalized marketing promotion.

본 연구에서는 기업의 마케팅 프로모션에 따른 반응고객의 구매액 예측을 위한 방법을 제시하고 SVR의 효과적인 학습방법을 제시하였다. 프로모션에 의한 고객의 구매액을 기반으로 고객을 5등급으로 등급화하고 각 등급 내에서 SVR을 적용하여 고객의 구매액을 예측하였다. 본 연구에서 제안하는 예측된 고객의 등급 내에서 고객 구매액을 예측하는 분리데이터 학습법이 프로모션에 반응한 모든 고객을 대상으로 구매액을 예측하는 전체데이터 학습법보다 높은 예측성과를 보여주었다. 일반적으로 세분화된 고객집단을 하나의 집단으로 보고 동일한 마케팅 전략을 제시하나 본 연구를 통해 구매액에 따라 등급화 된 고객의 등급 내에서 다시 고객의 거래 구매액을 예측하여 동일한 집단 내에서도 차별화된 마케팅 전략을 제시할 수 있는 기반을 제시하였다. 즉 동일한 등급에서도 고객 구매액에 따라 고객의 우선순위를 정할 수 있으며, 이는 마케팅 담당자가 프로모션을 제시할 고객을 선정할 때 유용한 정보로 활용될 수 있다.

Keywords

References

  1. 고용식, "세일즈 프로모션전략으로서의 VMD에 관한 연구", 한국마케팅과학회 2005 춘계학술대회 발표논문집, (2005), 321-339.
  2. 김진화, 남기찬, 이상종, "Support Vector Machine 기법을 이용한 고객의 구매의도 예측", Information Systems Review, 10권 2호(2008), 137-158.
  3. 안현철, 김경재, 한인구, "Support Vector Machine 을 이용한 고객구매예측모형", 한국지능정보시스템학회논문지, 11권 3호(2005), 69-81.
  4. 홍태호, 박지영, "RCMDE를 적용한 프로모션에 따른 고객등급예측", 한국인터넷전자상거래학회, 한국정보시스템학회 2010년 춘계공동 학술대회논문집, (2010), 155-168.
  5. Baesens, B., S. Viaene, D. Van den Poel, J. Vanthienen and G. Dedene, "Bayesian neural network learning for repeat purchase modelling in direct marketing", European Journal of Operational Research, Vol.138, No.1 (2002), 191-211. https://doi.org/10.1016/S0377-2217(01)00129-1
  6. Cho, S. and H. Shin, "Response modeling with support vector machines", Expert Systems with Applications, Vol.30, No.4(2006), 746-760. https://doi.org/10.1016/j.eswa.2005.07.037
  7. Conul, F. F., B. D. Kim and M. Shi, "Mailing smarter to catalog customer", Journal of Interactive Marketing, Vol.14, No.2(2000), 2-16. https://doi.org/10.1002/(SICI)1520-6653(200021)14:2<2::AID-DIR1>3.0.CO;2-N
  8. Coussement, K. and D. Van den Poel, "Churn prediction in subscription services : An application of support vector machines while comparing two parameter-selection techniques", Expert Systems with Applications, Vol.34, No.1(2008), 313-327. https://doi.org/10.1016/j.eswa.2006.09.038
  9. Huang, Z., S. Chen, C. Hsu, W. Chen and S. Wu, "Credit rating analysis with support vector machines and neural networks : a market comparative study", Decision Support Systems, Vol.37, No.4(2004), 543-558. https://doi.org/10.1016/S0167-9236(03)00086-1
  10. Kim, D., H. Lee and S. Cho, "Response Modeling with Support Vector Regression", Expert Systems with Applications, Vol.34, No.2(2008), 1102-1108. https://doi.org/10.1016/j.eswa.2006.12.019
  11. Kim, Y. S. and W. N. Street, "An intelligent system for customer targeting a data mining approach", Decision Support Systems, Vol.37, No.2(2004), 215-228. https://doi.org/10.1016/S0167-9236(03)00008-3
  12. Lu, C., T. Lee, C. Chiu, "Financial time series forecasting using independent component analysis and support vector regression", Decision Support Systems, Vol.47, No.2(2009), 15-125.
  13. Malthouse, E. C., "Assessing the performance of direct marketing scoring models", Journal of Interactive Marketing, Vol.15, No.1(2001), 49-62. https://doi.org/10.1002/1520-6653(200124)15:1<49::AID-DIR1003>3.0.CO;2-F
  14. Prinzie, A. and D. Van den Poel, "Constrained optimization of data-mining problems to improve model performance : A direct-marketing application", Expert Systems with Applications, Vol.29, No.3(2005), 630-640. https://doi.org/10.1016/j.eswa.2005.04.022
  15. Shin, H. and S. Cho, "Response Modeling with Support Vector Machine", Expert Systems with Applications, Vol. 30, No.4(2006), 746-760. https://doi.org/10.1016/j.eswa.2005.07.037
  16. Suh, E. H., K. C. Noh and C. K. Suh, "Customer list segmentation using the combined response model", Expert Systems with Applications, Vol.17, No.2(1999), 89-97. https://doi.org/10.1016/S0957-4174(99)00026-3
  17. Tay, F. E. H. and L. Cao, "Application of support vector machines in financial time series forecasting", Omega, Vol.29, No.4(2001), 497-505.
  18. Vapnik, S. Golowich and A. Smola, "Support vector method for function approximation regression estimation, and signal processing", In Mozer, M., Jordan, M., Petsche, T. editors, Advances in Neural Information Processing Systems 9, MIT Press, Cambridge, MA, (1999), 281-287.
  19. Vapnik, The Nature of Statistical Learning Theory, Springer, N.Y. 1995.
  20. Wang, K., S. Zhou, Q. Yang and J. M. S. Yeung, "Mining customer value : From association rules to direct marketing", Data Mining and Knowledge Discovery, Vol.11(2005), 57-79. https://doi.org/10.1007/s10618-005-1355-x
  21. Zahavi, J. and N. Levin, "Applying neural computing to target marketing", Journal of Direct Marketing, Vol.11, No.4(1997), 76-93. https://doi.org/10.1002/(SICI)1522-7138(199723)11:4<76::AID-DIR10>3.0.CO;2-D