Handling Incomplete Data Problem in Collaborative Filtering System

  • Noh, Hyun-Ju (Graduate School of Management, Korea Advanced Institute of Science and Technology) ;
  • Kwak, Min-Jung (Department of Information Statistics, Pyongtaek University) ;
  • Han, In-Goo (Graduate School of Management, Korea Advanced Institute of Science and Technology)
  • Published : 2003.11.01

Abstract

Collaborative filtering is one of the methodologies that are most widely used for recommendation system. It is based on a data matrix of each customer's preferences of products. There could be a lot of missing values in such preference data matrix. This incomplete data is one of the reasons to deteriorate the accuracy of recommendation system. There are several treatments to deal with the incomplete data problem such as case deletion and single imputation. Those approaches are simple and easy to implement but they may provide biased results. Multiple imputation method imputes m values for each missing value. It overcomes flaws of single imputation approaches through considering the uncertainty of missing values. The objective of this paper is to suggest multiple imputation-based collaborative filtering approach for recommendation system to improve the accuracy in prediction performance. The experimental works show that the proposed approach provides better performance than the traditional Collaborative filtering approach, especially in case that there are a lot of missing values in dataset used for recommendation system.

Keywords

References

  1. The $15^{th}$ International Conference on Machine Learning (ICML) Learning collaborative information filters Billsus,D.;Pazzani,M.J.
  2. Proceedings of Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98) Emprirical analysis of predictive algorithms for collaborative filtering Breese,J.S.;Heckerman,D.;Kadie,C.
  3. Journal of the American Statistical Association v.89 Missing data, Imputation, and the Bootstrap Efron,B.
  4. Communications of the ACM v.35 no.12 Using collaborative filtering to weave an information tapestry Goldberg,D.;Nichols,D.;Oki,B.M.;Terry,D.
  5. Proceedings of the 1999 Conference on Research and Development in Information Retrieval, August An algorithmic framework for performing collaborative filtering Herlocker,J.;Konstan,J.;Borchers,A.;Riedl,J.
  6. EachMovie collaborative filtering data set. DEC Systems Reserach Center McJones,P.
  7. Multiple imputation for nonrespones in surveys Rubin,D.B.
  8. Analysis of Incomplete Multivariate Data Schafer,J.
  9. Multiple impuatation: A primer. Statistical Methods in Medical Research v.8 Schafer,J.
  10. Proceedings of ACM CHI"95 Conference on Hunan Factros in Computing Systems Social information filtering: Algorithms for automating "word of mouth" Shardanad,U.;Maes,P.