DOI QR코드

DOI QR Code

Nearest-Neighbor Collaborative Filtering Using Dimensionality Reduction by Non-negative Matrix Factorization

비부정 행렬 인수분해 차원 감소를 이용한 최근 인접 협력적 여과

  • 고수정 (인덕대학 컴퓨터소프트웨어과)
  • Published : 2006.12.31

Abstract

Collaborative filtering is a technology that aims at teaming predictive models of user preferences. Collaborative filtering systems have succeeded in Ecommerce market but they have shortcomings of high dimensionality and sparsity. In this paper we propose the nearest neighbor collaborative filtering method using non-negative matrix factorization(NNMF). We replace the missing values in the user-item matrix by using the user variance coefficient method as preprocessing for matrix decomposition and apply non-negative factorization to the matrix. The positive decomposition method using the non-negative decomposition represents users as semantic vectors and classifies the users into groups based on semantic relations. We compute the similarity between users by using vector similarity and selects the nearest neighbors based on the similarity. We predict the missing values of items that didn't rate by a new user based on the values that the nearest neighbors rated items.

협력적 여과는 사용자 선호도를 예측하기 위해 그 사용자의 유형을 학습하는 데 목적을 둔 기술이다. 협력적 여과 시스템이 전자상거래에서 성공적인 기술일지라도 그들은 데이터의 고차원성과 희박성이라는 문제점을 갖는다. 본 논문에서는 이와 같은 문제점을 해결하기 위하여 비부정 행렬 인수분해(NNMF, Non-negative Matrix Factorization) 방법을 이용한 최근 인접 협력적 여과 방법을 제안한다. 행렬을 분해하기 위한 전처리로서 사용자 변동 계수를 이용하여 사용자-아이템 행렬의 결측치를 채우고, 이를 대상으로 비부정 분해 방식을 적용하여 행렬을 인수분해 한다. 비부정 분해 방식을 적용한 긍정 분해는 사용자들을 의미를 갖는 벡터로써 표현함으로써 사용자들을 의미 관계를 갖는 그룹으로 표현한다. 이와 같이 벡터로 표현된 사용자들은 벡터 유사도에 의해 그들간의 유사도를 계산한다. 계산된 유사도의 정도에 의해 이웃을 결정하고, 이웃들이 평가한 아이템에 대한 흥미도를 기반으로 새로운 사용자가 평가하지 않은 아이템에 대한 결측치를 예측한다.

Keywords

References

  1. John. S. Breese, D. Heckerman, and C. Kadie, 'Empirical Analysis of Predictive Algoritluns for Collaborative Filtering,' Proceedings of the Conference on Uncertainty in Artificial Intelligence, Madison, WI, 1998
  2. Giuseppe Carenini, Rita Sharma, 'Exploring More Realistic Evaluation Measures for Collaborative Filtering,' Proceeding of AAAl 2004, 2004
  3. Sonny Han Seng Chee, Jiawei Han, and Ke Wang, 'RecTree: An Efficient Collaborative Filtering Method,' Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery, September, 2001
  4. S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harslunan, 'Indexing by latent semantic analysis,' Journal of the American Society for Information Science, Vol. 41, No.6, 1990 https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
  5. J. Delgado and N. Ishii, 'Formal Models for Learning of User Preferences, a Preliminary Report,' In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI-99), 1999
  6. L. Grossi, G. Gozzi, and P. Ganugi, 'Distribution Analysis of Items and Ratios in Companies? Accounts using a new iterative procedure,' Proceedings of compstat2002, 2002
  7. Jonathan L. Herlocker, Joseph A. Konstan, Loren G. Terveen, and John T. Riedl, 'Evaluating collaborative filtering recommender systems,' ACM Transactions on Information Systems (TOIS) archive, Vol. 22, No.1, 2004 https://doi.org/10.1145/963770.963772
  8. Lee, D. and Seung, H., 'Algorithms for non-negative matrix factorization,' Advances in Neural Information Processing Systems, pp. 556-562, 2001
  9. Liu, W. and Yi, J., 'Existing and New algorithms for nonnegative matrix factorization', Tech. rep., Department of Computer Sciences, University of Texas at Austin, 2003
  10. MovieLens collaborative filtering data set, Http://www.cs.umn.edu/Research/GroupLens/index.html, GROUPLENS RESEARCH PROJECT, 2000
  11. M. H. Pryor, 'The effects of singular value decomposition on collaborative filtering,' Technical Report PCS-TR98338, Compute Science Department, Dartmouth College, 1998
  12. V. Rijsbergen and C. Joost, Information Retrieval, Butterworths, London-second edition, 1979
  13. G. Salton and M. J. McGill, Introduction to Modern Irformation Retrieval, McGraw-Hill, 1983
  14. B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, 'Application of Dimensionality Reduction in Recommender System A Case Study,' Proceddings of ACM WebKDD, 2000
  15. B. Sarwar, J. Konstan, Al Borchers, J. Herlocker, B. Miller, and J. Riedl, 'Using Filtering' Agents to Improve Prediction Quality In the GroupLens Research Collaborative Filtering System,' Proceedings of the 1998 Conference on Computer Supported Cooperative Work, 1998 https://doi.org/10.1145/289444.289509
  16. B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, 'Application of Dimensionality Reduction in Recommender System A Case Study', Proceddings of ACM WebKDD, 2000
  17. Rita Sharma and David Poole, 'Symmetric Collaborative Filtering Using the Noisy Sensor Model,' Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001
  18. Simon Shepherd, Non-Negative Matrix Factorization, http://www.simonshepherd.supanet.com/nnmf.htm, 2004
  19. Wei Xu , Xin Liu , and Yihong Gong, 'Document clustering based on non-negative matrix factorization,' Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, July 28-August 01, 2003 https://doi.org/10.1145/860435.860485
  20. 고수정, '협력적 여과 시스템에서 사용자 변동 계수를 이용한 기본 평가값 예측', 정보과학회논문지, 제32권, 11월, 2005
  21. 김윤희, 'Non-negative Matrix Factorization을 이용한 텍스트 문서 및 마이크로어레이 데이터 분석', 서울대학교 학사 학위논문, 2003
  22. 김혜재, 손기락, 'K-최근접 이웃 추천 엔진에서의 벡터 유사도 사용에 대한 실험적 분석', 정보과학회 춘계 학술발표논문집(B), 제28권, 제1호, 2001