Comparative Evaluation of User Similarity Weight for Improving Prediction Accuracy in Personalized Recommender System

Jung Kyung-Yong;Lee Jung-Hyun;

Journal of the Institute of Electronics Engineers of Korea CI (전자공학회논문지CI)

Volume 42 Issue 6
/
Pages.63-74
/
2005
/
1229-6376(pISSN)

The Institute of Electronics and Information Engineers (대한전자공학회)

Comparative Evaluation of User Similarity Weight for Improving Prediction Accuracy in Personalized Recommender System

개인화 추천 시스템의 예측 정확도 향상을 위한 사용자 유사도 가중치에 대한 비교 평가

Jung Kyung-Yong (Division of Information Technology, Hansei Univ.) ;
Lee Jung-Hyun (School of Computer Sci. & Eng., Inha Univ.)

정경용 (한세대학교 IT학부) ;
이정현 (인하대학교 컴퓨터공학부)

Published : 2005.11.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In Electronic Commerce, the latest most of the personalized recommender systems have applied to the collaborative filtering technique. This method calculates the weight of similarity among users who have a similar preference degree in order to predict and recommend the item which hits to propensity of users. In this case, we commonly use Pearson Correlation Coefficient. However, this method is feasible to calculate a correlation if only there are the items that two users evaluated a preference degree in common. Accordingly, the accuracy of prediction falls. The weight of similarity can affect not only the case which predicts the item which hits to propensity of users, but also the performance of the personalized recommender system. In this study, we verify the improvement of the prediction accuracy through an experiment after observing the rule of the weight of similarity applying Vector similarity, Entropy, Inverse user frequency, and Default voting of Information Retrieval field. The result shows that the method combining the weight of similarity using the Entropy with Default voting got the most efficient performance.

전자상거래에서 최근 대부분의 개인화된 추천 시스템들은 협력적 필터링 기술을 적용하고 있다. 이 방법은 사용자의 성향에 맞는 아이템을 예측하고 추천하기 위하여 비슷한 선호도를 가지는 사용자들간의 유사도 가중치를 계산한다. 이때 일반적으로 피어슨 상관계수를 많이 사용한다. 그러나 이 방법은 두 사용자가 공통으로 선호도를 평가한 아이템들이 있을 때만 상관관계를 계산할 수 있으므로 예측의 정확도는 떨어진다. 사용자 유사도 가중치는 사용자의 성향에 맞는 아이템을 예측하는 경우 뿐만 아니라 개인화된 추천 시스템의 성능에 영향을 미칠 수 있다. 본 논문에서는 정보검색 분야의 벡터 유사도, 엔트로피, 역 사용자 빈도, 기본 선호도 평가를 적용하여 유사도 가중치 공식에 대해서 살펴보고, 추천 시스템의 예측 정확도 향상에 대해서도 실험을 통해 확인해 보았다. 실험 결과는 엔트로피를 이용한 유사도 가중치에 기본 선호도 평가를 결합하는 방법이 가장 성능이 우수함을 알 수 있다.

Keywords

References

M. Pazzani, 'A Framework for Collaborative, Content-based and Demographic Filtering,' Artificial Intelligence Review, pp.393-408, 1999 https://doi.org/10.1023/A:1006544522159
Sarwar, B. M., Karypis, G., Konstan, J. A. and Riedl, J., 'Item-based Collaborative Filtering Recommendation Algorithms,' Proc. of the Tenth Int. WWW Conf. 2001, pp. 285-295, 2001 https://doi.org/10.1145/371920.372071
D. Billsus, M. J. Pazzani, 'Learning Collaborative Information Filters,' in Proc. of ICML, pp. 46-53, 1998
J. Konstan, B. Millr, D. Maltz, J. Herlocker, L. Gordon, and J. Riedl, 'GroupLens: Applying Collaborative Filtering to Usenet News,' Communications of the ACM, Vol.40, No.3, pp.77-87, 1997 https://doi.org/10.1145/245108.245126
P. Resnick, N. Iacovou, M. Sushak, P. Bergstrom, and J. Riedl. GroupLens: An open architecture for collaborative filtering of netnews. In Proceedings of the Computer Supported Collaborative Work Conference, pages 175-186, 1994 https://doi.org/10.1145/192844.192905
정경용, 김진현, 나영주, '소재 설계를 위한 감성공학적 Textile 기반의 협력적 필터링 시스템', 한국섬유공학괴.한국의류학회.한국염색가공학회 공동 추계학술대회 논문집, 2002
K. Y. Jung, Y. J. Na, J. H. Lee, 'FDRAS: Fashion Design Recommender Agent System using the Extraction of Representative Sensibility and the Two-Way Filtering on Textile,' LNCS 2736, Springer-Verlag, pp. 631-640, 2003
정경용, 김진현, 나영주, 소재 설계를 위한 감성 공학적 디자인 지원시스템 개발, 최종 연구 개발 보고서, 한국학술진흥재단, 2003
B. Sarwar, et. al., 'Analysis of Recommendation Agorithms for E-Commerce,' in proc. of the ACM E-Commerce 2000 Conference, 2000
정경용, 최성용, 임기욱, 이정현, '베이지안 추정치가 부여된 유사도 가중치와 연관 사용자 군집을 이용한 선호도 예측 시스템', 정보과학회논문지 : 소프트웨어 및 응용, 제30권, 제4호, pp. 316-325, 2003.4
정경용, 류중경, 강운구, 이정현, '내용 기반 여과와 협력적 여과의 병합을 통한 추천 시스템에서 조화 평균 가중치.' 정보과학회논문지: 소프트웨어 및 응용, 제30권, 제3호, pp. 239-250, 2003
정경용, 협력적 여과 시스템에서 연관 사용자 군집과 베이지안 추정치를 이용한 예측 방법, 인하대학교 대학원, 석사학위논문, 2002
정경용, 김진수, 김태용, 이정현, '선호도 재계산을 위한 연관 사용자 군집 분석과 Representative Attribute-Neighborhood을 이용한 협력적 필터링 시스템의 성능향상,' 한국정보처리학회(B), 제l0-B 권, 제3호, pp. 287-296, 2003 https://doi.org/10.3745/KIPSTB.2003.10B.3.287
Jin Ok Kim, Sung Jin Seo and Chin Hyun Chung, 'Real-Time Face Recognition by the PCA with Color Images,' Lecture Notes in Computer Science, LNCS 3036, pp. 687-690, June 2004 https://doi.org/10.1007/b97987
K. Y. Jung, J. H. Lee, 'Prediction of User Preference in Recommendation System using Association User Clustering and Bayesian Estimated Value,' LNAI 2557, Springer-Verlag, pp. 284-296, 2002
T. Michael, Maching Learning, McGraw-Hill, pp. 154-200, 1997
정영미, 정보검색론, 구미무역 출판부, 1993
강창언, 오용선, 이명호, 정보이론 - 토딩 이론과의 접목, 생능사, pp. 233-285, 1987
R. E. Blahut, Principles and Practice of Information Theory, pp. 12-18, 1991
P. McJones, EachMovie, www.research.digital.com/SRC/eachmovie. 1997

Journal of the Institute of Electronics Engineers of Korea CI (전자공학회논문지CI)

Comparative Evaluation of User Similarity Weight for Improving Prediction Accuracy in Personalized Recommender System

개인화 추천 시스템의 예측 정확도 향상을 위한 사용자 유사도 가중치에 대한 비교 평가

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)