Machine-Learning Based on Relevance Feedback: A Powerful Engine to Enhance the Performance of SDI System

Korean title (translated): A Study on Improving the Performance of SDI Systems through a Machine-Learning-Based Feedback Process

  • Published : 2004.12.01

Abstract

As the Internet rapidly increases the amount of available information, selective dissemination of information (SDI) services, which deliver relevant documents to users in a timely manner, have been studied and developed. In practice, however, these services have seen little use. This paper analyzes the reasons for this and develops relevance-feedback-based SDI systems to improve on the performance of existing SDI systems. Three experimental systems were developed: an SDI system based on feedback with minimum user intervention, an SDI system based on fully automatic feedback, and an SDI system based on feedback with maximum user intervention. A fourth system, built on the method used by traditional SDI services, served as the baseline for evaluating how much the three new systems improve performance. The system with maximum user intervention showed the greatest performance improvement, followed by the fully automatic system, the minimum-intervention system, and the traditional SDI system. The performance of the feedback-based systems improved as they went through more feedback rounds.

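With the arrival of the information age, the volume of information has grown exponentially. SDI services have been researched and developed as a way to deliver, in a timely manner, the information suited to each individual user out of this mass of information, yet their actual usage has been found to be very low. This paper therefore analyzes the causes and sets out to develop a relevance-feedback-based SDI system that can improve SDI system performance. The experimental systems developed for this study are an SDI system based on minimum-user-intervention feedback, an SDI system based on fully automatic feedback, and an SDI system based on maximum-user-intervention feedback. A fourth system, built with the method used in traditional SDI services, was developed to evaluate how much the three new systems improve performance. In the experiments, the maximum-user-intervention system performed best, followed by the fully automatic, minimum-user-intervention, and traditional SDI systems, and the performance of the feedback-based systems improved as feedback progressed.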
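The abstract does not say which learning algorithm drives the feedback step, so the following is only a minimal sketch of how a relevance-feedback loop in an SDI system could work, assuming a Rocchio-style profile update and cosine-similarity filtering. The function names (`vectorize`, `rocchio_update`, `disseminate`), the whitespace tokenizer, the weights `alpha`, `beta`, `gamma`, and the threshold are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def vectorize(text):
    """Term-frequency vector from a naive whitespace tokenizer (assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(vectors):
    """Average of a list of sparse vectors; empty dict for an empty list."""
    total = Counter()
    for vec in vectors:
        for term, weight in vec.items():
            total[term] += weight
    n = len(vectors)
    return {t: w / n for t, w in total.items()} if n else {}


def rocchio_update(profile, relevant, non_relevant,
                   alpha=1.0, beta=0.75, gamma=0.15):
    """Rocchio-style update (assumed technique, not confirmed by the paper).

    Moves the user profile toward documents judged relevant and away from
    documents judged non-relevant; terms with non-positive weight are dropped.
    """
    rel_c = centroid(relevant)
    non_c = centroid(non_relevant)
    updated = {}
    for term in set(profile) | set(rel_c) | set(non_c):
        weight = (alpha * profile.get(term, 0.0)
                  + beta * rel_c.get(term, 0.0)
                  - gamma * non_c.get(term, 0.0))
        if weight > 0:
            updated[term] = weight
    return updated


def disseminate(profile, documents, threshold=0.2):
    """Return the documents whose similarity to the profile meets the threshold."""
    return [d for d in documents if cosine(profile, vectorize(d)) >= threshold]


# One feedback round with hypothetical data.
profile = vectorize("machine learning")
relevant = [vectorize("relevance feedback learning")]
non_relevant = [vectorize("cooking recipes")]
updated = rocchio_update(profile, relevant, non_relevant)
```

In a sketch like this, the relevance judgments passed to `rocchio_update` could come from explicit user ratings or from an automatic procedure. Where those judgments come from is one plausible way to read the difference between the minimum-intervention, fully automatic, and maximum-intervention systems, but the abstract does not spell out the mechanisms.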
