Improving Retrieval Effectiveness with Multiple Weighting Schemes

다중 가중치 기법을 이용한 검색 효과의 개선

  • Published: 1995.12.01

Abstract

It has been known that different representations of queries or documents, or different retrieval techniques, retrieve different sets of documents. Recent work suggests that significant improvements in retrieval performance can be achieved by combining multiple representations or multiple retrieval techniques. In this paper we propose a simple method for retrieving different documents within a single query representation, a single document representation, and a single retrieval technique. We classify the types of documents and describe the properties of weighting schemes. We then explain that weighting schemes with different properties may retrieve different types of documents. Experimental results show that significant improvements can be obtained by combining the retrieval results from weighting schemes with different properties.

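It has been known that different representations of queries or documents, or different retrieval techniques, retrieve different sets of documents. Recently, it has been demonstrated that higher retrieval effectiveness can be obtained by exploiting this property and combining multiple representations or retrieval techniques. This paper describes how higher retrieval effectiveness can be obtained by combining weighting schemes with different properties under a single representation of queries and documents and a single retrieval technique. After classifying the types of documents and describing the properties of weighting schemes, we explain on that basis that weighting schemes with different properties retrieve different types of documents. We also demonstrate through experiments that higher retrieval effectiveness can be obtained by combining weighting schemes with different properties.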
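
The abstract does not say which weighting schemes are combined or how the retrieval results are merged, so the following is only a minimal sketch under stated assumptions. It contrasts two hypothetical tf-idf weighting schemes that differ in one property (with and without cosine length normalization) and merges their runs by summing min-max normalized scores, a CombSUM-style combination. The function names (`score_documents`, `min_max`, `combine_runs`, `rank`), the idf formula, and the whitespace tokenizer are illustrative choices, not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: naive lowercase whitespace tokenization, for illustration only.
    return text.lower().split()


def score_documents(query, docs, normalize_length):
    """Score each document by a tf-idf inner product with the query.

    normalize_length=True applies cosine normalization to the document
    vector; False leaves raw tf-idf weights, which favors longer documents.
    """
    n = len(docs)
    doc_tfs = [Counter(tokenize(d)) for d in docs]
    # Document frequency: number of documents containing each term.
    df = Counter(term for tf in doc_tfs for term in tf)
    # Assumption: a smoothed idf, log(N / df) + 1, so no term gets zero weight.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    q_tf = Counter(tokenize(query))

    scores = []
    for tf in doc_tfs:
        weights = {t: f * idf[t] for t, f in tf.items()}
        if normalize_length:
            norm = math.sqrt(sum(w * w for w in weights.values())) or 1.0
            weights = {t: w / norm for t, w in weights.items()}
        scores.append(sum(q_tf[t] * weights.get(t, 0.0) for t in q_tf))
    return scores


def min_max(scores):
    # Rescale one run to [0, 1] so runs on different scales are comparable.
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def combine_runs(runs):
    # CombSUM-style merge: sum the normalized scores each document received.
    normalized = [min_max(r) for r in runs]
    return [sum(col) for col in zip(*normalized)]


def rank(scores):
    # Document indices ordered from highest to lowest score.
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


if __name__ == "__main__":
    docs = [
        "term weighting for text retrieval",
        "retrieval retrieval retrieval of long documents with many other words inside",
        "combining evidence from several sources",
    ]
    query = "retrieval"
    run_a = score_documents(query, docs, normalize_length=True)
    run_b = score_documents(query, docs, normalize_length=False)
    print(rank(run_a), rank(run_b), rank(combine_runs([run_a, run_b])))
```

In this sketch the normalized scheme tends to favor short documents and the unnormalized one tends to favor long documents with many query-term occurrences, which is one way two schemes under the same representation and retrieval technique could surface different documents before their results are combined.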
