An Implementation of Best Match Algorithm for Korean Text Retrieval in the Client/Server Environment

클라이언트 서버 환경에서 한글텍스트 검색을 위한 베스티매치 알고리즘의 구현

  • Published : 2001.03.01

Abstract

This paper presents the application of best match search algorithm in the client/server system for natural language access to Web-based database. For this purpose, the procedures to process Korean word variants as well as to execute probabilistic weighting scheme have been implemented in the client/server system. The experimental runs have been done using a Korean test set which included documents, queries and relevance judgements. The experimental results demonstrate that best match retrieval with relevance information is better than the retrieval without it.

References

  1. Journal of the American Society for Information Science v.27 Relevance Weightiing of Search Terms Robertson, S. E.;Karen Sparck Jones
  2. Communications of the ACM v.39 Natural Language Processing for Information Retrieval Lewis, D. D;Karen Sparck Jones
  3. Literary and Linguistic Computing v.8 A Comparison of Spelling-Correction Methods for the Identification of Word Forms in Historical Text Databases Robertson, A. M.;P. Willett
  4. 22nd Annual Colloquium on Information Retrieval Research A Probabilistic Approach to Chinese Information Retrieval : Theory nd Experiments Huang, X.;S. Robertson
  5. Journal of Korea Information Management Society v.11 A Development of the Test Set for Estimating the Retrieval Performance of an Automatic Indexer Kim, S. H(et al.)
  6. In submission to Information Processing & Management Effectiveness of the Korean Stemmer for Word Conflation Lee, H. S.;P. Willett
  7. ACM SIGIR Forum v.16 The Nearest Neighbour Problem in Information Retrieval : an Algorithm Using Upperbounds Smeaton, A. F.;C. J. van Rijsbergen
  8. Journal of Documentation v.35 Search Term Relevance Weighting Given Little Relevance Information Sparck Jones, K.
  9. Fourth Annual Symposium on Document Analysis and Information Retrieval Full-text Search and Document Recognition of Japanese Text Fujisawa, H;K. Marukawa
  10. The Journal of Computer Text Processing v.6 Comparison of n-gram Matching and Stemming for Term Conflation in English, Malay and Turkish Texts Ekmekcioglu, F. C.(et al.)
  11. Information Processing & Management v.17 The Selection of Good Search Terms van Rijsbergen, C. J.;D. J. Harper;M. F. Porter
  12. Automatic Text Processing for Korean Language Free Text Retrieval Lee, H. S.
  13. The OKAPI Online Catalogue Research Projects, In: Readings in Information Retrieval Walker, S;K. Sparck Jones(ed.);P. Willett(ed.)
  14. Journal of the American Society for Information Science v.50 Stemming Methodologies over Individual Query Words for an Arabic Information Retrieval System Abu Salem, H.;M. Al Omari;M. W. Evens
  15. Journal of Information Science v.6 A Review of the Use of Inverted Files for Best Match Searching in Information Retrieval System Perry, S. A.;P. Willett
  16. Document Retrieval System Willett, P.(ed.)
  17. Journal of the American Society for Information Science v.47 Cheshire II : Designing a Next-generation Online Catalog Larson, R. R.(et al.)
  18. Journal of Documentation v.33 The probability ranking principle in information retrieval Robertson, S. E.
  19. Information Processing & Management Term Weighting Approaches in Automatic Text Retrieval Salton, G;C. Buckley