DOI QR코드

DOI QR Code

Method of Improving Personal Name Search in Academic Information Service

  • Han, Heejun (NTIS Center, Korea Institute of Science and Technology Information) ;
  • Lee, Seok-Hyoung (Department of Overseas Information, Korea Institute of Science and Technology Information, Department of Library and Information Science, Konkuk University)
  • Received : 2012.07.15
  • Accepted : 2012.11.10
  • Published : 2012.11.01

Abstract

All academic information on the web or elsewhere has its creator, that is, a subject who has created the information. The subject can be an individual, a group, or an institution, and can be a nation depending on the nature of the relevant information. Most information is composed of a title, an author, and contents. An essay which is under the academic information category has metadata including a title, an author, keyword, abstract, data about publication, place of publication, ISSN, and the like. A patent has metadata including the title, an applicant, an inventor, an attorney, IPC, number of application, and claims of the invention. Most web-based academic information services enable users to search the information by processing the meta-information. An important element is to search information by using the author field which corresponds to a personal name. This study suggests a method of efficient indexing and using the adjacent operation result ranking algorithm to which phrase search-based boosting elements are applied, and thus improving the accuracy of the search results of personal names. It also describes a method for providing the results of searching co-authors and related researchers in searching personal names. This method can be effectively applied to providing accurate and additional search results in the academic information services.

Keywords

References

  1. Artiles, J., Gonzalo, J., & Verdejo, F. (2005). A testbed for people searching strategies in the WWW. Research and Development in Information Retrieval - SIGIR, 2005, 569-570.
  2. Christen, P. (2006). A comparison of personal name matching: techniques and practical issues. IEEE International Conference on Data Mining - ICDM, 2006, 290-294.
  3. Culotta, A., Kanani, P., Hall, R., Wick, M., & McCallum, A. (2007). Author disambiguation using error-driven machine learning with a ranking loss function. Workshop on Information Integration on the Web - WIIW, 2006, 32-37.
  4. Guha, R. V., & Garg, A. (2004). Disambiguating people in search. World Wide Web Conference Series - WWW, 2004.
  5. Kalashnikov, D. V., Mehrotra, S., Chen, Z., Nuray-Turan, R., & Ashish, N. (2007). Disambiguation algorithm for people search on the web. International Conference on Data Engineering - ICDE, 2007, 1258-1260.
  6. Kanani, P., McCallum, A., & Pal, C. (2007). Improving author coreference by resource-bounded information gathering from the web. International Joint Conference on Artificial Intelligence, 2007, 429-434.
  7. Pfeifer, U., Poersch, T., & Fuhr, N. (1996). Retrieval effectiveness of proper name search methods. Information Processing & Management, 32(6), 667-679. https://doi.org/10.1016/S0306-4573(96)00042-8
  8. Piskorski, J., Wieloch, K., & Sydow, M. (2009). On knowledge-poor methods for person name matching and lemmatization for highly inflectional languages. Information retrieval, 12(3), 275-299. https://doi.org/10.1007/s10791-008-9085-5
  9. Schutze, H. (1998). Automatic word sense discrimination. Computational Linguistics, 24(1), 97-123.
  10. Vu, Q. M., Masada, T., Takasu, A., & Adachi, J. (2007). Disambiguation of people in web search using a knowledge base. IEEE International Conference on Research, Innovation and Vision for the Future, 2007, 185-191.
  11. Winkler, W. E. (2006). Overview of Record Linkage and Current Research Directions. Washington, DC 20233, U.S : Statistical Research Division, U.S. Census Bureau.
  12. Yang, K. H., Peng, H. T., Jiang, J. Y., Lee, H. M., & Ho, J. M. (2008). Author name disambiguation for citations using topic and web correlation. European Conference on Digital Libraries - ECDL, 2008, 185-196.