DOI QR코드

DOI QR Code

Characteristics of Fulltext Index by Human and Automatic Indexing Systems

전문색인에 있어서 수작업 색인과 자동색인의 특성

  • Kim, Gi-Yeong (Dept. of Library and Information Science, Yonsei University)
  • Published : 2008.06.30

Abstract

The purpose of this study is to investigate the characteristics of indexes by human and machine, and differences between them in terms of term identification in a fulltext environment. A back-of-book index and two indexes produced by two term identifiers (LinkIt and Termer) as pseudo-indexing systems for a whole body of a monograph are examined. In the investigation, the traditional contrast between manual and automatic indexing is confirmed in fulltext environment, manual index is for browsing and human use, and automatic index is for searching and machine use. The border between them, however, becomes vague. Some considerations for the use of the term identifiers for browsing and for searching are discussed, and further research for the use of the term identifier is suggested.

본 연구는 전문(fulltext) 환경에서 수작업 색인과 자동색 인의 색 인용어의 특성과 차이점을 알아보는 것을 그 목적으로 한다. 이를 위해 영어로 작성된 단행본에 대한 권말색인과 두 개의 유사 색인 시스템(LinkIt 과 Termer)을 이용한 색인들이 이용되었다. 이러한 비교분석을 통해 수작업 색인은 이용과 브라우징에 대한 강점이 있으며 자동색인은 자동 시스템에서의 탐색에 강점이 있음을 확인하였지만, 양자간의 경계가 불분명해짐도 아울러 확인하였다. 마지막으로 브라우징과 탐색을 위한 유사 색인 시스템의 이용에 있어서 고려할 점과 이에 대한 향후 연구에 대하여 토의하였다.

Keywords

References

  1. The American Society of Indexer. 2007. The American Society of Indexers: Awards. [cited 2008.5.23].
  2. The American Society of Indexer. 2006. The American Society of Indexers: Indexing Evaluation Checklist: The index is the key to the book. [cited 2008.5.23].
  3. Anderson, J.D., & Perez-Carballo, J. 2001a. The nature of indexing: how humans and machines analyze messages and texts for retrieval. Part I: Research, and the nature of manual indexing. Information Processing and Management, 37: 231-254 https://doi.org/10.1016/S0306-4573(00)00026-1
  4. Anderson, J.D., & Perez-Carballo, J. 2001b. The nature of indexing: how humans and machines analyze messages and texts for retrieval. Part II: Machine indexing, and the allocation of human versus machine effort. lriformation Processing and Management, 37: 255-277 https://doi.org/10.1016/S0306-4573(00)00046-7
  5. East, J. W. 2005. Subject retrieval of scholarly monographs via electronic databases. Journal of Documentation, 62: 597-605 https://doi.org/10.1108/00220410610688741
  6. Evans, D.K 1999. A technical Description of the LinkIt System. [cited 2003.1.14].
  7. Gratch, B., Settel, B., & Atherton, P. 1978. Characteristics of book indexes for subject retrieval in the humanities and social sciences. The Indexer, 11: 14-23
  8. Heft, C.A., Jacob, E.K., & Dawson, P. 2000. A usability Assessment of online indexing structures in the networked environment. Journal of the American Society for Information Science, 51: 971-988 https://doi.org/10.1002/1097-4571(2000)9999:9999<::AID-ASI1001>3.0.CO;2-E
  9. Justeson, J.S., & Katz, S.M. 1994. Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering, 1: 9-27
  10. Lancaster, F.W. 1998. Indexing and Abstracting in Theory and Practice. Champaign, IL: University of lllinois, Graduate School of Library and Information Science
  11. Lathrop, L.M., Mauer, P. & Wyman, L.P. 1997. Quality and usability in indexes. In Annual Conference Proceedings Society for Technical Communication. [cited 2008.5.23].
  12. Milstead, J.L. 1994. Needs for research in indexing. Journal of the American Society for Information Science, 45: 577-582 https://doi.org/10.1002/(SICI)1097-4571(199409)45:8<577::AID-ASI12>3.0.CO;2-P
  13. Rasmussen, E.M 1994. Indexing and retrieval from full-text. Introduction. In Fiden, R, Hahn, T.B., Rasmussen, E.M., & Smith, P.J. Eds. Challenges in Indexing Electronic Text and Images. Medford, NJ: Learned Infonnation. Inc., 241-245
  14. Rice, R.E., McCreadie, M., & Chang, S. 2001. Accessing and Browsing Information and Communication: An Interdisciplinary Approach. Cambridge, MA: MIT Press
  15. Sparck-Jones, K 1973. Does indexing exhaustivity matter? Journal of the American Society for Information Science, 24: 313-316 https://doi.org/10.1002/asi.4630240502
  16. Wacholder, N. 1998. Simplex NPs clustered by head: A method for identifying significant topics within a document. In Proceedings of the Workshop on the Computational Treatment of Nominals (COLING-ACL '98), August 16, 1998. 70-79
  17. Wacholder, N., Evans, D.K, & Klavans, J.L. 2001. Automatic identification and organization of index terms for interactive browsing. Proceedings of the first ACMIEEE-CS joint conference on Digital libraries (JCDL '01), June 24-28, 2001., Roanoke, Va: 126-134
  18. Wittman, C. 1990. Subheadings in Award-winning book indexes: a quantitative evaluation The Indexer, 17: 3-6
  19. Yang, K 2005. Information retrieval on the web. Annual Review of Information Science and Technology, 39: 33-80 https://doi.org/10.1002/aris.1440390109