• 제목/요약/키워드: subject indexing

검색결과 68건 처리시간 0.022초

색인언어의 어의적 관계 및 구문적 관계 (Semantic and syntactic relationships of indexing languages)

  • 윤구호
    • 한국도서관정보학회지
    • /
    • 제22권
    • /
    • pp.1-26
    • /
    • 1995
  • Indexes, especially subject indexes, are major tools for information retrieval. To enhance the retrieval effectiveness of subject indexes, the semantic and syntactic relationships of indexing languages are very important elements. This paper examines the afore-mentioned relationships, based on purely the syntax and semantics of Korean language. The outlines of this study are as follows: 1. The characteristics and usages of controlled vocabularies, particularly subject headings lists and thesaury, are reviewed. 2. The semantic relationships, such as equivalence, hierarchical and associative relationships, are defined, and their categories are investigated in detail. Accordingly, the usages of 'See' and 'See also' references are suggested circumstantially. 3. The syntactic relationships are also examined. Particularly, for the syntactic relationships of multiword indexing terms, two kinds of subject entry formats are compared. Since it is more rational for subject headings organized by the principle of context-dependency, the two-fine entry format is recommended for subject indexes. 4. Computerized production techniques of 'See' and 'See also' reference for the semantic relationships of indexing terms are presented. 5. Computerized production techniques of subject indexes representing the syntactic relationships of indexing terms are also presented.

  • PDF

주제분석기법으로서의 자동색인 (Automatic indexing as a subject analysis technique)

  • 이영자
    • 한국도서관정보학회지
    • /
    • 제12권
    • /
    • pp.61-96
    • /
    • 1985
  • The human subject analysis of a document has some critical problems. The method results in the inconsistency in analysis process and the contradiction of two objects of the subject analysis (one is the identification of the content for the retrieval of specific items and the other is to identify the content for the grouping of related materials). Since the subject analysis by mechanized has been recognized to be the possible way to aggregate the problems of manual analysis, various a n.0, pproaches of automatic indexing have been studied and experimented. This study is to examine the automatic indexing as one of the promising subject analysis techniques by statistical, syntactical and semantic a n.0, pproaches. In conclusion, the reasonable a n.0, pplication time of the automatic indexing should be made a decision based on the through investigation on the cost verse effectiveness, and automatic indexing system should be developed in the close relationship with the on-line search which is a good retrieval system for information explosion society. From now on, since the machine-readable document-text will be envisaged to be more and more available due to the rapid development of computer technology, the more substantial research on the automatic indexing will be also possible, which can bring about the increasing of practical automatic indexing systems.

  • PDF

주제색인의 이론과 실제 (Theory and practice of alphabetical subject indexing)

  • 윤구호
    • 한국도서관정보학회지
    • /
    • 제10권
    • /
    • pp.95-131
    • /
    • 1983
  • Index is a systematic guide to items contained in, or concepts derived from, a collection, Thus, it is represented as a paired set of index terms (t) and documents (D) : I= {(t,D) vertical bar t .mem. V, D .mem. W), where V is index vocabulary and W is document collection. Indexing is the process of analysing the informational content of records of knowledge and expressing the informational content in the language of the indexing system. It involves: 1) Selecting indexable concepts in a document; and 2) expressing these concepts in the language of the indexing system (as index entries): and an ordered list. Indexing process involves technical, semantic and syntactic problems. Technical problems are related to the accuracy of indexing, which is primarily governed by the indexer's ability of analysing subject, identifying indexable concepts, and coding. The proper levels of indexing exhaustivity, and index language specificity are also significant factors affecting the quality of index. Semantic problems are related to the choice of index terms and the form in which they should be used. Equivalent, hierarchical and affinitive/associative relationships of index terms are involved. Syntactic problems are largely related to the coordination of index terms. This process of coordination arises from the need to be able to search for the intersection of two or more classes defined by terms denoting distinct concepts. Finally, most valuable aspects of alphabetical subject indexing theories and practices are derived from those of Cutter, Kaiser, Ranganathan, Coates, Lynch and Austin, and discussed in details.

  • PDF

주제목록을 위한 한국용어열색인 시스템의 기능 (Function of the Korean String Indexing System for the Subject Catalog)

  • 윤구호
    • 한국문헌정보학회지
    • /
    • 제15권
    • /
    • pp.225-266
    • /
    • 1988
  • Various theories and techniques for the subject catalog have been developed since Charles Ammi Cutter first tried to formulate rules for the construction of subject headings in 1876. However, they do not seem to be appropriate to Korean language because the syntax and semantics of Korean language are different from those of English and other European languages. This study therefore attempts to develop a new Korean subject indexing system, namely Korean String Indexing System(KOSIS), in order to increase the use of subject catalogs. For this purpose, advantages and disadvantages between the classed subject catalog nd the alphabetical subject catalog, which are typical subject ca-alogs in libraries, are investigated, and most of remarkable subject indexing systems, in particular the PRECIS developed by the British National Bibliography, are reviewed and analysed. KOSIS is a string indexing based on purely the syntax and semantics of Korean language, even though considerable principles of PRECIS are applied to it. The outlines of KOSIS are as follows: 1) KOSIS is based on the fundamentals of natural language and an ingenious conjunction of human indexing skills and computer capabilities. 2) KOSIS is. 3 string indexing based on the 'principle of context-dependency.' A string of terms organized accoding to his principle shows remarkable affinity with certain patterns of words in ordinary discourse. From that point onward, natural language rather than classificatory terms become the basic model for indexing schemes. 3) KOSIS uses 24 role operators. One or more operators should be allocated to the index string, which is organized manually by the indexer's intellectual work, in order to establish the most explicit syntactic relationship of index terms. 4) Traditionally, a single -line entry format is used in which a subject heading or index entry is presented as a single sequence of words, consisting of the entry terms, plus, in some cases, an extra qualifying term or phrase. But KOSIS employs a two-line entry format which contains three basic positions for the production of index entries. The 'lead' serves as the user's access point, the 'display' contains those terms which are themselves context dependent on the lead, 'qualifier' sets the lead term into its wider context. 5) Each of the KOSIS entries is co-extensive with the initial subject statement prepared by the indexer, since it displays all the subject specificities. Compound terms are always presented in their natural language order. Inverted headings are not produced in KOSIS. Consequently, the precision ratio of information retrieval can be increased. 6) KOSIS uses 5 relational codes for the system of references among semantically related terms. Semantically related terms are handled by a different set of routines, leading to the production of 'See' and 'See also' references. 7) KOSIS was riginally developed for a classified catalog system which requires a subject index, that is an index -which 'trans-lates' subject index, that is, an index which 'translates' subjects expressed in natural language into the appropriate classification numbers. However, KOSIS can also be us d for a dictionary catalog system. Accordingly, KOSIS strings can be manipulated to produce either appropriate subject indexes for a classified catalog system, or acceptable subject headings for a dictionary catalog system. 8) KOSIS is able to maintain a constistency of index entries and cross references by means of a routine identification of the established index strings and reference system. For this purpose, an individual Subject Indicator Number and Reference Indicator Number is allocated to each new index strings and new index terms, respectively. can produce all the index entries, cross references, and authority cards by means of either manual or mechanical methods. Thus, detailed algorithms for the machine-production of various outputs are provided for the institutions which can use computer facilities.

  • PDF

중국의 혁명역사기록물의 목록기술과 검색분류(2) (Bibliographical Description and Classification Indexing For Revolutionary Historical Archives in China(2))

  • 이승휘
    • 기록학연구
    • /
    • 제5호
    • /
    • pp.209-242
    • /
    • 2002
  • Bibliographical Description for Revolutionary Historical Archives is created to describe records at the item level. It defines descriptive elements, punctuations, formats and methods. Descriptive elements are composed of 20 elements, each of which is either mandatory or optional. Mandatory elements are: repositories codes, documents codes, dates, creators, title, classification codes, and subject vocabularies. Abstracts were previously included in card cataloging and are removed in the computerized system. New elements, such as "uncontrolled vocabularies," "name of places," "personal names," "organizational structures" and "meetings," are added to allow keyword search. Considering that subject vocabulary searches are the most important in computerized systems, however, Guidelines for the Subject Indexing for Revolutionary Historical Archives as well as Subject Headings, as a result from the Guidelines, are created. The most extraordinary features in Chinese archival description are said to be the Guidelines for the Classification Indexing for Revolutionary Historical Archives and Materials as well as the Classification Scheme, both of which are created to allow subject search of records content. It is because Chinese practice of records management distinguishes the classification for arrangement from that for retrieval. Chinese archival description is, therefore, composed of bibliographic description rules, subject headings, and the classification scheme for retrieval.

구문 . 통계적 기법을 이용한 한국어 자동색인에 관한 연구 (An experiment in automatic indexing with korean texts : a comparison of syntactico-statistical and manual methods)

  • 서은경
    • 정보관리학회지
    • /
    • 제10권1호
    • /
    • pp.97-124
    • /
    • 1993
  • 본 논문은 자연어 형태의 한국어 텍스트 부터 주제를 대표할 수 있는 색인어를 자동으로 추출하는 실험적인 구문 . 통계적 자동색인 시스템을 구현하였다. 구문 . 통계적 자동색인 시스템은 형태소 분석과 단어 가증 기법을 이용하여 단일어와 명사구를 동시에 선택하는 자동색인 시스템을 말한다. 시스템의 성능을 측정하기 위하여, 300개의 우리말 학술 및 학위논문 초록에서 선택된 단일 . 복합어 색인어를 수작업 색인과 비교하였다. 이와 같은 실험 결과를 가지고 아직 미흡한 연구상태인 우리말 자동색인 개발에 있어서 필요한 기초자료를 제시하였다.

  • PDF

주제 색인법의 분석적 고찰(1)

  • 윤구호
    • 정보관리학회지
    • /
    • 제5권1호
    • /
    • pp.3-30
    • /
    • 1988
  • 주제 색인은 정보 검색의 필수 도구이다. 커터의 사전체 목록 규칙의 출간이래 다양 한 주제 색인법이 연구 개발 되었다. 그러나 그 대부분이 영어를 비롯한 서구어를 중심으로 한 이론과 기법으로서 우리나라 도서관에서 그대로 적용하기에는 맞지 않는 경우가 많다. 한국어의 통사론과 어의론에 가장 알맞는 새로운 주제 색인법을 개발하기 위한 선행 연구로 서 국내외의 주요 색인법을 검토 분석하였다.

  • PDF

On The Full-Text Database Retrieval and Indexing Language

  • Chang, Hye-Rhan
    • 정보관리학회지
    • /
    • 제4권1호
    • /
    • pp.24-46
    • /
    • 1987
  • 최근 원문 데이타베이스의 증가는 주제접근의 새로운 가능성을 제시하였다. 온라인 정보검색은 근본적으로 색인언어와 컴퓨터 기술의 문제이다. 본 연구의 목적은 전통적인 서지 데이타베이스 검색과 비교하여 원문 데이터 베이스 검색의 특징과 성능을 규명하는데 있다. 색인언어에 따른 검색효율, 현재 응용되고 있는 원문 데이타베이스 탐색 시스템, 통제어휘의 새로운 역할 등을 살펴보았다. 이 논문은 또한 원문 데이타베이스의 검색성능 실험에 대한 리뷰를 포함한다.

  • PDF

A One-Size-Fits-All Indexing Method Does Not Exist: Automatic Selection Based on Meta-Learning

  • Jimeno-Yepes, Antonio;Mork, James G.;Demner-Fushman, Dina;Aronson, Alan R.
    • Journal of Computing Science and Engineering
    • /
    • 제6권2호
    • /
    • pp.151-160
    • /
    • 2012
  • We present a methodology that automatically selects indexing algorithms for each heading in Medical Subject Headings (MeSH), National Library of Medicine's vocabulary for indexing MEDLINE. While manually comparing indexing methods is manageable with a limited number of MeSH headings, a large number of them make automation of this selection desirable. Results show that this process can be automated, based on previously indexed MEDLINE citations. We find that AdaBoostM1 is better suited to index a group of MeSH hedings named Check Tags, and helps improve the micro F-measure from 0.5385 to 0.7157, and the macro F-measure from 0.4123 to 0.5387 (both p < 0.01).

색인사 연구

  • 박준식
    • 한국도서관정보학회지
    • /
    • 제2권
    • /
    • pp.23-59
    • /
    • 1975
  • Indexes has not devcloped as an independent branch in library science from the beginning, but it has gradually evolved in a clo~eas sociation with catalog and under the direct influence of the development of publishing pro cesses and of the rapid social changes. Historically, index in the West can be traced back to eariler concordance. On the other hand, index in the Bast does not show a continuous development. It started with book catnlog, but other types of indexing were later 'adopted from the West. Indexing in the West and in the East can be summarized as follows: 1) In the West, Taylor considers Gesner's Pandectae was the first index but the Concordance of the Bible in 1247 was the first true index. Indexing method was first established later in 1545 in Gesner's Partitiones which appeared in three volumes. Classified index appeared after Partitions, but alphabetically ordered index was not developed until th eseventeenth century. The pxiodical index of La France S~auante in 1683 proved -its value, and Poole's An Alphabetical Index in the nineteenth century became the turning point in the development of indexing. After Poole's Index appeared periodical index and book catalog gradually began to be treated separately, and subject index and cross reference were incorporated into indexing. Also dictionary arrangement of the indexed items was adopted in the second half of the nincteenth, century after Charles A. Cutter developed his theory of rules for dictionary catalog and systematic studies of indexing were carried out by many scholars. In the twentieth century, index was mainly developed in the United States of America, especially by Wilson publishing Company. The general trend is to move away from the gcncral index to subject index. Also the ncwspapcr indcx such as The Times I~zdcx is 21 landmark in the history cf indcxing. 2) In China, thcs arc somc cvidcnccs that $Bizgluh(&), $ was the first indcx, but unforlunatcly the book itsclf has not been found as yet.

  • PDF