• Title/Summary/Keyword: thesaurus construction

Search Result 62, Processing Time 0.024 seconds

Design and Implementation of an Object-Based Thesaurus System: Semi-automated Construction, Abstracted Concept Browsing and Query-Based Reference (객체기반 시소러스 시스템의 설계 및 구현: 반자동화 방식의 구축, 추상화 방식의 개념 브라우징 및 질의기반 참조)

  • Choi, Jae-Hun;Kim, Ki-Heon;Yang, Jae-Dong
    • Journal of KIISE:Databases
    • /
    • v.27 no.1
    • /
    • pp.64-78
    • /
    • 2000
  • In this paper, we design and implement a system for managing domain specific thesauri, where object-oriented paradigm is applied to thesaurus construction, concept browsing and query-based reference. This system provides an objected-oriented mechanism to assist domain experts in constructing thesauri; it determines a considerable part of relationship degrees between terms by inheritance and supplies domain experts with information available from a thesaurus being constructed This information is especially useful to enforce consistency between the hierarchies of a thesaurus, each constructed by different experts in different sites through cooperation. It may minimize the burden of domain eIn this paper, we design and implement a system for managing domain specific thesauri, where object oriented paradigm is applied to thesaurus construction, concept browsing and query based reference. This system provides an objected mechanism to assist domain experts in constructing thesauri: it determines a considerable part of relationship degrees between terms by inheritance and supplies domain experts with information available from a thesaurus being constructed. This information is especially useful to enforce consistency between the hierarchies of a thesaurus, each constructed by different experts in different sites through cooperation. It may minimize the burden of domain experts caused from the exhaustive specification of individual relationship. This system also provides an abstracted browsing and a query based reference, which allow users to easily verify thesaurus terms before they are used in usual boolean queries. The verification is made by actively searching for them in the thesaurus. Reference queries and abstracted browsing views facilitate this searching. The facility is indispensable especially when precision counts for much.

  • PDF

A Study on the Evaluation of Newspaper Thesaurus (신문 시소러스의 평가에 관한 연구 : 신문기사 종합시소러스를 중심으로)

  • 이인애
    • Journal of the Korean Society for information Management
    • /
    • v.12 no.1
    • /
    • pp.99-113
    • /
    • 1995
  • This study evaluates representability and comprehensivity of the Theasurus in theeconomics and industry fields of the "General Thesaurus of Newspaper Articles." The methodsused in the study were, first, indexing of the pages covering economics and industry articlesusing the Thesaurus and second, comparing the Thesaurus terms with the words collectedfrom the newspapers articles and glossaries. The study clarifies the following problems whichmight occur in the construction and use of newspaper thesaurus: specificity of the subjectconcepts, separation of component concept, preference relationship between descriptors andentry terms, the methods of recording of proper nouns and allocation of terms among thesubject areas concern.he subject areas concern.

  • PDF

Fuzzy based Thesaurus Construction Supporting Component Retrieval (컴포넌트 검색을 지원하는 퍼지 기반 시소러스 구축)

  • Kim, Gui-Jung;Han, Jung-Soo;Song, Young-Jae
    • The KIPS Transactions:PartD
    • /
    • v.10D no.5
    • /
    • pp.753-762
    • /
    • 2003
  • Many Methodologies have proposed for component retrieval. Among them, thesaurus concept has introduced for similar component retrieval. This paper classified classes by concept according to inheritance relation for efficient retrieval of component, and applied fuzzy logic to thesaurus method and constructed object-oriented thesaurus. Proposed method could express category between concepts automatically, and calculate fuzzy degree between classes by comparing matching and mismatching degree to each class and category and construct thesaurus. Component retrieval is that using classes of component, candidate components are retrieved according to priority order using fuzzy similarity. Also, we improved retrieval performance by thesaurus greatly, setting critical of most suitable through simulation.

A Study on the Extend Guideline for the Equivalence Relationship in Thesaurus (대등관계 설정의 확장 지침에 관한 연구)

  • 남영준
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.2
    • /
    • pp.1-21
    • /
    • 2004
  • For the guarantee of retrieval ratio, thesaurus's maintenance for descriptors are necessary. To maintain the optimum scale of thesaurus, new terms and existing terms should be structured to the equivalence relationship. Therefore, equivalence relationships are needed to new standard. This study proposes new standard of the equivalence relationships, which is more specified for better guarantee of retrieval ratios. The relationships have seven facets. These six facets will be used as new knowledge-base, which could be reestablished between the descriptors.

Newspaper Thesaurus Construction in Theory and Practice (신문 시소러스 개발의 이론과 실제)

  • Chung Young-Mee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.25
    • /
    • pp.51-82
    • /
    • 1993
  • Effective indexing systems are required to enhance the performance of full-text retrieval systems. The result of the analysis of index terms selected by human indexers without a newspaper thesaurus indicates that controlled indexing language is necessary for effective and consistent indexing of newspaper articles. In this paper, basic principles are established for keyword selection from Korean newspapers and significant problems identified in the process of developing a newspaper thesaurus are discussed in depth.

  • PDF

A Study on Multi-Lingual Thesaurus Database Construction of Scientific and Technical Terms (과학기술용어(科學技術用語) 시소러스 대역(對譯) 데이터베이스 구축(構築))

  • Kim, Eun-Shik
    • Journal of Information Management
    • /
    • v.22 no.2
    • /
    • pp.1-28
    • /
    • 1991
  • The objective of this study is to prepare a source data in order to establish a standardization of scientific and technical terms in Korean language. This will contribute to accelerate the production of Korean databases in scientific and technical field and will be used as the most convenient search tool for accessing to the foreign database. This study includes the construction of the multi-lingual thesaurus comprising of Korean, English, and Japanese. First of all a theoretical background on thesaurus is reviewed, and terms are collected from JICST Thesaurus, English-Japanese List and JICST Thesaurus, Japanese-English List published by JICST in 1987. This multi-lingual thesaurus covers 38,318 terms of Korean descriptors, 2,870 terms of Korean non-descriptors, 38,318 terms of English descriptors, 11,910 terms of English non-descriptors, and 38,318 terms of Japanese descriptors, 9,789 terms of Japanese non-descriptors.

  • PDF

A Study of Designing the Han-Guel Thesaurus Browser for Automatic Information Retrieval (자동정보검색을 위한 한글 시소러스 브라우저 구축에 관한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.31 no.2
    • /
    • pp.279-302
    • /
    • 2000
  • This study is to develop a new automatic system for the Korean thesaurus browser by which we can automatically control all the processes of searching queries such as, representation, generation, extension and construction of searching strategy and feedback searching. The system in this study is programmed by Delphi 4.0(PASCAL) and consists of database system, automatic indexing, clustering technique, establishing and expressing thesaurus, and automatic information retrieval technique. The results proved by this system are as follows: 1)By using the new automatic thesaurus browser developed by the new algorithm, we can perform information retrieval, automatic indexing, clustering technique, establishing and expressing thesaurus, information retrieval technique, and retrieval feedback. Thus it turns out that even the beginner user can easily access special terms about the field of a specific subject. 2) The thesaurus browser in this paper has such merits as the easiness of establishing, the convenience of using, and the good results of information retrieval in terms of the rate of speed, degree, and regeneration. Thus, it t m out very pragmatic.

  • PDF

A Theoretical Study on Indexing Methods using the Metadata for the Automatic Construction of a Thesaurus Browser (시소러스 브라우저 자동구현을 위한 Metadata를 이용한 색인어 처리방안에 대한 연구)

  • Seo , Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.35 no.4
    • /
    • pp.451-467
    • /
    • 2004
  • This paper is intended to present the theoretical analyses on automatic indexing, which is vital in the process of constructing a thesaurus browser, and clustering algorithms to construct hierarchical relations among terms as well as the methods for the automatic construction of a thesaurus browser. The methods to select the index term automatically in the web documents are studied by surveying the methods for analyzing and processing metadata which conforms to bibliographical roles of traditional paper documents in web documents. Also, the result of the study suggests to adding or involving the metadata in web documents, using the metadata automatic editor because metadata is not listed in most of the web documents.

  • PDF

A Study on the Korean-Engligh Semantic Thesaurus Construction for Knowledge Management System (지식관리시스템을 위한 의미형 한영 시소러스 구축에 관한 연구)

  • 남영준
    • Journal of Korean Library and Information Science Society
    • /
    • v.32 no.4
    • /
    • pp.77-98
    • /
    • 2001
  • As the role of a library has changed to the integrated management system of knowledge, the library needs new information retrieval tools. The purpose of this study is to propose a method and principle of the Korean-English semantic thesaurus construction for a knowledge management system. The method and principle is as follows; 1) in collecting terminology, I included not only internal documents but external documents on the web as a source for the descriptors extraction. 2) conceptual descriptors are more needed than semantic ones. I also proposed the necessity of the authority files for complement. 3) I proposed the appropriate scale of the descriptors to be 15,000 in a thesaurus. And 4) I proposed a hybrid method that used both a manual and an automatic process in establishing the relationship.

  • PDF

Development of a Thesaurus Management System based on the Object-Oriented Technique (객체지향 기법을 이용한 시소러스 관리 시스템의 개발에 관한 연구)

  • 박계숙
    • Journal of the Korean Society for information Management
    • /
    • v.13 no.2
    • /
    • pp.5-18
    • /
    • 1996
  • For the construction of thesaurus, a thesaurus management system is needed which can process dynamic variations fast and exactly such as input. correction and deletion of words, and definition of new relationship between words. In this paper, I developed a thesaurus management system based on the object-oriented technique and GUI(graphic user interface) screen, and to enhance the effectiveness of information retrieval. I put emphasis on the expansion of synonym, English and Korean words containing the same concept.

  • PDF