• Title/Summary/Keyword: Korean Thesaurus


A Study on the Changes in Standards Related to Controlled Vocabulary and Their Implications (통제어휘 표준의 변화 및 시사점에 대한 연구)

  • Kim, Sung-Won;Kim, Jeong-Woo
    • Journal of the Korean Society for Library and Information Science / v.45 no.1 / pp.211-232 / 2011
  • The thesaurus, a well-known form of controlled vocabulary, has been widely used for indexing and searching information over the last 50 years. International and national standards have also been developed to provide guidelines for building thesauri in diverse subject areas. In recent years, thesaurus-related standards have been revised, among them ISO 25964 and BS 8723. This article examines the current status of these revisions and discusses their implications. Based on this examination, it suggests functional requirements for thesauri in the present information environment and proposes the elements needed to develop these functions.

A Study on the Analysis of AGROVOC for Establishment of Concept Relationships of Ontology (온톨로지의 개념간 관계 설정을 위한 AGROVOC 시소러스의 분석에 관한 연구)

  • Yoo, Yeong-Jun
    • Journal of the Korean Society for Information Management / v.22 no.1 s.55 / pp.125-144 / 2005
  • This study uncovered ambiguity and inconsistency in the semantic relationships of an existing thesaurus by analyzing the concept relationships of AGROVOC, and proposed ontology concept relationships that partially overcome these limitations. Based on this analysis, the study proposed a conceptual model as the most important part of the concept relationships of an ontology and developed semantically refined concept relationship types. These relationships can partially support inference and should be useful for knowledge information systems based on more exact semantic relationships. The study also identified new relationship types that will be useful for extending the concept relationships of existing thesauri. These relationship types were also shown to be useful for an existing thesaurus such as the Legal Thesaurus.

A study of indexing system based on thesaurus for newspaper database (시소러스를 이용한 신문기사 데이타베이스 색인시스템에 관한 연구)

  • 한상길
    • Journal of the Korean Society for Information Management / v.11 no.1 / pp.125-144 / 1994
  • The matter of vocabulary control for newspaper databases has been studied for a long time. These efforts had not produced any good results until the JOINS Thesaurus system was developed. The purpose of this paper is to introduce the JOINS Thesaurus, which the Joong-ang Daily News developed for the first time in Korea. In addition, this study compares the efficiency of the auto-indexing system with that of the post-controlled indexing system for a newspaper database based on the thesaurus.


A Study on Multi-Lingual Thesaurus Database Construction of Scientific and Technical Terms (과학기술용어(科學技術用語) 시소러스 대역(對譯) 데이터베이스 구축(構築))

  • Kim, Eun-Shik
    • Journal of Information Management / v.22 no.2 / pp.1-28 / 1991
  • The objective of this study is to prepare source data for standardizing scientific and technical terms in the Korean language. This will help accelerate the production of Korean databases in scientific and technical fields and will serve as a convenient search tool for accessing foreign databases. The study includes the construction of a multi-lingual thesaurus comprising Korean, English, and Japanese. First, the theoretical background on thesauri is reviewed, and terms are collected from the JICST Thesaurus, English-Japanese List and the JICST Thesaurus, Japanese-English List, published by JICST in 1987. The multi-lingual thesaurus covers 38,318 Korean descriptors, 2,870 Korean non-descriptors, 38,318 English descriptors, 11,910 English non-descriptors, 38,318 Japanese descriptors, and 9,789 Japanese non-descriptors.


Design and Implementation of an Object-Based Thesaurus System: Semi-automated Construction, Abstracted Concept Browsing and Query-Based Reference (객체기반 시소러스 시스템의 설계 및 구현: 반자동화 방식의 구축, 추상화 방식의 개념 브라우징 및 질의기반 참조)

  • Choi, Jae-Hun;Kim, Ki-Heon;Yang, Jae-Dong
    • Journal of KIISE:Databases / v.27 no.1 / pp.64-78 / 2000
  • In this paper, we design and implement a system for managing domain-specific thesauri, in which the object-oriented paradigm is applied to thesaurus construction, concept browsing, and query-based reference. The system provides an object-oriented mechanism to assist domain experts in constructing thesauri: it determines a considerable part of the relationship degrees between terms by inheritance and supplies domain experts with information available from the thesaurus being constructed. This information is especially useful for enforcing consistency between the hierarchies of a thesaurus when each is constructed by different experts at different sites through cooperation. It may minimize the burden on domain experts caused by the exhaustive specification of individual relationships. The system also provides abstracted browsing and query-based reference, which allow users to easily verify thesaurus terms before they are used in ordinary Boolean queries. The verification is made by actively searching for the terms in the thesaurus, and reference queries and abstracted browsing views facilitate this searching. The facility is indispensable, especially when precision counts for much.


A Study on Converting the Theological Thesaurus to the Ontology by Using SKOS (SKOS를 이용한 신학 시소러스의 온톨로지로의 변환에 관한 연구)

  • Yoo, Yeong-Jun
    • Journal of Korean Library and Information Science Society / v.43 no.3 / pp.143-163 / 2012
  • To convert a thesaurus compiled by a person into an ontology, the first step is to translate the thesaurus using SKOS, which is suitable for conversion to an ontology and was chosen as an international standard by W3C. SKOS is suitable for converting a thesaurus, subject headings, or a classification system into an ontology, but a web language such as RDF/XML is needed to describe the ontology. RDF/XML is so difficult to read and write that RDFa embedded in an HTML document, or Turtle, which is easier to write and read, may be needed. Along with the description using SKOS, this research experimentally constructed the ontology with the ontology construction program Protégé 4.2. In addition to the basic concept relationships of a thesaurus, such as equivalence, hierarchical, and associative relationships, the transitive hierarchical relationships suggested by SKOS are included in this research.
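
The mapping described in the abstract above can be illustrated with a minimal Python sketch that emits Turtle by hand. It is not the author's procedure: the `ex:` namespace, the `Term` class, the helper functions, and the sample theological terms are all hypothetical, and only the SKOS property names (`skos:prefLabel`, `skos:altLabel`, `skos:broader`, `skos:related`, `skos:broaderTransitive`) come from the W3C vocabulary. By SKOS convention, `skos:broaderTransitive` is inferred rather than asserted, so the sketch materializes it for illustration only.

```python
from dataclasses import dataclass, field

SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."
EX_PREFIX = "@prefix ex: <http://example.org/theology#> ."  # hypothetical namespace


@dataclass
class Term:
    """One thesaurus descriptor with its UF, BT, and RT entries."""
    ident: str
    pref_label: str
    used_for: list = field(default_factory=list)  # UF -> skos:altLabel
    broader: list = field(default_factory=list)   # BT -> skos:broader
    related: list = field(default_factory=list)   # RT -> skos:related


def broader_transitive(ident, terms):
    """Collect every ancestor reachable by following BT links."""
    seen, stack = [], list(terms[ident].broader)
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.append(parent)
            stack.extend(terms[parent].broader)
    return sorted(seen)


def to_turtle(terms):
    """Serialize the terms as Turtle (labels are assumed to need no escaping)."""
    lines = [SKOS_PREFIX, EX_PREFIX, ""]
    for ident in sorted(terms):
        t = terms[ident]
        props = ["a skos:Concept", f'skos:prefLabel "{t.pref_label}"@en']
        props += [f'skos:altLabel "{uf}"@en' for uf in t.used_for]
        props += [f"skos:broader ex:{b}" for b in t.broader]
        props += [f"skos:related ex:{r}" for r in t.related]
        props += [f"skos:broaderTransitive ex:{a}"
                  for a in broader_transitive(ident, terms)]
        lines.append(f"ex:{ident} " + " ;\n    ".join(props) + " .")
    return "\n".join(lines)


# Hypothetical sample entries, not taken from the thesaurus in the study.
terms = {
    "Theology": Term("Theology", "Theology"),
    "Soteriology": Term("Soteriology", "Soteriology",
                        used_for=["Doctrine of salvation"], broader=["Theology"]),
    "Atonement": Term("Atonement", "Atonement",
                      broader=["Soteriology"], related=["Grace"]),
    "Grace": Term("Grace", "Grace",
                  broader=["Soteriology"], related=["Atonement"]),
}

turtle = to_turtle(terms)
print(turtle)
```

The resulting text could be loaded into an ontology editor such as Protégé; a production conversion would more likely rely on an RDF library than on string formatting.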

A Study on Methods of Documentary Research on Educational Facilities - Focused on the Utilization of the ERIC - (교육시설(敎育施設)에 관한 문헌연구(文獻硏究) 방법(方法) - 미국 ERIC 자료 활용방법을 중심으로 -)

  • Park, Jae-Youn
    • Journal of the Korean Institute of Educational Facilities / v.1 no.1 / pp.33-40 / 1994
  • This study was undertaken to increase efficiency in reviewing documents on school facilities from the network of ERIC (Educational Resources Information Center, USA). An outline of the ERIC network and the structure, role, and function of the ERIC thesaurus are introduced. A thesaurus developed for information retrieval purposes provides the filing labels that permit information to be stored by one person and retrieved by another. As an information system grows, its thesaurus is systematically built and refined to the point where it represents, in a very special sense, the vocabulary of a subject field. The Thesaurus of ERIC Descriptors represents such a vocabulary for the field of education. An understanding of its origins, its function, and its limitations is just as important to the teacher, the student of education, or the educational researcher as it is to the indexer or custodian of the information pool it represents. If the Thesaurus is understood and used in an appropriate way, it can give all educators not only insight into the ERIC system but also an increased awareness of the language of their field. A great many terms are necessary to describe the many aspects of education, and the task of relating them in even an approximately consistent way is an enormous one. The undertaking should be managed by people who not only know what they are talking about but who should also be able to predict what people in their field are likely to be talking about in the near future. It should also enlist people who are willing to pay term to another within the system. To engage a large number of these two kinds of people over a long period of time is very likely to cost a great deal of money. There is very little proprietary value in producing such a list of terms, for it can very easily be copied, adapted, updated, etc. Thus, because of its high cost and low proprietary value, it becomes a task likely to be funded only by a government.
A government has many ways of spending its money. However, after the decision has been made to spend money to produce an authority list, one must decide how this authority is to be delegated. The history of the development of the ERIC Thesaurus is the history of how this authority was delegated. Scientific research has thrived on efforts to define terms as precisely as possible. It is difficult to say with certainty, however, that solutions to social problems have thrived on a simple diet of scientific research. Contemporary crises demand new and imaginative ways of conceiving problems and talking about them. If this Thesaurus or any other scheme for normalizing or controlling language inhibits in the slightest measure the creative use of language, it is against its use. Only if the principles and details of the Thesaurus are misunderstood can it be used as a constraint on language in a negative sense. Students of education of every kind should see the Thesaurus as an opportunity to become increasingly self-conscious about their language and thus about their assumptions and their approaches to educational problems.


The Development of an Automatic Indexing System based on a Thesaurus (시소러스를 기반으로 하는 자동색인 시스템에 관한 연구)

  • 임형묵;정상철
    • Korean Journal of Cognitive Science / v.4 no.1 / pp.213-242 / 1993
  • During the past decades, several automatic indexing systems have been developed, such as single-term indexing, phrase indexing, and thesaurus-based indexing systems. Among these, single-term indexing has been known as superior to the others despite the simplicity of how it extracts meaningful terms. Thesaurus-based indexing, on the other hand, has been regarded as producing a low retrieval rate, mainly because thesauri do not usually have enough index terms, so that much of the text data fails to be indexed when it does not match any index term in the thesaurus. This paper develops a thesaurus-based indexing system, THINS, that yields a higher retrieval rate than other such systems by syntactically analyzing text data and partially matching it with index terms in thesauri. First, the system analyzes the input text syntactically using the machine translation system MATES/EK and extracts noun phrases. After deleting stop words from the noun phrases and stemming the remaining words, it tries to index them with similar index terms in the thesaurus as much as possible. We conduct an experiment with the CACM data set that compares the retrieval effectiveness of THINS with that of a single-term-based system under HYKIS, a thesaurus-based information retrieval system. It turns out that THINS yields about 10 percent higher precision than the single-term-based system, while showing 8 to 9 percent lower recall. This is a marked improvement over previous thesaurus-based systems, which yield 25 or 30 percent lower precision than the single-term-based one. We also argue that the relatively lower recall arises because CRCS, the thesaurus included in the CACM data set, is very incomplete, having only a little over one thousand terms; THINS is therefore expected to produce a much higher rate if it is paired with a currently available large thesaurus.
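
The indexing steps summarized in the abstract above (stop-word deletion, stemming, partial matching against thesaurus terms) can be sketched in Python as follows. This is an illustrative assumption, not the THINS implementation: noun phrases are taken as given rather than extracted with MATES/EK, the stop-word list and the suffix-stripping `stem` function are crude stand-ins, and the Jaccard overlap with a 0.5 threshold is a similarity measure chosen only for this example.

```python
STOP_WORDS = {"the", "a", "an", "of", "for", "and", "in", "on"}  # illustrative list


def stem(word):
    """Crude suffix stripping; a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize(phrase):
    """Lowercase, drop stop words, and stem the remaining words."""
    return {stem(w) for w in phrase.lower().split() if w not in STOP_WORDS}


def index_phrase(phrase, thesaurus_terms, threshold=0.5):
    """Return (term, score) pairs for thesaurus terms that partially match.

    The score is the Jaccard overlap of stem sets, an assumption of this
    sketch rather than the measure used by THINS.
    """
    phrase_stems = normalize(phrase)
    matches = []
    for term in thesaurus_terms:
        term_stems = normalize(term)
        union = phrase_stems | term_stems
        if not union:
            continue  # both sides consisted only of stop words
        score = len(phrase_stems & term_stems) / len(union)
        if score >= threshold:
            matches.append((term, round(score, 2)))
    # Best score first; ties broken alphabetically for stable output.
    return sorted(matches, key=lambda m: (-m[1], m[0]))


# Hypothetical thesaurus entries and noun phrases.
thesaurus = ["information retrieval", "automatic indexing system", "database systems"]
print(index_phrase("the retrieval of information", thesaurus))
print(index_phrase("indexing systems", thesaurus))
```

Partial matching lets a phrase such as "indexing systems" be indexed under "automatic indexing system" even though the two are not identical, which is the kind of case an exact-match thesaurus lookup would miss.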

A Study on the Korean-English Semantic Thesaurus Construction for Knowledge Management System (지식관리시스템을 위한 의미형 한영 시소러스 구축에 관한 연구)

  • 남영준
    • Journal of Korean Library and Information Science Society / v.32 no.4 / pp.77-98 / 2001
  • As the role of the library has changed to that of an integrated knowledge management system, the library needs new information retrieval tools. The purpose of this study is to propose methods and principles for constructing a Korean-English semantic thesaurus for a knowledge management system. The methods and principles are as follows: 1) in collecting terminology, I included not only internal documents but also external documents on the web as sources for descriptor extraction; 2) conceptual descriptors are needed more than semantic ones, and I also proposed that authority files are necessary as a complement; 3) I proposed 15,000 descriptors as the appropriate scale for a thesaurus; and 4) I proposed a hybrid method that uses both manual and automatic processes in establishing relationships.


A Theoretical Study on Indexing Methods using the Metadata for the Automatic Construction of a Thesaurus Browser (시소러스 브라우저 자동구현을 위한 Metadata를 이용한 색인어 처리방안에 대한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society / v.35 no.4 / pp.451-467 / 2004
  • This paper presents theoretical analyses of automatic indexing, which is vital in constructing a thesaurus browser, and of clustering algorithms for building hierarchical relations among terms, as well as methods for the automatic construction of a thesaurus browser. Methods for automatically selecting index terms in web documents are studied by surveying methods for analyzing and processing metadata, which in web documents fulfills the bibliographical roles found in traditional paper documents. The study also suggests adding or embedding metadata in web documents using an automatic metadata editor, because most web documents do not include metadata.
