• Title/Summary/Keyword: 통제 색인

Search Result 68, Processing Time 0.04 seconds

A Study on the Indexing System Using a Controlled Vocabulary and Natural Language in the Secondary Legal Information Full-Text Databases : an Evaluation and Comparison of Retrieval Effectiveness (2차 법률정보 전문데이터베이스에 있어서 통제어 색인시스템과 자연어 색인시스템의 검색효율 평가에 관한 연구)

  • Roh Jeong-Ran
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.32 no.4
    • /
    • pp.69-86
    • /
    • 1998
  • The purpose of velop the indexing algorithm of secondary legal information by the study of characteristics of legal information, to compare the indexing system using controlled vocabulary to the indexing system using natural language in the secondary legal information full-text databases, and to prove propriety and superiority of the indexing system using controlled vocabulary. The results are as follows; 1)The indexing system using controlled vocabulary in the secondary legal information full-text databases has more effectiveness than the indexing system using natural language, in the recall rate, the precision rate, the distribution of propriety, and the faculty of searching for the unique proper-records which the indexing system using natural language fans to find 2)The indexing system which adds more words to the controlled vocabulary in the secondary legal information full-text databases does not better effectiveness in the retail rate, the precision rate, comparing to the indexing system using controlled vocabulary. 3)The indexing system using word-added controlled vocabulary with an extra weight in the secondary legal information full-text databases does not better effectiveness in the recall rate, the precision rate, comparing to the indexing system using word-added controlled vocabulary without an extra weight. This study indicates that it is necessary to have characteristic information the information experts recognize - that is to say, experimental and inherent knowledge only human being can have built-in into the system rather than to approach the information system by the linguistic, statistic or structuralistic way, and it can be more essential and intelligent information system.

  • PDF

A Study on the Design of a Full-Text Indexing System for Thesis (학위논문의 전문색인시스템 설계)

  • 추윤미
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 1996.08a
    • /
    • pp.57-60
    • /
    • 1996
  • 전문데이터베이스는 원문의 접근가능성과 전문탐색의 장점으로 인해 최근 급속하게 발전하고 있다. 그러나 이제까지 대부분의 전문데이터베이스는 문헌의 구조를 고려하지 않고 본문의 문자열에서 자동추출한 색인어를 대상으로 비통제탐색방법을 사용하여 왔으므로 효율적이고 다양한 검색방법을 적용하기 어려웠다. 본 연구에서는 SGML을 이용하여 문헌을 구조화하고 이를 이용한 색인시스템을 설계함으로써, 문헌구조를 이용한 다양한 검색이 가능하도록 하였다. 이를 위해 논문을 대상으로 하여 문헌의 구조를 분석하고, 주요 문헌요소인 초록, 목차, 본문, 참고문헌의 특성을 색인에 반영하였다. 색인시스템은 문헌요소를 태그와 텍스트데이터로 분석하여 색인하는 일차색인과, 일차색인에 의해 만들어진 문헌요소테이블과 내용데이터파일을 이용하여 주요 문헌요소를 색인한 이차색인으로 구성된다.

  • PDF

A Study on Christian Website Indexing (기독교 관련 웹 사이트 내 색인에 관한 연구)

  • Yoo, Yeong-Jun
    • Journal of Korean Library and Information Science Society
    • /
    • v.38 no.4
    • /
    • pp.257-276
    • /
    • 2007
  • Back-of-book-style indexes have a similar function as back-of-book indexes. The best advantage o4 back-of-book-style indexes for Information access on the web is to give direct access to specific subjects of interest. Though back-of-book-style indexes are alphabetically arranged as back-of-book indexes, they have linked index entries to contents on the site by using a anchor tag of HTML. In this research, I have created back-of-book-style indexes in two separated ways, by hand-crafted and semi-automatic Indexing. We have utilized back-of-book-style indexes, that is similar to back-of-book index of traditional information organization method of library and information science, in library circumstances.

  • PDF

Hierarchic Document Clustering in OPAC (OPAC에서 자동분류 열람을 위한 계층 클러스터링 연구)

  • 노정순
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.1
    • /
    • pp.93-117
    • /
    • 2004
  • This study is to develop a hierarchic clustering model fur document classification and browsing in OPAC systems. Two automatic indexing techniques (with and without controlled terms), two term weighting methods (based on term frequency and binary weight), five similarity coefficients (Dice, Jaccard, Pearson, Cosine, and Squared Euclidean). and three hierarchic clustering algorithms (Between Average Linkage, Within Average Linkage, and Complete Linkage method) were tested on the document collection of 175 books and theses on library and information science. The best document clusters resulted from the Between Average Linkage or Complete Linkage method with Jaccard or Dice coefficient on the automatic indexing with controlled terms in binary vector. The clusters from Between Average Linkage with Jaccard has more likely decimal classification structure.

A Study on the Elements of Indexing and Abstracting on the World Wide Web (웹 정보자원의 색인과 초록 요소에 관한 연구)

  • Choi Jae-Hwang
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.33 no.1
    • /
    • pp.113-128
    • /
    • 1999
  • Although traditional printed materials m indexed and abstracted by human beings with tools like thesaurus or controlled vocabulary, information resources on World Wide Web(WWW) are automatically indexed and abstracted without human beings efforts. It would be useful to investigate what major differences are in processes and in elements of indexing and abstracting between traditional printed materials and information resources on WWW. After discussing how WWW search engines work, six major WWW search engines were chosen for this study. Then, 17 indexing elements, 11 abstracting elements, and 2 meta elements were examined. Overall trends and issues for the future development of indexing and abstracting on WWW are also discussed.

  • PDF

A study of indexing system based on thesaurus for newspaper database (시소러스를 이용한 신문기사 데이타베이스 색인시스템에 관한 연구)

  • 한상길
    • Journal of the Korean Society for information Management
    • /
    • v.11 no.1
    • /
    • pp.125-144
    • /
    • 1994
  • The Matter of vmbulary control for newspaper database has been studied for a long time. These efforts hadn't made any good achievements until JOINS Thesaurus system developed. The purpx of this paper is to introduce JOINS Thesaurus whch the Jcong-ang Daily News has developed for the first time in Korea. In addtion to that, thls study is corn- the efficiency of Auto-Indexing system with postcontrolled indexlng system for newspaper database on thesaurus.

  • PDF

A Primary Study on Building the Secondary Legal Information Full-Text Databases (2차 법률정보 전문데이터베이스 구축을 위한 기초 연구)

  • Kweon Kie-Won;Roh Jeong-Ran
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.32 no.3
    • /
    • pp.281-296
    • /
    • 1998
  • This study indicates that it is necessary to have characteristic information the information experts recognize-that is to say, experimental and inherent knowledge only human being can have built-in into the system rather than to approach the information system by the linguistic, statistic or structuralistic way, and it can be more essential and intelligent information system. As this study proves that the cited primary legal information within the secondary legal information functions as the index which represents the contents of the text because of the characteristics of legal information, the automatic indexing in the secondary legal full-text databases can be possible without the assitance of the experts. In case of the establishment, amendment or repealing of law, change of index terms can be possible through revising the legal text cited in the secondary legal information full-text databases. Even when we don't input the full-text about retrospective documents, automatic indexing is also possible, and the establishment and the practice of expert knowledge and integrated databases are possible in case of the retrospective documents.

  • PDF

Automatic Korean to English Cross Language Keyword Assignment Using MeSH Thesaurus (MeSH 시소러스를 이용한 한영 교차언어 키워드 자동 부여)

  • Lee Jae-Sung;Kim Mi-Suk;Oh Yong-Soon;Lee Young-Sung
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.155-162
    • /
    • 2006
  • The medical thesaurus, MeSH (Medical Subject Heading), has been used as a controlled vocabulary thesaurus for English medical paper indexing for a long time. In this paper, we propose an automatic cross language keyword assignment method, which assigns English MeSH index terms to the abstract of a Korean medical paper. We compare the performance with the indexing performance of human indexers and the authors. The procedure of index term assignment is that first extracting Korean MeSH terms from text, changing these terms into the corresponding English MeSH terms, and calculating the importance of the terms to find the highest rank terms as the keywords. For the process, an effective method to solve spacing variants problem is proposed. Experiment showed that the method solved the spacing variant problem and reduced the thesaurus space by about 42%. And the experiment also showed that the performance of automatic keyword assignment is much less than that of human indexers but is as good as that of authors.

A Study on Organizing the Web Using Facet Analysis (패싯 분석을 이용한 웹 자원의 조직)

  • Yoo, Yeong-Jun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.15 no.1
    • /
    • pp.23-41
    • /
    • 2004
  • In indexing and organizing Web resources, there have been two basic methods: automatic indexing by extracting key words and library classification schemes or subject directories of search engines. But, both methods have failed to satisfy the user's information needs, due to the lack of standard criteria and the irrationality of its structural system. In this paper I have examined the limits of library classification scheme's structures and the problems related to the nature of Web resources such as specificity and exhaustivity. I have also attempted to explain the logicality of Web resources organization by facet analysis and its strengths and limitations. In so doing, I have proposed three specific methods in using facet analysis: firstly, indexing system by facet analysis; secondly, the alternative transformation of the enumerative classification scheme into facet classification scheme; and finally, the facet model of subject directory of domestic search engine. After examining the three methods, my study concludes that a controlled vocabulary by facet analysis can be employed as a useful method in organizing Web resources.

  • PDF

Constructing Effective Thesaurus Working Model on Client/Server Environment (클라이언트/서버 환경에서 효율적인 시소러스 작업 모형 설계)

  • 장유진;최종필;김민구
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10b
    • /
    • pp.157-159
    • /
    • 1999
  • 정보 검색 시스템은 사용자의 질의어를 용어들과 용어들 사이의 관계 집합으로 구성된 일종의 용어사전인 시소러스를 이용하여 문헌에 대한 색인과 검색을 정확하고 통제된 용어 형태로 바꾸어 색인과 검색 작업의 효율을 높인다. 클라이언트/서버 환경에서 시소러스를 이용하여 정보 검색 속도의 증가를 기대할 수 있다. 분산된 시소러스는 프로파일 정보를 가지고 운영되며 전문적 시소러스로 만들어진다. 본 논문에서 제안한 시소러스 작업 모형을 시뮬레이션 한 결과를 비교, 분석하고 클라이언트/서버 환경에서 효율적인 시소러스의 여할 및 작업 형태에 대해 제안한다.

  • PDF