• Title/Summary/Keyword: subject indexing

Search Result 68, Processing Time 0.02 seconds

Developing Subject Headings for Children's Picture Books based on A to Zoo (어린이 그림책을 위한 주제명표 개발 연구: 『A to Zoo』를 바탕으로)

  • Park, Ziyoung
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.4
    • /
    • pp.251-271
    • /
    • 2012
  • Subject headings support the effective access of children's picture books. However, it is difficult to select subject terms from titles or table of contents in children's picture books because of their relatively little textual information. Therefore, it is necessary to assign subject terms to each picture book. However, it is not adequate to use general subject headings because the types and levels of general subject headings are different from special subject headings for the children's materials. For this reason, this study aims to develop subject headings for children's picture books. The subject terms in A to Zoo were selected, and the selected terms were translated into Korean and modified for the Korean culture and language. Other reference books, such as Elementary Korean Dictionary, were also used to determine adequate terms for children. The resulting subject headings were assigned to the recommended picture books for children and used to search by subject, browse, and recommend books.

Publication Metrics and Subject Categories of Biomechanics Journals

  • Duane Victor Knudson
    • Journal of Information Science Theory and Practice
    • /
    • v.11 no.4
    • /
    • pp.40-50
    • /
    • 2023
  • Research in interdisciplinary fields like biomechanics is published in a variety of journals whose visibility depends on bibliometric indexing that is often driven by citation analysis of bibliometric databases. This study documented variation in publication metrics and research subject categories assigned to 14 biomechanics journals. Authors, citation, and citation rate (CR) were collected for the top 15 cited articles in the journals retrieved from the Google Scholar service. Research subject categories were also extracted for journals from three databases (Dimensions, Journal Citation Reports, and Scopus). Despite the focus on biomechanics for the journals studied, these biomechanics journals have widely varying CR and subject categories assigned to them. There were significant (p=0.001) and meaningful (77-108%) differences in median CR between average, low, and high CR groups of these biomechanics journals. Since CR are primary data used to calculate most journal metrics and there is no one biomechanics subject category, field normalization for journal citation metrics in biomechanics is difficult. Care must be taken to accurately interpret most citation metrics of biomechanics journals as biased proxies of general usage of research, given a specific database, time frame, and area of biomechanics research.

A Study on Frequency of Subject on Content of Thesis in Field of Science and Technology (과학기술분야 학위논문 내용목차에 따른 주제어 출현빈도에 관한 연구)

  • Lee, Hye-Young;Kwak, Seung-Jin
    • Journal of the Korean Society for information Management
    • /
    • v.25 no.1
    • /
    • pp.191-210
    • /
    • 2008
  • We would generally use subject terms such as subject indexing for searching and accessing documents. So then, there must be any relationship between document's full-text and its subject terms. This study is started in this question. Master's theses in field of science and technology are worked with because full-text is relatively formatted. This study is to study locations of subject term on Thesis, distribution patterns of subject terms on content of full-text; 'Contents', 'Introduction', 'Theory', 'Main subject', 'Conclusion' and 'References'. Thesis were averagely composed of 1226.3 terms. And Subject terms were averagely compose of $12{\sim}13$ terms. As a result, 'Contents' and 'Introduction' have had the most frequency of subject.

A Comparative Study on the Structures of Indexing Languages between LC Subject Headings and Thesaurus (LC주제명표목표와 시소러스의 색인어 구조 비교연구)

  • 김주성;김태수
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 1995.08a
    • /
    • pp.111-114
    • /
    • 1995
  • 전산환경에서 유용한 색인도구로서의 통제어휘집을 구성하는 원칙과 방법을 제시하고자 전조합색인용 통제어휘집인 LC주제명표목표의 표목구조와 후조합색인용 통제어휘집인 시소러스의 용어구조를 비교하였다. 주제명표목표에서 사용되는 도치표목, 전치사로 연결된 표목, 접속사로 연결된 표목, 세목을 가진 표목을 시소러스에서 사용되는 색인구조와 비교분석 하였다. 주제명표목표가 참조구조를 시소러스체제로 변환시켰을 때 나타나는 문제점도 파악하였다.

  • PDF

Development of Extracting System for Meaning·Subject Related Social Topic using Deep Learning (딥러닝을 통한 의미·주제 연관성 기반의 소셜 토픽 추출 시스템 개발)

  • Cho, Eunsook;Min, Soyeon;Kim, Sehoon;Kim, Bonggil
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.14 no.4
    • /
    • pp.35-45
    • /
    • 2018
  • Users are sharing many of contents such as text, image, video, and so on in SNS. There are various information as like as personal interesting, opinion, and relationship in social media contents. Therefore, many of recommendation systems or search systems are being developed through analysis of social media contents. In order to extract subject-related topics of social context being collected from social media channels in developing those system, it is necessary to develop ontologies for semantic analysis. However, it is difficult to develop formal ontology because social media contents have the characteristics of non-formal data. Therefore, we develop a social topic system based on semantic and subject correlation. First of all, an extracting system of social topic based on semantic relationship analyzes semantic correlation and then extracts topics expressing semantic information of corresponding social context. Because the possibility of developing formal ontology expressing fully semantic information of various areas is limited, we develop a self-extensible architecture of ontology for semantic correlation. And then, a classifier of social contents and feed back classifies equivalent subject's social contents and feedbacks for extracting social topics according semantic correlation. The result of analyzing social contents and feedbacks extracts subject keyword, and index by measuring the degree of association based on social topic's semantic correlation. Deep Learning is applied into the process of indexing for improving accuracy and performance of mapping analysis of subject's extracting and semantic correlation. We expect that proposed system provides customized contents for users as well as optimized searching results because of analyzing semantic and subject correlation.

open-japanese-mesh: assigning MeSH UIDs to Japanese medical terms via open Japanese-English glossaries

  • Yamada, Ryota;Tatieisi, Yuka
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.22.1-22.3
    • /
    • 2020
  • The Medical Subject Headings (MeSH) thesaurus is a controlled vocabulary for indexing biomedical documents that is used for document retrieval and other natural language processing purposes. However, although the oariginal English MeSH is freely available, its Japanese translation has a restricted license. We attempted to create an open alternative, and for this purpose we made a script for assigning MeSH UIDs to Japanese medical terms using Japanese-English glossaries. From the MeSpEn glossary and MEDUTX dictionary, we generated a 12,457-word Japanese-MeSH dictionary.

A Study of Designing the Han-Guel Thesaurus Browser for Automatic Information Retrieval (자동정보검색을 위한 한글 시소러스 브라우저 구축에 관한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society
    • /
    • v.31 no.2
    • /
    • pp.279-302
    • /
    • 2000
  • This study is to develop a new automatic system for the Korean thesaurus browser by which we can automatically control all the processes of searching queries such as, representation, generation, extension and construction of searching strategy and feedback searching. The system in this study is programmed by Delphi 4.0(PASCAL) and consists of database system, automatic indexing, clustering technique, establishing and expressing thesaurus, and automatic information retrieval technique. The results proved by this system are as follows: 1)By using the new automatic thesaurus browser developed by the new algorithm, we can perform information retrieval, automatic indexing, clustering technique, establishing and expressing thesaurus, information retrieval technique, and retrieval feedback. Thus it turns out that even the beginner user can easily access special terms about the field of a specific subject. 2) The thesaurus browser in this paper has such merits as the easiness of establishing, the convenience of using, and the good results of information retrieval in terms of the rate of speed, degree, and regeneration. Thus, it t m out very pragmatic.

  • PDF

A Study on the Changes in Standards Related to Controlled Vocabulary and Their Implications (통제어휘 표준의 변화 및 시사점에 대한 연구)

  • Kim, Sung-Won;Kim, Jeong-Woo
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.45 no.1
    • /
    • pp.211-232
    • /
    • 2011
  • Thesaurus, a well-known form of controlled vocabulary, has been widely used for indexing and searching of information during the last 50 years. There also have been developments of international and national standards to provide guidelines for developing thesaurus in diverse subject areas. In recent years, the revisions of thesaurus-related standards have been made. Among them are ISO 25964 and BS 8723. This article examines the current status of revision of these standards, and discusses its implications. Based on this examination, it suggests functional requirements of thesaurus in the present information environment, and also proposes elements needed for the development of these functions.

Comparative Analysis of Index Terms and Social Tags: Medical Subject Headings vs. BibSonomy and Delicious

  • Lee, Danielle H.
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.2
    • /
    • pp.291-311
    • /
    • 2015
  • This paper demonstrates the comparative analysis of the similarity and difference between Medical Subject Headings (MeSH) and social tags. Both types of metadata have the same purpose - that is, succinctly abstracting content of a given document - but are created from heterogeneous viewpoints. The former MeSH terms show the aspects of publication related professionals, whereas the latter social tags are from the perspectives of general readers. When both types of metadata are assigned to the same publications, do they consist of different nomenclatures reflecting the heterogeneous viewpoints or are they similar, since both metadata types describe the same publications? Social tags are also compared with family terms of MeSH terms in the given MeSH hierarchy, so as to understand the specificity of social tags, related to MeSH terms. Lastly, given the fact that readers assign social tags in casual ways without any restricted vocabulary, we tested how many social tags contain consumer health terms, which are familiar to laypeople. Through these comparisons, we ultimately aim to examine how much the highly controlled publication index reflects general readers' cognitive understandings and stress the necessity of general readers' involvement in the publication indexing process.

A Study on Automatic Classification of Subject Headings Using BERT Model (BERT 모형을 이용한 주제명 자동 분류 연구)

  • Yong-Gu Lee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.2
    • /
    • pp.435-452
    • /
    • 2023
  • This study experimented with automatic classification of subject headings using BERT-based transfer learning model, and analyzed its performance. This study analyzed the classification performance according to the main class of KDC classification and the category type of subject headings. Six datasets were constructed from Korean national bibliographies based on the frequency of the assignments of subject headings, and titles were used as classification features. As a result, classification performance showed values of 0.6059 and 0.5626 on the micro F1 and macro F1 score, respectively, in the dataset (1,539,076 records) containing 3,506 subject headings. In addition, classification performance by the main class of KDC classification showed good performance in the class General works, Natural science, Technology and Language, and low performance in Religion and Arts. As for the performance by the category type of the subject headings, the categories of plant, legal name and product name showed high performance, whereas national treasure/treasure category showed low performance. In a large dataset, the ratio of subject headings that cannot be assigned increases, resulting in a decrease in final performance, and improvement is needed to increase classification performance for low-frequency subject headings.