• Title/Summary/Keyword: Digital libraries

Search Result 512, Processing Time 0.024 seconds

A Feature Re-weighting Approach for the Non-Metric Feature Space (가변적인 길이의 특성 정보를 지원하는 특성 가중치 조정 기법)

  • Lee Robert-Samuel;Kim Sang-Hee;Park Ho-Hyun;Lee Seok-Lyong;Chung Chin-Wan
    • Journal of KIISE:Databases
    • /
    • v.33 no.4
    • /
    • pp.372-383
    • /
    • 2006
  • Among the approaches to image database management, content-based image retrieval (CBIR) is viewed as having the best support for effective searching and browsing of large digital image libraries. Typical CBIR systems allow a user to provide a query image, from which low-level features are extracted and used to find 'similar' images in a database. However, there exists the semantic gap between human visual perception and low-level representations. An effective methodology for overcoming this semantic gap involves relevance feedback to perform feature re-weighting. Current approaches to feature re-weighting require the number of components for a feature representation to be the same for every image in consideration. Following this assumption, they map each component to an axis in the n-dimensional space, which we call the metric space; likewise the feature representation is stored in a fixed-length vector. However, with the emergence of features that do not have a fixed number of components in their representation, existing feature re-weighting approaches are invalidated. In this paper we propose a feature re-weighting technique that supports features regardless of whether or not they can be mapped into a metric space. Our approach analyses the feature distances calculated between the query image and the images in the database. Two-sided confidence intervals are used with the distances to obtain the information for feature re-weighting. There is no restriction on how the distances are calculated for each feature. This provides freedom for how feature representations are structured, i.e. there is no requirement for features to be represented in fixed-length vectors or metric space. Our experimental results show the effectiveness of our approach and in a comparison with other work, we can see how it outperforms previous work.

A Study on the Library Services for the Solution of the Information Inequality of the Low-Income People in Korea (저소득계층의 정보불평등 해소를 위한 도서관서비스 관련 연구)

  • Ahn, In-Ja;Noh, Younghee;Chang, Rosa
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.29 no.4
    • /
    • pp.113-137
    • /
    • 2018
  • This study examined relevant literature to review the concept of low-income class, its criteria and types. Based on the cases surveyed, the service status of public libraries for low-income class in South Korea was determined. Based on the findings of this study, we proposed new library services necessary for low-income class, such as the implementation of relevant programs aimed at resolving the digital information divide of low-income class, the introduction of employment programs for the self-economic-support for low-income class adults, and the introduction of reading and counseling therapy services. In addition, this study proposed five policies for activating library services for the low-income class: (1) To revise the act on library law so as to clarify the concept of low-income class, its criteria and types, (2) to revise the master plan for library development so as to bolster the library services for low-income class, (3) to conduct a national-level survey of the low-income class status and of the library service status, (4) to determine the actual features of the low-income class and their information demands, and (5) to prepare library services tailored to the diverse types of low-income class.

A Study on the Research Trends on Literacy in Library and Information Science (문헌정보학 분야의 리터러시 연구 동향 분석)

  • Jang, Su Hyun;Nam, Young Joon
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.3
    • /
    • pp.263-292
    • /
    • 2022
  • The purpose of this study is to identify the topics of research related to the concepts of literacy in the field of Library and Information Science which is related to user education in libraries. Data were collected from the WoS and KCI databases, and complementary keyword analysis and topic modeling analysis techniques were used to identify topics of literature-related research articles in the field of Library and Information Science. Findings presented that there was a difference in keywords and topics between the two databases. Literacy-related topics identified from the KCI database were classified into three groups through topic modeling. Also, it was analyzed that there is a difference between the overall literacy-related research trend, the timing of the surge in research volume, and key frequent keywords in the Library and Information Science field confirmed in the study. In particular, in the study of literacy in all fields, a number of words such as 'literacy', 'education', 'media', and 'digital' were derived. However, in literature research in the field of Library and Information Science, keywords such as 'information utilization ability' and 'school library' appeared. Based on this, it was concluded that research on the ability to develop an evaluative eye for information is needed in line with today's information environment, where information is rapidly increasing in Korea in the future.

Research Trends on Japanese Confucianism and Kokugaku Thought in 2008 (2008년도 일본유학 및 국학사상 연구동향)

  • Lim, taihong
    • The Journal of Korean Philosophical History
    • /
    • no.29
    • /
    • pp.311-349
    • /
    • 2010
  • This report introduces the papers on Japanese Confucianism and Kokugaku thought written in Japanese, Korean, Chinese language and English during 2008. In this paper the data is based on the periodicals index databases of the digital libraries such as the National Diet Library of Japan, the China Academic Journal of China, the National Central Library of Taiwan and the National Assembly Library of Korea and so on. There were 42 articles published on the Japanese Confucian School. In the articles, 29 ones were written in Japanese, 7 in Korean, 4 in Chinese, and 2 in English. 54 articles were published on Yangming School, 41 written in Japanese, 2 in Korean, 10 in Chinese, 1 in English. 50 ones also published on Kohaku School or Mitogaku School. In the articles there were 32 ones written in Japanese, 7 in Korean, 9 in Chinese, 2 in English. And 58 ones on Kokugaku School were published, 51 were written in Japanese, 4 in Korean, 1 in Chinese, 2 in English. Totally 204 articles were written in Japanese, Korean, Chinese, or English language in 2008 throughout the world. This report is divided into 4 chapters, such as Chapter 1 - Syusigaku School, Chapter 2 - Youmeigaku school, Chapter 3 - Kohaku School and Mitogaku School and Chapter 4 - Kokugaku School. In each chapter, some articles are briefly introduced and some are in detail.

A Study on the Collection and Application Measures for Media Platform Based Materials (매체 플랫폼 기반 자료의 수집 및 적용 방안 연구)

  • Younghee Noh;Youngmi Jung;Aekyoung Son;Inho Chang;Hyunju Cha
    • Journal of Korean Library and Information Science Society
    • /
    • v.55 no.1
    • /
    • pp.193-214
    • /
    • 2024
  • This study aimed to propose a method for collecting and applying media platform based materials at the National Library of Korea. Firstly, we analyzed the current status and limitations of data collection based on domestic media platforms, including the National Library of Korea. Secondly, a literature review method was used to investigate the current status and types of digital content based on media platforms. Thirdly, we identified the types of materials based on media platforms that are not currently included in the National Central Library's online material collection guidelines through the examination of cases from major overseas libraries. Fourthly, after reviewing technical and legal elements such as the definition of collection targets and scope for each new media, and collection methods, we established collection criteria. Fifthly, based on the research results, the policies proposed in this study are as follows: 1) there is a need to establish a clear legal basis for the collection of media platform based materials; 2) the development and presentation of collection guidelines for media platform based materials is necessary; 3) the development of collection tools and infrastructure for media platform based materials is required; 4) for the collection of media platform based materials, it is necessary to obtain permission for collection from targeted social media organizations, and to cooperate in linkage with organizations that produce and service extended reality content; 5) for the service activation of media platform based materials, it is necessary to improve accessibility for the usage activation of these materials, to enhance the content extensibility and ease of use of the e-deposit system including extended reality content, and to advance and construct spaces for reproducing extended reality content.

The Way of Connecting to Tradition through Content (콘텐츠를 통해 전통을 잇는 방식 - 단원미술관 전시사례를 중심으로)

  • Kim, Sangmi
    • Trans-
    • /
    • v.9
    • /
    • pp.17-36
    • /
    • 2020
  • This study is aimed at discussing the possibility of content production, utilization and expansion, focusing on the exhibition case of Danwon Art Museum run by Ansan Cultural Foundation. In 1991, the Ministry of Culture, Sports and Tourism named Ansan as the City of Danwon since it is believed to be the hometown of Danwon Kim Hong-do (1745~?), a painter of the late Joseon Dynasty and a well-known master of genre painting. As a result, Ansan is making various efforts to utilize Danwon Kim Hong-do for its unique resource through internal and external business such as the creation of Danwon Sculpture Park, the operation of Danwon Art Museum, and the planning of Danwon Kim Hong-do Festival. However, the biggest problem with Ansan is that there are not many collections of Kim Hong-do. Ansan has owned a total of six works as of May this year: a deer and a boy, flowers and a bird, A view of clouds on the water, Daegwallyeong, Yeodongbin, A way to Singwangsa. Accordingly, Danwon Contents Center has set up a vision to systematically collect, preserve, and display various visual and artistic materials related to Kim Hong-do, offering high-quality information based on digital data. In other words, it is a complex cultural information agency of One-Source Multi-Use, which combines the functions of libraries, archives and art galleries so that visitors' desire is satisfied. It reflects the contemporary trend of overcoming the limitations of the ancient paintings and satisfying the role and function of the art museum. From the opening of the Danwon Contents Hall, the original work of Kim Hong-do has been interpreted and produced as media contents or recreated as a new form of art by modern artists. Exhibition using technologies such as touch screen and 'deep zoom' helps visitors to heighten their experience of the archives and get inside the world of the genius painter.

  • PDF

Ontology Design for the Register of Officials(先生案) of the Joseon Period (조선시대 선생안 온톨로지 설계)

  • Kim, Sa-hyun
    • (The)Study of the Eastern Classic
    • /
    • no.69
    • /
    • pp.115-146
    • /
    • 2017
  • This paper is about the research on ontology design for a digital archive of seonsaengan(先生案) of the Joseon Period. Seonsaengan is the register of staff officials at each government office, along with their personal information and records of their transfer from one office to another, in addition to their DOBs, family clan, etc. A total of 176 types of registers are known to be kept at libraries and museums in the country. This paper intends to engage in the ontology design of 47 cases of such registers preserved at the Jangseogak Archives of the Academy of Korean Studies (AKS) with a focus on their content and structure including the names of the relevant government offices and posts assumed by the officials, etc. The work for the ontology design was done with a focus on the officials, the offices they belong to, and records about their transfers kept in the registers. The ontology design categorized relevant resources into classes according to the attributes common to the individuals. Each individual has defined a semantic postposition word that can explicitly express the relationship with other individuals. As for the classes, they were divided into eight categories, i.e. registers, figures, offices, official posts, state examination, records, and concepts. For design of relationships and attributes, terms and phrases such as Dublin Core, Europeana Data Mode, CIDOC-CRM, data model for database of those who passed the exam in the past, which are already designed and used, were referred to. Where terms and phrases designed in existing data models are used, the work used Namespace of the relevant data model. The writer defined the relationships where necessary. The designed ontology shows an exemplary implementation of the Myeongneung seonsaengan(明陵先生案). The work gave consideration to expected effects of information entered when a single registered is expanded to plural registers, along with ways to use it. The ontology design is not one made based on the review of all of the 176 registers. The model needs to be improved each time relevant information is obtained. The aim of such efforts is the systematic arrangement of information contained in the registers. It should be remembered that information arranged in this manner may be rearranged with the aid of databases or archives existing currently or to be built in the future. It is expected that the pieces of information entered through the ontology design will be used as data showing how government offices were operated and what their personnel system was like, along with politics, economy, society, and culture of the Joseon Period, in linkage with databases already established.

Methods for Integration of Documents using Hierarchical Structure based on the Formal Concept Analysis (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

  • Kim, Tae-Hwan;Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.3
    • /
    • pp.63-77
    • /
    • 2011
  • The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources as personal home pasges, online digital libraries and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte.syze precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not nessarily appear at the top of the query output order. Also, current search tools can not retrieve the documents related with retrieved document from gigantic amount of documents. The most important problem for lots of current searching systems is to increase the quality of search. It means to provide related documents or decrease the number of unrelated documents as low as possible in the results of search. For this problem, CiteSeer proposed the ACI (Autonomous Citation Indexing) of the articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. For details of this work, references contained in academic articles are used to give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the articleswith the cited works. Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language, and words in thetitle, keywords or document. A citation index allows navigation backward in time (the list of cited articles) and forwardin time (which subsequent articles cite the current article?) But CiteSeer can not indexes the links between articles that researchers doesn't make. Because it indexes the links between articles that only researchers make when they cite other articles. Also, CiteSeer is not easy to scalability. Because CiteSeer can not indexes the links between articles that researchers doesn't make. All these problems make us orient for designing more effective search system. This paper shows a method that extracts subject and predicate per each sentence in documents. A document will be changed into the tabular form that extracted predicate checked value of possible subject and object. We make a hierarchical graph of a document using the table and then integrate graphs of documents. The graph of entire documents calculates the area of document as compared with integrated documents. We mark relation among the documents as compared with the area of documents. Also it proposes a method for structural integration of documents that retrieves documents from the graph. It makes that the user can find information easier. We compared the performance of the proposed approaches with lucene search engine using the formulas for ranking. As a result, the F.measure is about 60% and it is better as about 15%.

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.21 no.1
    • /
    • pp.103-122
    • /
    • 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful for them. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide the users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role for document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask the authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could not benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle; manually assigning keywords to all documents is a daunting task, or even impractical in that it is extremely tedious and time-consuming requiring a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are mainly two approaches to achieving this aim: keyword assignment approach and keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given set of vocabulary, and the aim is to match them to the texts. In other words, the keywords assignment approach seeks to select the words from a controlled vocabulary that best describes a document. Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. On the other hand, in the latter approach, the aim is to extract keywords with respect to their relevance in the text without prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted based on supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems such as Extractor and Kea were developed using keyword extraction approach. Most indicative words in a document are selected as keywords for that document and as a result, keywords extraction is limited to terms that appear in the document. Therefore, keywords extraction cannot generate implicit keywords that are not included in a document. According to the experiment results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Inversely, it also means that 10% to 36% of the keywords assigned by the authors do not appear in the article, which cannot be generated through keyword extraction algorithms. Our preliminary experiment result also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment namely IVSM(Inverse Vector Space Model). The model is based on a vector space model. which is a conventional information retrieval model that represents documents and queries by vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: IVSM system for Web-based community service and stand-alone IVSM system. Firstly, the IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and, indeed, it has been tested through a number of academic papers including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to Web-based community service and academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows comparable performance to Extractor that is a representative system of keyword extraction approach developed by Turney. As electronic documents increase, we expect that IVSM proposed in this paper can be applied to many electronic documents in Web-based community and digital library.

A Study on the Research Trends in Library & Information Science in Korea using Topic Modeling (토픽모델링을 활용한 국내 문헌정보학 연구동향 분석)

  • Park, Ja-Hyun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.1
    • /
    • pp.7-32
    • /
    • 2013
  • The goal of the present study is to identify the topic trend in the field of library and information science in Korea. To this end, we collected titles and s of the papers published in four major journals such as Journal of the Korean Society for information Management, Journal of the Korean Society for Library and Information Science, Journal of Korean Library and Information Science Society, and Journal of the Korean BIBLIA Society for library and Information Science during 1970 and 2012. After that, we applied the well-received topic modeling technique, Latent Dirichlet Allocation(LDA), to the collected data sets. The research findings of the study are as follows: 1) Comparison of the extracted topics by LDA with the subject headings of library and information science shows that there are several distinct sub-research domains strongly tied with the field. Those include library and society in the domain of "introduction to library and information science," professionalism, library and information policy in the domain of "library system," library evaluation in the domain of "library management," collection development and management, information service in the domain of "library service," services by library type, user training/information literacy, service evaluation, classification/cataloging/meta-data in the domain of "document organization," bibliometrics/digital libraries/user study/internet/expert system/information retrieval/information system in the domain of "information science," antique documents in the domain of "bibliography," books/publications in the domain of "publication," and archival study. The results indicate that among these sub-domains, information science and library services are two most focused domains. Second, we observe that there is the growing trend in the research topics such as service and evaluation by library type, internet, and meta-data, but the research topics such as book, classification, and cataloging reveal the declining trend. Third, analysis by journal show that in Journal of the Korean Society for information Management, information science related topics appear more frequently than library science related topics whereas library science related topics are more popular in the other three journals studied in this paper.