• Title/Summary/Keyword: 장서빈도 (collection frequency)

Search Result 33, Processing Time 0.018 seconds

A Study on Feature Selection for kNN Classifier using Document Frequency and Collection Frequency (문헌빈도와 장서빈도를 이용한 kNN 분류기의 자질선정에 관한 연구)

  • Lee, Yong-Gu
    • Journal of Korean Library and Information Science Society / v.44 no.1 / pp.27-47 / 2013
  • This study investigated the classification performance of a kNN classifier using feature selection methods based on document frequency (DF) and collection frequency (CF). The experiments, which used the HKIB-20000 data set, produced the following results. First, feature selection methods that kept high-frequency terms and removed low-frequency terms by the CF criterion achieved better classification performance than those using the DF criterion. Second, neither the DF nor the CF method performed well when low-frequency terms were selected first. Last, combining the CF and DF criteria did not yield better classification performance than using DF or CF alone.
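The difference between the two criteria in the abstract above can be illustrated with a minimal sketch. It assumes the usual definitions: DF counts the documents that contain a term, and CF counts every occurrence of the term across the collection. The function names `term_frequencies` and `select_features` and the toy documents are hypothetical and are not taken from the paper, whose exact selection procedure is not given here.

```python
from collections import Counter


def term_frequencies(docs):
    """Return (df, cf) Counters for a list of tokenized documents."""
    df, cf = Counter(), Counter()
    for tokens in docs:
        cf.update(tokens)        # CF: every occurrence counts
        df.update(set(tokens))   # DF: each document counts once per term
    return df, cf


def select_features(freq, k):
    """Keep the k highest-frequency terms (ties broken alphabetically)."""
    ranked = sorted(freq.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]


# Hypothetical toy collection: "library" is frequent but confined to one document.
docs = [
    ["library", "library", "library", "library", "book"],
    ["book", "loan"],
    ["book", "reader"],
]
df, cf = term_frequencies(docs)
top_by_df = select_features(df, 1)  # favors terms spread over many documents
top_by_cf = select_features(cf, 1)  # favors terms with many total occurrences
```

In this toy collection the two criteria pick different terms, which is the kind of divergence the study compares.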

A Keyword Analysis of Collection Development Policies of University and Public Libraries Using Text Mining (텍스트 마이닝을 활용한 대학도서관과 공공도서관의 장서개발 정책 키워드 분석)

  • Da-Hyeon Lee;Dong-Hee Shin
    • Journal of the Korean Society for Library and Information Science / v.58 no.1 / pp.285-302 / 2024
  • For this article, we conducted frequency analysis, topic modeling, and network analysis on eleven texts related to collection development policy found in the National Library of Korea. We derived the main keywords related to collection development policies and analyzed the relationships between them. We then conducted a phi coefficient analysis to identify, by category, the characteristics of the collection development policies of university libraries and public libraries. The results showed that "material," "library," "collection development," "user," and "collection" were the main keywords in both frequency analysis and network centrality. Meanwhile, the phi coefficient analysis revealed that keywords such as "university," "construction," "student," "target," and "cost" were prevalent in university libraries, indicating that the academic needs of users and the discussion of digital resources were primary issues, while keywords related to the information needs of various user groups, including "adults," "survey," "feature," and "religion," appeared in public libraries.
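Assuming the coefficient named in the abstract above is the phi coefficient of a 2x2 contingency table, the association between a keyword and a library type could be sketched as follows. The function name `phi_coefficient` and the counts are hypothetical; the abstract does not say how the authors built their tables.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi coefficient for the 2x2 table [[a, b], [c, d]].

    a: university texts containing the keyword
    b: university texts without it
    c: public-library texts containing the keyword
    d: public-library texts without it
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        return 0.0  # undefined when a row or column is empty; 0.0 is a convention
    return (a * d - b * c) / denom


# Hypothetical counts for one keyword across eleven texts.
phi = phi_coefficient(4, 1, 1, 5)
```

A value near +1 would mark a keyword as characteristic of university libraries in this layout, and a value near -1 as characteristic of public libraries.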

Applicability of Two-Poisson Model to Korean Literature (2-포아송 모형의 한국어 문헌 적용성)

  • 최대식;정영미
    • Proceedings of the Korean Society for Information Management Conference / 1999.08a / pp.9-12 / 1999
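  • We reviewed the step-by-step development of the 2-Poisson function, the 3-Poisson function, and the multiple-Poisson function, which aim to use the Poisson model, grounded in statistical probability theory, as a basis for index term selection. In addition, to examine whether the 2-Poisson model is useful for selecting index terms in Korean documents, we computed z values from the collection frequency and document frequency of words, using 50 documents from a Korean corpus database as the test set.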


A Comparative Study of Automatic Indexing Techniques in Pharmacology and Library & Information Science (학문의 주제별 특성에 따른 자동 색인 기법의 비교 연구 - 약학분야와 도서관. 정보학 분야를 중심으로 -)

  • 조수련;사공철
    • Journal of the Korean Society for Information Management / v.5 no.2 / pp.99-126 / 1988
  • The purpose of this study is to present a suitable automatic indexing technique in accordance with the statistical term characteristics of a collection comprising different subjects, by comparing and evaluating two automatic indexing techniques (the Inverse Document Frequency Weighting Technique and the Term Discrimination Value Weighting Technique) in the fields of Pharmacology and Library & Information Science.
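As a rough illustration of the first technique named above, the sketch below computes inverse document frequency weights with one common formulation, idf(t) = log(N / df(t)). This is an assumption: the abstract does not state which variant the paper uses, and the function name `idf_weights` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def idf_weights(docs):
    """Return {term: log(N / df)} for a list of tokenized documents."""
    n_docs = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # count each document once per term
    return {term: math.log(n_docs / count) for term, count in df.items()}


# Hypothetical toy collection mixing the two subject fields.
docs = [
    ["drug", "dose"],
    ["drug", "catalog"],
    ["drug", "index"],
    ["index", "catalog"],
]
weights = idf_weights(docs)
```

Terms that occur in few documents, such as "dose" here, receive the highest weights, while terms spread across most of the collection receive weights close to zero.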


A Study on the Current Status and Improvement of Serial Management of Public Libraries (공공도서관의 연속간행물 장서관리 실태 및 개선 방안 연구)

  • Kim, Hyejin;Cha, Mikyeong
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.28 no.2 / pp.245-271 / 2017
  • Despite the importance and usefulness of serials in public libraries, serials management still lags behind. The purpose of this study is to investigate the problems and propose ways of improving serials management. To this end, the study conducted a literature review, a case study of the collection management policies of 8 regional central libraries and 3 public libraries abroad, and an online questionnaire survey of 31 public libraries in Gyeong-gi Province. Based on the results, this research suggests the need to extend the number and scope of serials subscriptions, improve access points, and establish management guidelines.

An Analytical Study on Research Trends of Collection Development and Management (장서개발관리 분야 최근 연구동향 분석에 대한 연구)

  • Shin, You Mi;Park, Ok Nam
    • Journal of the Korean Society for Information Management / v.36 no.2 / pp.105-131 / 2019
  • The purpose of this study is to identify directions for future scholarship by analyzing recent research trends in the field of collection development and management using keyword network analysis. Data were collected from four library and information science journals for the period from 2003 to 2017. Articles related to collection development and management were retrieved, and author keywords were extracted from the selected papers. Keyword network analysis was performed with the NetMiner4 program, based on frequency analysis, connection-centered analysis, and parametric analysis. To examine changes in research over time, the analysis covers the whole period from 2003 to 2017 as well as three five-year sections. As a result, main keywords such as 'open access', 'institutional repository' and 'academic journals' were identified, along with topics that warrant continued research.

A Management Improvement Study by the Use Survey of an Academic Library - Focused on the Analysis of Circulation Records of the C-Academic Library Users - (대학도서관 이용조사를 통한 경영개선 연구 - C 대학도서관 이용자의 대출기록 분석을 중심으로-)

  • Yoo, Kyeong-Jong;Park, Il-Jong
    • Journal of the Korean Society for Information Management / v.24 no.3 / pp.93-117 / 2007
  • The books and circulation-related data in the Library Automation System (LAS) of C-academic library were collected and analyzed in this paper, and a method that may be applied to Customer Relationship Management (CRM) was suggested based on the results. The collected data comprised 269,387 bibliographic records of books, 12,281 patron records, and 39,269 circulation records. User identity, circulation frequencies, total number of circulated books, and publication year were extracted from the circulation records as relation factors. They were then analyzed and verified by correlation coefficient.

A Study on Obsolescence and Weeding by Citation Analysis - Application to Economics - (인용분석(引用分析)을 통한 문헌(文獻)의 이용률 감소현상(減少現象) 및 장서폐기(藏書廢棄) 연구 - 경제학(經濟學) 분야를 중심으로 -)

  • Shin, Eun-Ja
    • Journal of Information Management / v.24 no.4 / pp.1-23 / 1993
  • The purpose of this study is to investigate and analyze the obsolescence of documents according to their dates, types and locations of publication. The main results are as follows: The half-life of monographs is 12.09 years, while those of articles and research papers are 9.68 and 8.93 years, respectively. Moreover, documents are most often cited by researchers within two years of their publication. Last but not least, the estimated weeding points for monographs, articles, and research papers, assuming that a weeding point is reached when the accumulated citation rate reaches 90%, are 40.15, 32.15, and 29.65 years, respectively.
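The half-life and the 90% weeding point described above are both ages at which the accumulated citation share crosses a threshold. A minimal sketch of that idea follows, assuming citation counts binned by whole years of document age. The function name `age_at_cumulative_share` and the counts are hypothetical, and the paper's own estimation method, which yields fractional years, is not described in the abstract and may well differ.

```python
def age_at_cumulative_share(citations_by_age, share):
    """Return the first age (in years, starting at 1) at which the
    cumulative citation count reaches the given share of all citations."""
    total = sum(citations_by_age)
    running = 0
    for age, count in enumerate(citations_by_age, start=1):
        running += count
        if running >= share * total:
            return age
    return None  # no citations recorded


# Hypothetical citation counts for document ages 1..9 years (100 citations total).
citations = [30, 25, 15, 10, 8, 5, 4, 2, 1]
half_life = age_at_cumulative_share(citations, 0.5)      # 50% threshold
weeding_point = age_at_cumulative_share(citations, 0.9)  # 90% threshold
```

With these counts the cumulative share passes 50% at age 2 and 90% at age 6; the whole-year resolution is a simplification of this sketch.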


An Acquisition Policy Study by the Use Survey of a Public Library: Focused on the Analysis of Circulation Records of the H-public Library Users in 2007 (이용조사를 통한 공공도서관의 수서정책에 관한 연구 - H도서관 이용자의 2007년 대출기록을 중심으로 -)

  • Yoo, Kyeong-Jong;Park, Il-Jong
    • Journal of the Korean Society for Library and Information Science / v.42 no.2 / pp.371-392 / 2008
  • The books and circulation-related data in the Online Public Access Catalog System of the H-Public library were collected and analyzed in this paper, and methods that may be applied to Customer Relationship Management in a public library were suggested based on the results. The collected data comprised 57,927 bibliographic records of books, 11,871 patron records, and 27,145 circulation records. Collection type, circulation frequencies, total number of circulated books, publication year, and use factor were extracted from the circulation records as relation factors. They were then analyzed and verified by various statistical methods such as the correlation coefficient and non-parametric methods.

A Comparative Study of the Use Characteristic of Public Library Collection in Urban and Rural Areas: Focused on the Circulation Data of Four Libraries in the Gyungsangnam-do Province (도시지역과 군지역에 위치한 공공도서관의 자료이용 특성에 관한 비교연구 - 경남지역 4개 공공도서관의 대출기록을 중심으로 -)

  • Yoo, Kyeong-Jong;Park, Il-Jong
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.20 no.1 / pp.39-57 / 2009
  • Two urban public libraries and two rural ones located in Gyungsangnam-do Province were selected for this paper; their 2007 circulation records were collected and analyzed using both MS Excel and SPSS. The collected data were categorized by collection type, circulation frequency, and subject. The four libraries were then compared to identify the characteristics of public libraries in urban and rural areas. The number of circulated books, lent number, use factor, and number of years elapsed since publication were extracted and analyzed using descriptive statistics as well as statistical methods such as the correlation coefficient and nonparametric chi-square analysis.