• Title/Summary/Keyword: 레코드화

Search Result 96, Processing Time 0.02 seconds

A Study on Applicability of Machine Learning for Book Classification of Public Libraries: Focusing on Social Science and Arts (공공도서관 도서 분류를 위한 머신러닝 적용 가능성 연구 - 사회과학과 예술분야를 중심으로 -)

  • Kwak, Chul Wan
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.32 no.1
    • /
    • pp.133-150
    • /
    • 2021
  • The purpose of this study is to identify the applicability of machine learning targeting titles in the classification of books in public libraries. Data analysis was performed using Python's scikit-learn library through the Jupiter notebook of the Anaconda platform. KoNLPy analyzer and Okt class were used for Hangul morpheme analysis. The units of analysis were 2,000 title fields and KDC classification class numbers (300 and 600) extracted from the KORMARC records of public libraries. As a result of analyzing the data using six machine learning models, it showed a possibility of applying machine learning to book classification. Among the models used, the neural network model has the highest accuracy of title classification. The study suggested the need for improving the accuracy of title classification, the need for research on book titles, tokenization of titles, and stop words.

A Preliminary Study for Revision of KORMARC Bibliographic Format (KORMARC 통합서지용 개정을 위한 기초연구)

  • Rho, Jee-Hyun;Lee, Mihwa
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.1
    • /
    • pp.149-170
    • /
    • 2022
  • This study aims to present a revised draft of KORMARC bibliographic format that was first revised in 2014. To this end, data collection and analysis methods are as follows. First, the opinions of cataloging working groups for revision of KORMARC including the National Library of Korea (NLK) and KERIS were collected and analyzed. Second, work guidelines used in libraries or cooperative catalog systems were analyzed. Third, by comparing MARC21 update No.32 (June 2021) with the KORMARC bibliographic format, elements that need to be reflected in the KORMARC revision were derived. Finally, issues that have been raised in previous studies or that require further discussion were examined. Based on these analysis results, an outlines of the extension and deletion of KORMARC fields, subfields, and indicators, and supplementation of application guidelines and examples were derived. The final revision directions were finalized after collecting expert opinions, the working group reviews, and comprehensive opinions from the library community.

Improving the Quality of Bibliographic Data in Public Libraries: Focusing on Public Libraries in Busan Metropolitan City (공공도서관 서지데이터의 품질 제고 방안)

  • Jee-Hyun Rho;Eun-Ju Lee
    • Journal of Korean Library and Information Science Society
    • /
    • v.54 no.3
    • /
    • pp.105-128
    • /
    • 2023
  • In 2020, the Busan metropolitan library took the lead in establishing an integrated library system (ILS) that integrates bibliographic data from 49 public libraries and 103 small public libraries. However, each library still builds bibliographic data individually and repeatedly, and the bibliographic data built by each library is only physically stored in an integrated DB. Therefore the improvement in work efficiency or data quality has not been achieved. This study aimed to analyze the construction processes and quality of bibliographic data in Busan public libraries and to suggest a new implementation strategy for an integrated environment. To this end, (1) the construction process of bibliographic data was investigated, (2) the quality of the constructed bibliographic data was objectively analyzed, and (3) four implementation strategies were suggested based on critical problems. The implementation strategy aims not only to improve the quality of bibliographic data, but also to increase work efficiency and build an infrastructure for data sharing.

Performance Comparison of Clustering using Discritization Algorithm (이산화 알고리즘을 이용한 계층적 클러스터링의 실험적 성능 평가)

  • Won, Jae Kang;Lee, Jeong Chan;Jung, Yong Gyu;Lee, Young Ho
    • Journal of Service Research and Studies
    • /
    • v.3 no.2
    • /
    • pp.53-60
    • /
    • 2013
  • Datamining from the large data in the form of various techniques for obtaining information have been developed. In recent years one of the most sought areas of pattern recognition and machine learning method is created with most of existing learning algorithms based on categorical attributes to a rule or decision model. However, the real-world data, it may consist of numeric attributes in many cases. In addition it contains attributes with numerical values to the normal categorical attribute. In this case, therefore, it is required processes in order to use the data to learn an appropriate value for the type attribute. In this paper, the domain of the numeric attributes are divided into several segments using learning algorithm techniques of discritization. It is described Clustering with other data mining techniques. Large amount of first cluster with characteristics is similar records from the database into smaller groups that split multiple given finite patterns in the pattern space. It is close to each other of a set of patterns that together make up a bunch. Among the set without specifying a particular category in a given data by extracting a pattern. It will be described similar grouping of data clustering technique to classify the data.

  • PDF

LRM's Characterics and Applications Plan Through Comparing with FRBR (FRBR과 비교를 통한 LRM의 특징 및 적용방안)

  • Lee, Mihwa
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.2
    • /
    • pp.355-375
    • /
    • 2022
  • This study is to grasp LRM's feature and applications plan to reflect LRM to cataloging related standards and individual system through comparing and analyzing LRM with the FR model in terms of entities, attributes, and relationships. The application plan is suggested as follows. First, the entity can be extended by defining sub-entities of each entity in the standards and the individual system in order to reflect LRM, even though entities such as families, groups, identifiers, authorized access points, concepts, objects, events, agency and rules have been deleted in LRM. Second, the attribute should be subdivided in the standards and the individual system in order to apply LRM, though many attributes have been changed to relationships for linked data and decreased in LRM. In particular, more specific and detailed property names in the standards and the individual system should be clearly presented, and the vocabulary encoding scheme corresponding to each property should be also developed, since properties with similar functions or repetition in various entities, and material specific properties are generalized and integrated into comprehensive property names. Third, the relationship should be extended through newly declaring the refinement or subtype of the relationship and considering a multi-level relationship, since the relationship itself is general and abstract under increasing the number of relationships in comparing to the property. This study will be practically utilized in cataloging related standards and individual system for applying LRM.

A Study on the Comparative Analysis of the Description Rules of ISBD and KCR4 (ISBD 통합판과 KCR4 기술규칙 비교 연구)

  • Lee, Mihwa
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.24 no.2
    • /
    • pp.185-203
    • /
    • 2013
  • This study was to suggest the new rules for revision of KCR4 by comparing between ISBD consolidated edition and KCR4. The study methods was to compare the rules in each element after mapping the description elements in each area of ISBD and KCR4. Resultingly, first, content forms and media types must be included for describing resource types. Second, it is needed for rules about the common title and the dependent title. Third, it is needed for rules about "parallel" such as parallel title, parallel other title information, parallel statement of responsibility relating to title, parallel edition statement, parallel statement of responsibility relating to edition, parallel numbering system, parallel place of publication, production and distribution, et. al. Fourth, the rules about material or type of resource specific area must be regulated in terms of the contents of the resource. Fifth, the home country principle must be not applied in describing the place of publication, production and distribution for the consistency. Sixth, it is needed to regulate the extent, other physical details, dimensions, and accompanying material statement for all materials instead of the material description according to material types. Seventh, rule number of notes must be agreed to number of main rules. Eighth, it is needed for detailed rules about resource identifier. This study might be contributed to revise the KCR4.

Spatial View Materialization Technique by using R-Tree Reconstruction (R-tree 재구성 방법을 이용한 공간 뷰 실체화 기법)

  • Jeong, Bo-Heung;Bae, Hae-Yeong
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.377-386
    • /
    • 2001
  • In spatial database system, spatial view is supported for efficient access method to spatial database and is managed by materialization and non-materialization technique. In non-materialization technique, repeated execution on the same query makes problems such as the bottle-neck effect of server-side and overloads on a network. In materialization technique, view maintenance technique is very difficult and maintenance cost is too high when the base table has been changed. In this paper, the SVMT (Spatial View Materialization Technique) is proposed by using R-tree re-construction. The SVMT is a technique which constructs a spatial index according to the distribution ratio of objects in spatial view. This ratio is computed by using a SVHR (Spatial View Height in R-tree) and SVOC (Spatial View Object Count). If the ratio is higher than the average, a spatial view is materialized and the R-tree index is re-used. In this case, the root node of this index is exchanged a node which has a MBR (Minimum Boundary Rectangle) value that can contains the whole region of spatial view at a minimum size. Otherwise, a spatial view is materialized and the R-tree is re-constructed. In this technique, the information of spatial view is managed by using a SVIT (Spatial View Information Table) and is stored on the record of this table. The proposed technique increases the speed of response time through fast query processing on a materialized view and eliminates additional costs occurred from repeatable query modification on the same query. With these advantages, it can greatly minimize the network overloads and the bottle-neck effect on the server.

  • PDF

ECG Baseline Wandering Removing Algorithm using Slope analysis and Curve Point Detection (기울기 분석과 굴곡점 검출을 이용한 ECG 기저선 잡음 제거 알고리즘)

  • Cho, Ik-Sung;Kwon, Hyeog-Soong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.9
    • /
    • pp.2105-2112
    • /
    • 2010
  • The noise component of electrocardiogram is not distributed in a certain frequency band. It is expressed in various types of signals by rater's physical and environmental conditions. Particularly, since the baseline wander is occurred by the mixture of the original signal and 0 ~ 2 [Hz] range of the frequency components according to muscle constraction of part attaching to an electrode and respiration rythm, it makes the ECG signal analysis difficult. Several methods have been proposed to eliminate the wandering effectually. However, they have some problems. In some methods, the high processing time is required due to the computational complexity, while in other cases ECG signal morphology can be distorted. This paper suggests a simple and effective algorithm that eliminates baseline wander with low computational complexity and without distorting signal morphology. First, the algorithm detects and segments a baseline wandering interval using slope analysis and curve point detection, second, approximates the wandering in the interval as a sinusoid, and then subtracts the sinusoid from signal. Finally, ecg signals without baseline wander are obtained. In order to evaluate the performance of the algorithm, several ECG signals with baseline wandering in MIT/BIH ECG database 101, 111, 113, 234 record were chosen and applied to the algorithm. It is found that the algorithm removes baseline wanders effectively without significant morphological distortion.

Design and Implementation of High-dimensional Index Structure for the support of Concurrency Control (필터링에 기반한 고차원 색인구조의 동시성 제어기법의 설계 및 구현)

  • Lee, Yong-Ju;Chang, Jae-Woo;Kim, Hang-Young;Kim, Myung-Joon
    • The KIPS Transactions:PartD
    • /
    • v.10D no.1
    • /
    • pp.1-12
    • /
    • 2003
  • Recently, there have been many indexing schemes for multimedia data such as image, video data. But recent database applications, for example data mining and multimedia database, are required to support multi-user environment. In order for indexing schemes to be useful in multi-user environment, a concurrency control algorithm is required to handle it. So we propose a concurrency control algorithm that can be applied to CBF (cell-based filtering method), which uses the signature of the cell for alleviating the dimensional curse problem. In addition, we extend the SHORE storage system of Wisconsin university in order to handle high-dimensional data. This extended SHORE storage system provides conventional storage manager functions, guarantees the integrity of high-dimensional data and is flexible to the large scale of feature vectors for preventing the usage of large main memory. Finally, we implement the web-based image retrieval system by using the extended SHORE storage system. The key feature of this system is platform-independent access to the high-dimensional data as well as functionality of efficient content-based queries. Lastly. We evaluate an average response time of point query, range query and k-nearest query in terms of the number of threads.

A Study on the Characteristics and Considerations of Bibliographic Description of ISBD Consolidated edition 2011 (ISBD 통합판의 서지기술 특징 및 고려사항에 관한 연구)

  • Lee, Mihwa
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.46 no.4
    • /
    • pp.169-188
    • /
    • 2012
  • This study aimed to analyze the characteristics of the bibliographic description of ISBD Consolidated edition published in 2011 and to grasp the considerations in applying the rules of ISBD consolidated edition to KCR4. For achieving this, the four aspects were analyzed such as the description area and data elements, the resource types, the punctuation, and the order of the elements of ISBD Consolidated edition(2011). The characteristics of ISBD Consolidated edition are as follows. First, the content form and the media type area are added in new 0 area and elements are designated by mandatory to confirm to FRBR. Second, content form, content qualification and media type replaced GMD in title and statement of responsibility area. Third, the prescribed punctuations were retained even when this results in double punctuation, and individual square brackets were preferred than entire square brackets when using square brackets to all elements in same area. Fourth, the order of elements in description was set out by patterns of data elements in areas, therefore could reduce the confusion of the order of elements. ISBD Consolidated edition as an international standard would make various rules to maintain the uniformity, but also respects the bibliographic practices of individual countries. Therefore, each country must revise its own rule to conform the ISBD Consolidated edition as well as reflect its unique situation. In Korea, since KCR4 was developed based on the previous edition of ISBD, it should be revised to confirm to the ISBD Consolidated edition. Therefore, this study is expected to contribute to the revision of KCR4.