• Title/Summary/Keyword: dataset records

Search Result 98, Processing Time 0.021 seconds

A Study of the Transition Process in Presidential Electronic Records Transfer and Improvement Measures : Focused on the Electronic Records of the 19th President Moon Jae-in's Administration (대통령 전자기록물의 이관방식 변천과 개선방안 연구 19대 문재인 정부 대통령 전자기록물을 중심으로 )

  • Yun, Jeonghun
    • The Korean Journal of Archival Studies
    • /
    • no.75
    • /
    • pp.41-89
    • /
    • 2023
  • Since the enactment of the Act on the Management of Presidential Archives in 2007, the cases of electronic records transfer in the 16th President Roh Moo-hyun's administration have played the role of an advance guard in managing public records and served as a test bed for new electronic records management. When transferring the electronic records of the 19th President Moon Jae-in's administration, the electronic records transfer method of President Roh's administration was inherited, while several innovative attempts were made. For instance, the Presidential Archives have for the first time converted the electronic documents from institutions advising the President into a long-term preservation package and transferred them online. In addition, considering the characteristics of the data, the administrative information dataset of the Presidential record creation institutions was transferred to the SIARD standard. Furthermore, the Presidential Archives had websites transferred in the form of OVF as a pilot test and collected social media directly through the API. Thus this study investigated the transition process of the presidential electronic records transfers from the 16th President Roh Moo-hyun's administration to the 19th President Moon Jae-in's. In addition, major achievements and issues were analyzed centering on the transfer method by type of electronic records during President Moon Jae-in's administration, and future improvement plans were presented.

An Exploratory Study for Utilization of Copyrighted Public Records and Provision of Customer-Centered Services (공공저작물 활용 및 수요자 중심의 서비스 제공을 위한 탐색적 연구 : 공공저작물 제공사이트를 중심으로)

  • Ryu, Me Ae;Ahn, Tae Ho
    • Journal of Information Technology Services
    • /
    • v.15 no.3
    • /
    • pp.223-245
    • /
    • 2016
  • This study defines copyrighted public records in broad sense including open government data and public domain except for some private records. Additionally, this study aims to investigate improvement plan for maximizing utilization of copyrighted public records in web-sites using customer side, without consideration of supplier side. For this purpose, qualitative study method was used with grounded theory on analyzed problems from literature review and case study. Literature review was concentrated on definition of open data and abroad utilization indicators whereas case study analyzed current situation of four web-sites providing copyrighted public records. Converged opinions from in-depth interview and various statistical data was analyzed as a basis for grounded theory, then a paradigm model was constructed and future improvement plans were presented. The findings imply that opening of copyrighted public records is not just important for quantitative results, rather it requires qualitative improvement providing latest credible information that is consistent with the demand of the customer. Thus, development of service platform and business models for copyrighted public records are urgent task.

A Study on Significant Properties for Dataset Type Preservation Format (데이터세트 유형 전자기록의 필수보존속성 연구)

  • Jung-eun Lee;Dongmin Yang
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.34 no.4
    • /
    • pp.259-283
    • /
    • 2023
  • This study acknowledges that prevailing regulation concerning for the long-term preservation of electronic records focus mainly on document types, neglecting the preservation of electronic records from various administrative information systems. With the growing interest in data management in the era of big data, it is imperative to establish clear standards for the long-term preservation of datasets. The choice of preservation format for electronic records is based on the specific standards for each type of electronic record. These standards are formulated according to the significant properties relevant to the electronic record type. This study aims to identify the significant properties of electronic records of each record type, before creating specific preservation format selection criteria for these record types. To achieve this, we reviewed and analyzed R&D studies by the National Archives of Korea and the NARA in the United States. As a result of the research, 9 significant properties were identified for database-type entities, and 7 significant properties were identified for structured data-type entities.

A Study of Redesigning Electronic Records Management Policies (전자기록관리정책의 재설계에 관한 연구)

  • Lee, Seung-eok;Seol, Moon-won
    • The Korean Journal of Archival Studies
    • /
    • no.52
    • /
    • pp.5-37
    • /
    • 2017
  • In consideration of the drastic transformation of records management environments, this study aims to suggest the directions for redesigning the electronic records management policies at a national level. First, it clarifies the four implicit objectives of electronic records management policies since the 2006 amendment of the Public Records Management Act, such as comprehensiveness for ensuring the appropriate management of any type of digital records, digital-friendly processes for records management, proper management for guaranteeing the evidential value of digital records, and long-term preservation of digital records. Second, it examines the challenging environmental factors in the areas since 2006. Third, it reviews the achievement of the policies as well as failures based on analyzing the policy documents and data from the National Archives of Korea. Fourth and finally, it suggests core areas and directions for redesigning the electronic records management policies, emphasizing the inclusiveness for data-type electronic records.

Investigating Non-Laboratory Variables to Predict Diabetic and Prediabetic Patients from Electronic Medical Records Using Machine Learning

  • Mukhtar, Hamid;Al Azwari, Sana
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.9
    • /
    • pp.19-30
    • /
    • 2021
  • Diabetes Mellitus (DM) is one of common chronic diseases leading to severe health complications that may cause death. The disease influences individuals, community, and the government due to the continuous monitoring, lifelong commitment, and the cost of treatment. The World Health Organization (WHO) considers Saudi Arabia as one of the top 10 countries in diabetes prevalence across the world. Since most of the medical services are provided by the government, the cost of the treatment in terms of hospitals and clinical visits and lab tests represents a real burden due to the large scale of the disease. The ability to predict the diabetic status of a patient without the laboratory tests by performing screening based on some personal features can lessen the health and economic burden caused by diabetes alone. The goal of this paper is to investigate the prediction of diabetic and prediabetic patients by considering factors other than the laboratory tests, as required by physicians in general. With the data obtained from local hospitals, medical records were processed to obtain a dataset that classified patients into three classes: diabetic, prediabetic, and non-diabetic. After applying three machine learning algorithms, we established good performance for accuracy, precision, and recall of the models on the dataset. Further analysis was performed on the data to identify important non-laboratory variables related to the patients for diabetes classification. The importance of five variables (gender, physical activity level, hypertension, BMI, and age) from the person's basic health data were investigated to find their contribution to the state of a patient being diabetic, prediabetic or normal. Our analysis presented great agreement with the risk factors of diabetes and prediabetes stated by the American Diabetes Association (ADA) and other health institutions worldwide. We conclude that by performing class-specific analysis of the disease, important factors specific to Saudi population can be identified, whose management can result in controlling the disease. We also provide some recommendations learnt from this research.

Effects of Keeping Financial Records on Financial Soundness of Households (가계부 기록이 가계의 재무건전성에 미치는 영향)

  • Son, Jiyeon;Park, Jooyung
    • Journal of Families and Better Life
    • /
    • v.34 no.3
    • /
    • pp.113-128
    • /
    • 2016
  • The Purpose of this study is to find the levels of keeping financial records among Korean households and to reveal the effect of keeping financial records on financial soundness of households. The 2014 Consumer Empowerment Index of the Korean consumer agency, which includes the surveyed results of 1,000 individuals, was analyzed as a secondary dataset. As a result, the following findings emerged during the study. First, 25.9% of consumers replied that they were keeping financial records. Factors associated with keeping financial records were gender and income. Women were more likely to keep financial records than men. Also, income had significant effects on keeping financial records. Second, levels of meeting percentages of financial ratios were highest in the debt to income ratio, which was 81.5%, and lowest in the investment ratio, which was 14.5%. Furthermore, 52.6% met the savings ratio, 40.6% met the emergency funds ratio, 24.6% met the retirement savings ratio. Meeting a percentage of the savings ratio did not fluctuated for 16 years, although the debt to income ratio has decreased around 15% since 1998. Third, keeping a household account book had signigicant influences on meeting percentages of financial ratios. Magnitudes of effects ranged between 1.4-1.8 odds, which were as much as the income effects. In summary, effects of keeping financial records were evidenced in this study. It is suggested that the importance of keeping financial records should be stressed in financial education and counseling programs.

Toward Developing a Provenance Conceptual Model for Data-driven Electronic Records (데이터형 전자기록을 위한 출처 개념 모델 개발 방향)

  • Hyun, Moonsoo
    • The Korean Journal of Archival Studies
    • /
    • no.79
    • /
    • pp.305-341
    • /
    • 2024
  • This study explored the possibilities of a new approach to developing the provenance concept to electronic records in the data-driven digital environments by reviewing and adopting data provenance concepts and models. It then conducted basic literature review to develop a ground for a model representing the provenance of data-driven electronic records. In particular, it proposed to embrace to the concepts of retrospective and prospective provenance, and to develop a different model for representing provenance from records management metadata. If the model can be developed that can represent provenance independently while maintaining a dynamic relationship with records, it can be ensure the fluidity of records and even support to secure the record's attributes and play the roles of provenance. Eventually, it proposed the direction to develop the provenance model which can support the fixity of records, the reproducibility of activities, and the trustworthiness of representations. It is expected to be a fit provenance model in the data-driven digital environment.

KOMUChat: Korean Online Community Dialogue Dataset for AI Learning (KOMUChat : 인공지능 학습을 위한 온라인 커뮤니티 대화 데이터셋 연구)

  • YongSang Yoo;MinHwa Jung;SeungMin Lee;Min Song
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.219-240
    • /
    • 2023
  • Conversational AI which allows users to interact with satisfaction is a long-standing research topic. To develop conversational AI, it is necessary to build training data that reflects real conversations between people, but current Korean datasets are not in question-answer format or use honorifics, making it difficult for users to feel closeness. In this paper, we propose a conversation dataset (KOMUChat) consisting of 30,767 question-answer sentence pairs collected from online communities. The question-answer pairs were collected from post titles and first comments of love and relationship counsel boards used by men and women. In addition, we removed abuse records through automatic and manual cleansing to build high quality dataset. To verify the validity of KOMUChat, we compared and analyzed the result of generative language model learning KOMUChat and benchmark dataset. The results showed that our dataset outperformed the benchmark dataset in terms of answer appropriateness, user satisfaction, and fulfillment of conversational AI goals. The dataset is the largest open-source single turn text data presented so far and it has the significance of building a more friendly Korean dataset by reflecting the text styles of the online community.

A Study on the Established Requirements for Records through Precedent Analysis: Focusing on "Inter-Korean Summit Meeting Minutes Deletion" Cases (판례 분석을 통한 기록의 성립 요건 검토: '남북정상회담회의록 삭제' 판례를 중심으로)

  • Lee, Cheolhwan;Zoh, Youngsam
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.21 no.1
    • /
    • pp.41-56
    • /
    • 2021
  • This study aims to analyze the court ruling on "Inter-Korean Summit Meeting Minutes Deletion," identify how the established requirements, concept, and scope for the records prescribed in the Public Records Management Act are applied in actual cases, and summarize the future tasks. It analyzes the "approval theory" as the point of establishment for records by the ruling means and how the meaning of approval is determined, and examines the difference between the e-jiwon System and the On-Nara System to understand the meaning of ruling clearly. Moreover, it analyzes how the "Invalidity of Public Documents Crime" in Article 141 in the Criminal Act influences record management. Based on such comprehensive case analyses, the study proposes what tasks the administrative agencies such as the National Archives of Korea and the Ministry of the Interior and Safety should perform.

A Study on Developing a Provenance Conceptual Model for Data-driven Electronic Records Based on Extending W3C PROV (PROV의 확장에 기초한 데이터형 전자기록의 출처 모델 연구)

  • Hyun, Moonsoo
    • The Korean Journal of Archival Studies
    • /
    • no.80
    • /
    • pp.5-41
    • /
    • 2024
  • This study was conducted to develop a provenance representation model for data-type electronic records. It supports the distinction between provenance and context for the creation and management of data-type electronic records. To express both, it aims to design an extensible provenance model. For this purpose, W3C PROV is utilized as a basic model, with P-Plan and ProvONE for designing prospective provenance area. Afterward, the provenance model was extended by mapping the record management requirements. The provenance model proposed in this study is designed to represent and connect both retrospective and prospective provenance of data-type electronic records. Based on this study, it is expected to discussing the concept of provenance in the records management and archival studies area and to extending the model in the future.