• Title/Summary/Keyword: 데이터 선별

Search Result 583, Processing Time 0.031 seconds

A Preliminary Study on Extending OAK Metadata for Research Data (연구데이터 관리를 위한 OAK 메타데이터 확장 방안 연구)

  • Lee, Mihwa;Lee, Eun-Ju;Rho, Jee-Hyun
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.3
    • /
    • pp.27-51
    • /
    • 2020
  • This study aims to propose an extended OAK metadata for research data that would be described in OAK, an open access repository of the National Library of Korea. As a research method, literature review, case studies, and interviews with related parties were conducted. The method of extending the existing OAK metadata for research data was derived as follows. First, in modeling for research data, the structure of the collection> item> file is maintained, the collection is placed as a higher group to which the research data can be grouped, and item was combined metadata and files or digital objects of various formats together. Second, by mapping the metadata standard and case organizations with the existing OAK metadata, elements judged to need to be extended to OAK for research data were selected and reflected in the existing OAK. Third, the controlled vocabulary and syntax are also proposed so that it can be used for search or later statistics through structured data. By expanding the OAK metadata to describe research data, research data produced in Korea can be officially stored and used, which is the basis for preventing duplication of research and sharing and recycling research results nationally.

A Study on the Improvement of the Legal System for the Promotion of Opening and Utilization of Open Government Data - Focusing on cases of refusal to provide - (공공데이터의 개방·활용 촉진을 위한 법제도 개선방안 연구 - 공공데이터 제공거부 사례를 중심으로 -)

  • Kim Eun-Seon
    • Informatization Policy
    • /
    • v.30 no.2
    • /
    • pp.46-67
    • /
    • 2023
  • There are criticisms that, despite the proactive government policy on open government data (hereinafter "open data"), certain highly demanded data remains restricted due to legal constraints. In this study, we aim to analyze the factors that limit the opening and utilization of open data, focusing on cases wherein requests for open data provision have been denied. We will explore possible approaches that are in harmony with the Open Data Law while examining the constitutional value of open data, considering the foundational Open Data Charter that underpins the government's data policy. We will also examine cases wherein requests for data provision have been denied for institutional reasons, with nearly half of these cases involving open data that includes personal information. It is necessary to explore the potential for improvement in these cases. Furthermore, considering the recent amendment to the Personal Information Protection Act, which allows for the processing of pseudonymous information without the consent of the data subject for limited purposes, it is an opportune time to consider the need for amending the Open Data Law to facilitate broader access and utilization of open data for the nation. Lastly, we will propose institutional improvement directions aligned with the opening and utilization of open data by examining the constraints of and need for improvement in the selected target laws.

Selective Cache Consistency Scheme to Enlarge Autonomy of Mobile Host in Mobile Computing Environments (이동 컴퓨팅 환경에서 이동 호스트의 자치성 증대를 위한 선택적 캐쉬 일관성 유지 기법)

  • Kim, Hee-Sook;Hwang, Byung-Yeon
    • The KIPS Transactions:PartD
    • /
    • v.10D no.4
    • /
    • pp.655-660
    • /
    • 2003
  • The cache used by mobile host is an important device that recovers the weak points of limited power and bandwidth, in mobile computing environments. However, it has to stand and maintain the consistency with the server data. In this paper, we propose a 'Selective Cache Consistency Scheme'. The server allows an effective broadcasting by selecting data of high usability using 'Cache State Table' and 'Data Access Table'. Moreover, this scheme prevents the loss of data that nay occur by a long period of disconnection, by asynchronous broadcasting and transmitting those broadcast data preserved in the server. This also allows user to possess the latest data. Through experiments, we have found that the enlargement of autonomy is possible by reducing the dependence of server.

Mission Task & Workload Analysis of Armed Helicopter (무장헬기 임무절차 수립 및 임무하중 분석 연구)

  • Park, Hyojin;Lee, Jinwoo;Lee, Minwoo;Park, Sang C.;Kwon, Yongjin;Lee, Jonghoon
    • Journal of the Korea Society for Simulation
    • /
    • v.21 no.4
    • /
    • pp.25-33
    • /
    • 2012
  • Armed helicopter is an integral part of armed forces, which conducts vital missions, such as anti-armor attack, close air support, escorting air assault operations, and reconnaissance. A typical cockpit arrangement of armed helicopters has been a tandem configuration. This is to reduce the frontal area, which in turn increases the forward speed as well as reduces the chance of being hit by enemy fires. However, many armed helicopters in the world are now being developed as a side-by-side configuration. Such configuration is quite different from the conventional cockpit arrangement in light of the crew communications and situational awareness. Therefore, the main objective of this study is to find the optimized combination of mission tasks among pilots in a side-by-side configuration cockpit by measuring the workload using the NASA Task Load Index method. The experimental results indicate that the workload of crew members differ as disparate tasks are being performed.

Ontology-based Customized Health Management Service for Metabolic Syndrome Patients (대사 증후군 환자들을 위한 온톨로지 기반 맞춤형 건강관리 서비스)

  • Lee, Byung-Mun;Lee, Young-Ho;Yu, Ki-Min;Park, Ji-Yoon;Kang, Un-Gu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.1
    • /
    • pp.41-52
    • /
    • 2012
  • According to 2005 Korea National Health and Nutrition Survey, it has been reported that 32.9% men and 31.8% women have Metabolic syndrome among the population of age 30 and over. The importance of prevention and management is being emphasized in Metabolic syndrome which is a complex disease related to various generic and environmental factors like other chronical disease. In this study we suggest an service based on the data using the system architecture, ontology and Jena2.0 inference engine and organizing the disease-related guideline. The study also arrives at the result through proper interpretation and reasoning process using health management service model based on ontology. The accuracy according to the situation was tested and 930 data samples were selected and experimented. We drew a conclusion that the much personalized data is available, the more personalized services are possible. Since the risk factors of Metabolic syndrome are various, it would be effective to suggest customized services based on various personalized data.

Design and Development of MIMIC regarding Telemetry in LEO Satellites (저궤도 관측위성에서의 원격 측정 데이터 관련 MIMIC 설계 및 구현)

  • Huh, Yun-Goo;Kim, Young-Yun;Cho, Seung-Won;Choi, Jong-Yeoun
    • Aerospace Engineering and Technology
    • /
    • v.11 no.1
    • /
    • pp.42-48
    • /
    • 2012
  • The telemetry data received from satellite in real-time are used to monitor LEO satellite during the AIT (Assembly, Integration & Test) phase and the mission operation phase after launch. However, it is impossible to check all the incoming telemetry data from satellite in real time in order to detect abnormality of satellite quickly. Especially, the contact time of LEO satellite is limited because of its orbital characteristics. So the anomaly state of the LEO satellite should be detected and resolved during the contact time. Therefore, all incoming spacecraft telemetry data must be selected and manipulated in MIMIC. It is used in order to display summarized information about spacecraft in a visualized way that is quickly and easily understood. That is, it provides essential function to monitor a satellite both in orbit and during testing. In this paper, the design and development of MIMIC currently used in KOMPSAT, a LEO Earth observation satellite is described in detail. In future work, we plan to enhance MIMIC in order to improve user-friendliness and efficiency.

A Cache Management Technique for an Efficient Video Proxy Server (효율적인 비디오 프록시 서버를 위한 캐시 관리 방법)

  • Lee, Jun-Pyo;Park, Sung-Han
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.4
    • /
    • pp.82-88
    • /
    • 2009
  • Video proxy server which is located near clients can store the frequently requested video data in storage space in order to minimize initial latency and network traffic significantly. However, due to the limited storage space in video proxy server, an appropriate video selection method is needed to store the videos which are frequently requested by users. Thus, we present a virtual caching technique to efficiently store the video in video proxy server. For this purpose, we employ a virtual memory in video poky server. If the video is requested by user, it is loaded in virtual memory first and then, delivered to the user. A video which is loaded in virtual memory is deleted or moved into the storage space of video poxy sewer depending on the request condition. In addition, virtual memory is divided into each segment area in order to store the segments efficiently and to avoid the fragmentation. The simulation results show that the proposed method performs better than other methods in terms of the block hit rate and the number of block deletion.

Classification of Ovarian Cancer Microarray Data based on Intelligent Systems with Marker gene (선별 시스템 기반 표지 유전자를 포함한 난소암 마이크로어레이 데이터 분류)

  • Park, Su-Young;Jung, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.3
    • /
    • pp.747-752
    • /
    • 2011
  • Microarray classification typically possesses two striking attributes: (1) classifier design and error estimation are based on remarkably small samples and (2) cross-validation error estimation is employed in the majority of the papers. A Microarray data of ovarian cancer consists of the expressions of thens of thousands of genes, and there is no systematic procedure to analyze this information instantaneously. In this paper, gene markers are selected by ranking genes according to statistics, popular classification rules - linear discriminant analysis, k-nearest-neighbor and decision trees - has been performed comparing classification accuracy of data selecting gene markers and not selecting gene markers. The Result that apply linear classification analysis at Microarray data set including marker gene that are selected using ANOVA method represent the highest classification accuracy of 97.78% and the lowest prediction error estimate.

Comparison and analysis of multiple testing methods for microarray gene expression data (유전자 발현 데이터에 대한 다중검정법 비교 및 분석)

  • Seo, Sumin;Kim, Tae Houn;Kim, Jaehee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.971-986
    • /
    • 2014
  • When thousands of hypotheses are tested simultaneously, the probability of rejecting any true hypotheses increases, and large multiplicity problems are generated. To solve these problems, researchers have proposed different approaches to multiple testing methods, considering family-wise error rate (FWER), false discovery rate (FDR) or false nondiscovery rate (FNR) as a type I error and some test statistics. In this article, we discuss Bonferroni (1960), Holm (1979), Benjamini and Hochberg (1995) and Benjamini and Yekutieli (2001) procedures based on T statistics, modified T statistics or local-pooled-error (LPE) statistics. We also consider Sun and Cai (2007) procedure based on Z statistics. These procedures are compared in the simulation and applied to Arabidopsis microarray gene expression data to identify differentially expressed genes.

Development of Hydrometeorological Information and Application Technology for Monitoring Water Resources in North Korea (북한지역 수자원 감시예측을 위한 수문기상정보 활용기술개발)

  • Kim, Ji-in;Lee, Sungjin;Kang, Jaewon;Kim, Gyumum;Suh, Ae-sook
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.531-535
    • /
    • 2015
  • 본 연구에서는 한반도 관측 공백지역인 북한지역에 대하여 레이더와 위성 원격탐사자료를 활용하여 강수량과 토양수분 등 수문기상정보를 생산 및 검증하고 효율적인 수문 모니터링 및 수문 기상 재해 감시와 평가 방안을 수립하고자 한다. 또한, 북한지역의 수문 기상 정보 수집 및 통합 DB를 마련하고 북한 수문기상 포털시스템을 구축함으로써 부처 간 자료를 공유할 수 있는 매개체를 마련하여 일관된 정책 수립과 효율적인 물관리를 도모하고자 한다. WPMM(Window Probability Matching Method)방법을 기반으로 구성된 RAD-RAR(Rain rate system) 산정 알고리즘(Rosenfeld et al., 1993)을 활용하여 산출된 합성 강우장 데이터의 정확성을 비교 분석하기 위해 접경지역 AWS 강수량과 세계기상통신망(GTS)기반 강수량을 산출하여 각각 레이더 강수량과 검증분석을 실시하였다. 연구기간은 2012년과 2013년 여름철 기간 중 5개의 기간을 선별하였다. 연구 기간 동안의 RAR 합성 강우장 데이터를 이용하여, 기간 중 1시간 동안 누적된 강수량을 산출하고 접경지역 AWS 강수량과 비교하였고 12시간 누적 강수량을 산출하여 GTS 강수량과 비교 분석을 실시하였다. 전반적으로 레이더 강수량에 비해 AWS 강수량이 더 높게 나타났으며 마찬가지로 레이더 강수량과 GTS 강수량의 비를 통해 레이더 자료가 상대적으로 과소추정되고 있음을 확인 할 수 있었다. 미항공우주국(NASA)과 일본항공우주국(JAXA)을 중심으로 진행된 GPM(Global Precipitation Measurement)미션은 한 개의 핵심위성과 마이크로파 복사계를 탑재한 10여개의 보조위성으로 구성되어 있으며, 매 3시간 간격의 전구 강수량 자료 생산에 목적이 있다. 이는 홈페이지를 통해 Level 1, 2, 3의 GPM 데이터를 배포하고 있다. 특히 Level 2 데이터는 언급된 3시간 간격의 전구 강수량 데이터를 제공한다. 이 경우 복사량을 강수량으로 변환하는 번거로움을 덜 수 있으며 NASA가 제공하는 Panoply라는 프로그램을 이용하여 한반도 강수 자료 가시화가 가능하다.

  • PDF