Search | Korea Science

Lessons from Data Repository GDR (Geoscience Data Repository) Building Experience (데이터 리포지토리 GDR 구축 경험과 교훈)

Han, JongGyu
- Proceedings of the Korean Society for Information Management Conference
- /
- 2017.08a
- /
- pp.100-100
- /
- 2017
100년의 역사를 지닌 한국지질자원연구원(KIGAM)은 국내 유일의 지질자원 전문연구기관으로서 그간 생산한 조사 연구데이터는 우리나라 과학기술의 귀중한 역사적 학술적 가치가 큰 유산으로써 보존 가치가 매우 크다고 할 수 있다. 하지만 현재 KIGAM의 상황은 최종성과물 위주로 자료관리가 이루어지고 있으며, 조사 연구 과정에서 생산된 암석 토양 지하수샘플이나 조사 탐사장비를 통해 얻어지는 자료는 연구자 또는 연구실 팀에서 개별적으로 관리하고 있다. 이러한 자료관리체계는 자료의 공동 활용이 어렵고, 자료를 보유하고 있는 연구자의 퇴직이나, 조직개편으로 인한 팀 실의 분리 과정에서 자료의 손실과 훼손 가능성이 높고, 누가 어디에 어떤 자료를 무슨 형태로 보관하고 있는지 찾기 어려워 자료의 재활용도가 떨어질 뿐만 아니라, 이로 인한 중복 조사 연구 가능성도 배제할 수 없다. KIGAM은 지질자원분야 국가데이터센터 구축을 목표로 연구과정에서 생산되는 연구데이터의 체계적인 관리와 공유, 활용체계 구축을 위해 2015년도에 기획사업을 통해 중장기 로드맵을 포함한 추진전략을 수립하였으며, 한국과학기술정보연구원(KISTI)의 DataNest를 기술이전받아 지질자원 연구데이터 리포지토리 시스템(GDR: Geoscience Data Repository)를 개발하였다. GDR 시스템 개발을 위해 연구데이터 분류코드를 작성하였으며, 2016년부터 데이터관리계획(DMP: Data Management Plan)을 주요사업 연구계획서 양식에 포함시켜 제출하도록 하였다. 과거 KIGAM은 연구데이터를 수집, 관리하기 위해 몇 차례에 걸쳐 시도를 했지만 실패한 경험을 가지고 있다. 실패 요인에는 (1) 관련 정책, 제도, 조직, 인력, 예산 등 데이터 관리 인프라 부재, (2) 연구사업에서 생산된 데이터는 개인소유라는 인식 및 공유 의식 부족, (3) 데이터 관리 활동은 귀찮은 것이고, 시간 낭비라는 인식, (4) 데이터 관리 공개 공유 활동에 대한 보상체계 부재 등을 꼽을 수 있다. 즉, 제도를 포함한 인프라 부족과 경영진과 구성원의 인식부족이 제일 큰 원인으로 판단된다. 성공적인 연구데이터 관리를 위해서는 지속적이고 꾸준한 투자가 이루어져야 하지만 경영진의 의지에 따라 사업이 중단되기도 한다. 이러한 과거의 실패 요인에 대한 해결 없이 지난 1년 6개월 정도의 GDR 운영은 지지부진하였다. 이러한 문제점을 해결하기 위해서는 국가차원의 제도적 뒷받침이 따라야 한다. 즉 국가 R&D 성과물 관리차원에서 연구데이터를 주요 성과물로 관리해야 할 것으로 판단된다. 연구사업계획서에 DMP를 포함시키고, 연구주제 및 분야별로 데이터센터(혹은 데이터 리포지토리)를 지정하고, 국가 R&D에서 생산되는 연구데이터를 의무적으로 제출하도록 하는 것이다. 또한 데이터센터의 안정적이고 지속적인 운영을 위해 연구사업비 항목에 데이터 관리비를 신설하여 데이터센터의 운영비로 사용하도록 하면 예산문제도 어느 정도 해결 될 수 있을 것으로 본다. 또한 데이터 제출 및 인용도에 따라 데이터 생산부서 혹은 생산자에게 평가점수를 부여하는 등 보상체계 마련을 위한 연구도 필요할 것으로 보인다. 국가 R&D 연구데이터의 수집, 관리, 공유, 활용을 제대로 성공시키려면 국가 R&D 최고정책결정자의 지속적인 관심과 지원이 필수적이다.
PDF

A Study on Management of Scientific Technology Reference Data that is Produced during Research (연구중에 생산된 과학기술 참조데이터 관리에 관한 연구)

Chae, Kyun-Shik;Lee, Eung-Bong
- Journal of the Korean Society for Library and Information Science
- /
- v.37 no.4
- /
- pp.131-149
- /
- 2003
Scientific/technical data whose reliability is authorized have to include physical/chemical constants, authorized property value or authorized scientific statistical values so that the scientists in the same field use them continuously and repeatedly for their research. The data also have to be managed as Standard Reference Data that will be used in other area nationwide by analyzing, evaluating and authorizing the accuracy and reliability of the information with scientific methods. This study suggests a plan for establishing the National Center for Standard Reference Data as well as a necessity for managing the data produced in research process as a national Standard Reference Data. Also, a prototype of research information system for measurement standards is developed and described in detail for research information for measurement standard to be used as a reference data, then a plan to managing national reference data is suggested, in addition.
https://doi.org/10.4275/KSLIS.2003.37.4.131 인용 PDF

Web-based drone image data mangement system (웹 기반 드론 이미지 데이터 관리 시스템)

Sung-Jun Kim;Joon Woo;Tae-Young Hong
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2024.01a
- /
- pp.349-350
- /
- 2024
최근 몇 년간 드론 시장 규모 및 산업적 중요성이 증가하고 있으며, 농업, 산림, 건설, 환경 등 다양한 분야에서 활용되고 있다. 이러한 배경에서 다양한 발주처에서 여러 가지 목적으로 드론 서비스 기업과 협력하여 드론 이미지 데이터를 획득하는 사업을 진행하고 있다. 이 과정에서 발주처와 기업 간에 촬영 계획 협의 및 촬영 데이터 검수 등과 같은 상호 교류가 필요하며 원활한 정보 공유를 위한 시스템이 필요하다. 본 고에서는 드론을 활용한 데이터 획득 과정에 필요한 촬영 계획 수립 및 검토, 촬영 데이터의 검수 등을 위한 웹 기반 드론 이미지 관리 시스템을 개발하고 이에 대하여 설명한다. 본 고에서 개발된 시스템을 활용하면 드론 서비스 기업은 다양한 발주처와 협업시에 효과적인 정보 공유를 할 수 있을 것이다.
PDF

An Identification on Big Data Application Fields by Utilizing Journal Bibliographic Coupling Analysis (서지결합분석을 통한 빅데이터 활용 분야 연구)

Lee, Boram
- Proceedings of the Korean Society for Information Management Conference
- /
- 2016.08a
- /
- pp.19-22
- /
- 2016
본 연구는 빅데이터의 처리 저장 등과 같은 기술적 측면이 아닌 분석 활용적 측면에 초점을 맞춰 관련 학문분야를 파악하고 분야 간 지적구조를 규명하고자 하였다. 연구 결과 빅데이터 관련 연구들이 주제분야에 따라 명백한 차이를 보이고 있음을 확인할 수 있었다. 주제범주 분석을 통해 공학 기술(34.60%), 사회과학(25.24%), 자연과학(23.14%), 의학 보건학(14.85%) 등은 관련 연구가 비교적 고르게 분포되어 있지만, 인문학(1.69%)과 농업과학(0.21%)은 연구가 미비함을 알 수 있었다. 네트워크 분석 결과 사회과학 분야(31.58%)에 비해 공학 및 자연과학 분야(68.42%)의 빅데이터 연구가 더 활발함을 확인할 수 있었다. 또한 공학 및 자연과학 분야 연구들은 다양한 주제분야를 다루는 반면 사회과학 분야에서는 아직 한정된 주제분야에서 연구가 진행되고 있음을 알 수 있었다.
PDF

SM-GUI: A Grid-based XML Schema Management GUI (SM-GUI: 그리드 기반의 XML 스키마 관리를 위한 그래픽 유저 인터페이스)

Jiang, Mei-Lan;Jeong, Karp-Joo;Lim, Sang-Boem;Kim, Dong-Kwang
- Proceedings of the Korean Information Science Society Conference
- /
- 2007.10b
- /
- pp.345-349
- /
- 2007
과학연구 분야에서는 부단히 새로운 연구 과제를 제기되고 있으며 이에 대한 연구조건 및 연구방법을 다각도로 변화시키면서 보다 효과적인 연구결과를 얻기 위한 노력을 거듭하고 있다. 이와 같은 연구활동이 활발히 진행됨에 따라 급증하는 데이터를 효율적으로 관리할 필요성이 증가하였으며 다양한 데이터 저장소에 저장되어 있는 다량의 데이터를 통합관리하기 위한 연구가 계속되어 왔다. 이런 연구를 진행함에 있어서 데이터를 효율적으로 관리하고 데이터 구조, 형태의 변화에 유연하게 대처하기 위해서는 데이터모델 관리가 필요하다. 본 논문에서는 사용자로 하여금 다양한 저장소의 데이터 모델을 관리하고 이로부터 동적으로 데이터를 생성하여 통합 관리할 수 있는 XML Schema 관리 툴을 제공한다.
PDF

Study on the data management and service trends in the biobank (바이오뱅크 분야의 연구데이터 관리 및 서비스 동향에 관한 연구)

Park, Mi-Young;Ahn, In-Ja;Ahn, Kyung-Sook
- Proceedings of the Korean Society for Information Management Conference
- /
- 2017.08a
- /
- pp.98-98
- /
- 2017
본 연구의 목적은 바이오뱅크(생명연구자원 소재은행)분야의 연구데이터 관리 및 서비스 동향을 고찰하고 시사점을 도출하는데 있다. 바이오뱅크 분야의 연구데이터 관리 및 서비스와 관련하여 해외 선진 사례를 프로그램과 서비스 모델측면에서 조사하였다. 조사된 프로그램과 서비스 모델은 ATCC(American Type Culture Collection), BIobank Data Sharing (MIABIS 2.0 Core), WDCM(World Data Center for Microorganism/Word Federation for Culture Collection)이다. 각각의 프로그램과 서비스 모델에서 시사점을 도출하였으며, 본 연구에서는 이를 기반으로 국내 생명연구자원 바이오뱅크 분야의 연구데이터 관리 및 서비스를 위한 시사점을 제시하였다.
PDF

A Case Study on Metadata Extractionfor Records Management Using ChatGPT (챗GPT를 활용한 기록관리 메타데이터 추출 사례연구)

Minji Kim;Sunghee Kang;Hae-young Rieh
- Journal of Korean Society of Archives and Records Management
- /
- v.24 no.2
- /
- pp.89-112
- /
- 2024
Metadata is a crucial component of record management, playing a vital role in properly managing and understanding the record. In cases where automatic metadata assignment is not feasible, manual input by records professionals becomes necessary. This study aims to alleviate the challenges associated with manual entry by proposing a method that harnesses ChatGPT technology for extracting records management metadata elements. To employ ChatGPT technology, a Python program utilizing the LangChain library was developed. This program was designed to analyze PDF documents and extract metadata from records through questions, both with a locally installed instance of ChatGPT and the ChatGPT online service. Multiple PDF documents were subjected to this process to test the effectiveness of metadata extraction. The results revealed that while using LangChain with ChatGPT-3.5 turbo provided a secure environment, it exhibited some limitations in accurately retrieving metadata elements. Conversely, the ChatGPT-4 online service yielded relatively accurate results despite being unable to handle sensitive documents for security reasons. This exploration underscores the potential of utilizing ChatGPT technology to extract metadata in records management. With advancements in ChatGPT-related technologies, safer and more accurate results are expected to be achieved. Leveraging these advantages can significantly enhance the efficiency and productivity of tasks associated with managing records and metadata in archives.
https://doi.org/10.14404/JKSARM.2024.24.2.089 인용 PDF

Design of Standard Metadata Schema for Computing Resource Management (컴퓨팅 리소스 관리를 위한 표준 메타데이터 스키마 설계)

Lee, Mikyoung;Cho, Minhee;Song, Sa-Kwang;Yim, Hyung-Jun
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.10a
- /
- pp.433-435
- /
- 2022
In this paper, we introduce a computing resource standard metadata schema design plan for registering, retrieving, and managing computing resources used for research data analysis and utilization in the Korea Research Data Commons(KRDC). KRDC is a joint utilization system of research data and computing resources to maximize the sharing and utilization of research data. Computing resources refer to all resources in the computing environment, such as analysis infrastructure and analysis software, necessary to analyze and utilize research data used in the entire research process. The standard metadata schema for KRDC computing resource management is designed by considering common attributes for computing resource management and other attributes according to each computing resource feature. The standard metadata schema for computing resource management consists of a computing resource metadata schema and a computing resource provider metadata schema. In addition, the metadata schema of computing resources and providers was designed as a service schema and a system schema group according to their characteristics. The standard metadata schema designed in this paper is used for computing resource registration, retrieval, management, and workflow services for computing resource providers and computing resource users through the KRDC web service, and is designed in a scalable form for various computing resource links.
PDF

A Study on the Intellectual Structure of Data Science Using Co-Word Analysis (동시출현단어분석을 통한 데이터과학 분야의 지적구조에 관한 연구)

Kim, Hyunjung
- Journal of the Korean Society for information Management
- /
- v.34 no.4
- /
- pp.101-126
- /
- 2017
Data Science is emerging as a closely related field of study to Library and Information Science (LIS), and as an interdisciplinary subject combining LIS, statistics and computer science in an attempt to understand the value of data by applying what LIS has been doing for collecting, storing, organizing, analyzing, and utilizing information. To investigate which subject fields other than LIS, statistics, and computer science are related to Data Science, this study retrieved 667 materials from Web of Science Core Collection, extracted terms representing Web of Science Categories, examined subject fields that are studying Data Science using descriptive analysis, analyzed the intellectual structure of the field by co-word analysis and network analysis, and visualized the results as a Pathfinder network with clustering created with the PNNC clustering algorithm. The result of this study might help to understand the intellectual structure of the Data Science field, and may be helpful to give an idea for developing relatively new curriculum.
https://doi.org/10.3743/KOSIM.2017.34.4.101 인용 PDF KSCI

후기 유전체 시대의 생명정보학과 데이터베이스의 연구방향

Ryu, Geun-Ho
- Journal of Scientific & Technological Knowledge Infrastructure
- /
- s.12
- /
- pp.108-117
- /
- 2003
대용량의 생명정보 데이터의 분석과 새로운 지식 추출을 지원하기 위해 생명정보 데이터의 저장과 관리를 책임지는 데이터베이스의 역할은 생명정보학 분야에서 더욱 중요해 지고 있다. 그러나 최근 새롭게 출현한 생명정보 데이터의 저장 관리는 생명 정보 데이터의 고유한 특성과 이러한 데이터의 특성을 이해하기 위한 생물학적 지식의 요구로 인해 많은 문제점과 어려움이 발생되고 있다. 따라서 이 원고에서는 생명정보 데이터를 관리하기 위해 데이터베이스 분야의 최근 연구 현황과 연구방향을 제시한다.
PDF

Search Result 1,065, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)