DOI QR코드

DOI QR Code

데이터세트 보존포맷 검증방안에 관한 연구: 재난안전정보 데이터세트의 SIARD 적용을 통해

Empirical Verification of Conversion and Restoration of Preservation Format for Dataset: Application of Dataset with Disaster Safety Information to SIARD

  • 한희정 (전북대학교 문화융복합아카이빙 연구소) ;
  • 윤성호 (전북대학교 일반대학원 기록관리학과) ;
  • 오효정 (전북대학교 문헌정보학과, 문화융복합아카이빙 연구소) ;
  • 양동민 (전북대학교 일반대학원 기록관리학과, 문화융복합아카이빙 연구소)
  • 투고 : 2020.05.26
  • 심사 : 2020.06.22
  • 발행 : 2020.06.30

초록

정보의 활용이 국가 경쟁력의 핵심으로 부각되면서 우리 정부를 포함한 주요 선진국들은 데이터를 중요하게 인식하고 있으며, 이에 따라 장기보존 기술 연구 및 표준 제정 등을 추진하여 데이터의 체계적인 관리 및 보존을 위한 노력을 지속적으로 기울이고 있다. 그러나 현재 국내의 경우 다양한 유형의 데이터들에 대해 법령에는 기록관리 대상으로 명시하고 있지만, 이를 수집, 관리 및 보존하기 위한 구체적인 방법은 표준전자문서 이외에는 없는 상황이다. 특히, 행정정보시스템에서 생산되는 엄청난 규모의 데이터세트에 대한 관리 및 보존은 무엇보다 강하게 요구되어 왔으나 데이터세트에 대한 지침이 제대로 제공되고 있지 않고 있다. 보존포맷 선정체계가 마련되어야 시스템 보완 및 구축이 가능하기 때문에 우선적으로 데이터세트 특성을 고려한 보존포맷 선정 기준 체계가 보다 구체화 되어야 하며, 선정기준에 따라 도출된 데이터세트 보존포맷의 변환에 대한 실증적인 검증 작업이 필요하다. 이에 본 연구는 데이터세트의 특성을 고려한 보존포맷 선정 기준에 대한 평가체계를 도출하고, 보존포맷에 대한 실증적 검증을 통해 장기보존할 수 있는 방안을 제시하고자 한다.

As the use of information has emerged as the core of national competitiveness, major developed countries and the Korean government have realized the importance of data. They have pursued technical research and standard establishment for long-term preservation and continuously strived for systematic management and preservation of data. However, although various types of data are specified for the purpose of record management in the law, there is no specific method on how to collect, manage and preserve them, except standard electronic documents. In particular, management and preservation of huge datasets from the administrative information system have been strongly demanded above all. Any guidelines for datasets do not have been properly provided. After the framework for selecting preservation format must be prepared, the system can be supplemented and built. The framework considering the characteristics of the dataset should be specified more concretely, and empirical verification of the conversion and restoration for the dataset preservation format derived according to the selection criteria is necessary. Therefore, this study intends to propose a method for long-term preservation through empirical verification of the preservation format after deriving an evaluation the framework for the preservation format selection criteria considering the characteristics of the dataset.

키워드

참고문헌

  1. Kang, H. M. (2016). A study on the standardization of jpeg format as a long-term preservation master file for paper archives in the central archives of Korea. Journal of the Korean Library And Information Science Society, 47(4), 489-510. https://doi.org/10.16981/kliss.47.4.201612.489
  2. National Archives of Korea (2004). Electronic record permanent preservation based technology service. Daejeon: National Archives of Korea.
  3. National Archives of Korea (2013). A study on the reproduction technology and the prototype for the electronic records of administrative agency. Daejeon: National Archives of Korea.
  4. 국가기록원 (2017). 차세대 기록관리 모델 재설계 연구 개발 완료보고서. 대전: 국가기록원.
  5. Roh, J.-W., & So, J.-E. (2020). A study on the management plan for preservation and long-term use of datasets. Journal of D-Culture Archives, 3(1), 51-64.
  6. Park, J., & Lee, M. (2019). A study on the introduction of raw image file formats for the management of digital photographic records. Journal of Korean Society of Archives and Records Management, 19(3), 155-178. https://doi.org/10.14404/JKSARM.2019.19.3.155
  7. Seong, H. H. (2007). A study on document preservation format classified by the type for long-term preservation and use of electronic records. Master's thesis, Hankuk University of Foreign Studies. Seoul.
  8. So, J. E. (2019). A study on derivation critical factor for selection of dataset preservation format: Focus on dataset of relational database. Master's thesis, Jeonbuk National University of Graduate School. Jeonju.
  9. Song, C.-H., & Cha, H.-C. (2017). A study on the risk evaluation of electronic records for long-term preservation. Journal of The Korea Society of Computer and Information Winter Conference, 25(1), 29-30.
  10. Oh, S.-L., Park, S., & Yim, J. H. (2018). A case study of dataset records in information management system. Journal of Korean Society of Archives and Records Management, 18(2), 109-133. https://doi.org/10.14404/JKSARM.2018.18.2.109
  11. Oh, S.-L., & Rieh, H.-Y. (2019). Managing data set in administrative information systems as records. Journal of Korean Society of Archives and Records Management, 19(2), 51-76. https://doi.org/10.14404/JKSARM.2019.19.2.051
  12. Oh, S.-L., Jung, M. R., & Yim, J. H. (2016). Redesigning electronic records preservation formats based on open formats. Journal of Korean Society of Archives and Records Management, 16(4), 79-120. https://doi.org/10.14404/JKSARM.2016.16.4.079
  13. Wang, H.-S., & Seol, M.-W. (2017). A study on managing dataset records in government information systems. Journal of Korean Society of Archives and Records Management, 17(3), 23-47. https://doi.org/10.14404/JKSARM.2017.17.3.023
  14. Lim, N., & Nam, Y. (2019). A study on the criteria for digitization of records. Journal of the Korean BIBLIA Society for library and Information Science, 30(3), 5-30. https://doi.org/10.14699/kbiblia.2019.30.3.005
  15. Korea Minisry of Government (2017). Act on activation of data-based administration. Bill number 11077. Korea Ministry of Government Legislation. Retreived from http://www.lawmaking.go.kr
  16. Cha, H.-C., & Song, C.-H. (2019). A risk assessment method for the long-term preservation of electronic records. Journal of Korea Multimedia Society, 22(1), 79-87. https://doi.org/10.9717/kmms.2019.22.1.079
  17. Korea Society of Archival Studies (2008). Dictionary of records and archival terminology. Seoul: Yuksa Bipyung Sa.
  18. Han, H.-J., Oh, H.-J., & Yang, D. (2020). A study on the selection of preservation format for long-term preservation of electronic records. Journal of Korean Society of Archives and Records Management, 20(1), 69-87. https://doi.org/10.14404/JKSARM.2020.20.1.069
  19. Ministry of the Interior and Safety (2018). Source data of public sector, preservation is mandatory. Press release. 2018.09.19.
  20. Ministry of the Interior and Safety, National Information Society Agency (2019). Statistical report on public sector information resources based on the EA in 2019.
  21. Hyun, M. (2005). A study on the management of dataset as records. Journal of the Korean Association of Records Management, 5(2), 103-124. https://doi.org/10.14404/JKSARM.2005.5.2.103
  22. eCH-0165 (2018). SIARD format specification. Version 2.1
  23. eCH-0233 (2019). Archivierung elektronischer steuerdaten und -akten der kantone. Version 1.0
  24. Essen M. V., Rooij, M. D., Roberts, B., & Dobbelsteen, M. V. D. (2011). Database preservation case study: Review. National Archives of the Netherlands.
  25. Giaretta, D., Matthews, B., Bicarregui, J., Lambert, S., Guercio, M., Michetti, G., & Sawyer D. (2009). Significant properties, authenticity, provenance, representation information andOAIS Information. Paper presented at the iPRES 2009: the Sixth International Conferenceon Preservation of Digital Objects, San Francisco, California.https://escholarship.org/uc/item/0wf3j9cw
  26. Knight, G. (2008). Framework for the definition of significant properties. The National Archives, InSPECT Project Document.
  27. Lindely, A. (2013). Database preservation evaluation report -SIARD vs. CHRONOS Preserving complex structures as databases through a record centric approach?. International Conference on Preservation of Digital Objects (iPres), Lisbon. https://doi.org/10.13140/2.1.3272.8005
  28. NARA (2009). Significant properties. Retrieved from https://www.archives.gov/files/era/acera/pdf/significant-properties.pdf
  29. The National Archives (2018. 5. 1). Significant properties. Retrieved from http://www.significantproperties.org.uk