A Study on Classification and Recovery Method of Damaged Electronic Records

손상된 전자기록물 구분과 복원 방법에 관한 연구

  • Received : 2018.11.27
  • Accepted : 2018.12.14
  • Published : 2019.02.28

Abstract

In this paper, we propose a method for identifying damaged electronic records and a recovery method tailored to the type of damage. The proposed classification engine classifies damaged records based on the form and structure of the electronic record, and the recovery engine raises the probability of recovery by applying a method suited to the form of damage. In this way, we aim to minimize the number of electronic records that are destroyed, and we verify the approach through experiments.
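As a rough illustration of structure-based classification, the sketch below labels a file by checking a few well-known structural markers: the OLE compound file signature and sector shift, the PDF `%PDF-` header and `%%EOF` marker, and XML well-formedness. The function name `classify` and the returned labels are hypothetical and are not the codes assigned by the paper's classification engine; the checks are simple heuristics, not the authors' method.

```python
import struct
import xml.etree.ElementTree as ET

OLE_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # compound file signature (MS-CFB)


def classify(data: bytes) -> str:
    """Return a coarse, hypothetical damage label for a file's raw bytes."""
    if not data:
        return "empty"

    if data.startswith(OLE_MAGIC):
        if len(data) < 512:
            return "ole-truncated-header"
        # Bytes 30-31 of the header hold the sector shift (9 -> 512 B, 12 -> 4096 B).
        (sector_shift,) = struct.unpack_from("<H", data, 30)
        if sector_shift not in (9, 12):
            return "ole-bad-header"
        # Heuristic (assumption): a well-formed file is a whole number of sectors long.
        if len(data) % (1 << sector_shift) != 0:
            return "ole-truncated-sector"
        return "ole-ok"

    if data.startswith(b"%PDF-"):
        # Heuristic (assumption): the %%EOF marker should appear near the end of the file.
        return "pdf-ok" if b"%%EOF" in data[-1024:] else "pdf-missing-eof"

    if data.startswith(b"<?xml"):
        try:
            ET.fromstring(data)
        except ET.ParseError:
            return "xml-malformed"
        return "xml-ok"

    return "unknown"
```

A caller would typically pass the raw bytes of a record, for example `classify(open(path, "rb").read())`, and route the file to a recovery routine chosen by the returned label.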

Keywords

Classification method of damaged electronic records; Recovery method of damaged electronic records; OLE File

Fig. 1. Electronic records transfer process from RMS to CAMS [1]

Fig. 2. Structure of OLE File

Fig. 3. Block of OLE File sector

Fig. 4. Structure of PDF File

Fig. 5. Recovered HML files

Fig. 6. Extract text from corrupted files

Fig. 7. Extract text and images from corrupted files

Fig. 8. System configuration of corrupted file separator and recovery engine

Table 1. Number and growth rate of damaged files in electronic records

Table 2. Count by electronic record file type

Table 3. Meaning of each sector information value

Table 4. Code Assignment of corrupted file separator

Table 5. Performance of corrupted file separator

Table 6. Performance of corrupted file recovery engine

References

  1. RMS main function, http://www.archives.go.kr/archivesdata/upFile/palgan/1404206260657.pdf
  2. Object Linking and Embedding (OLE) Data Structures, https://msdn.microsoft.com/en-us/library/dd942265.aspx
  3. Extensible Markup Language (XML) 1.0 (Fifth Edition), https://www.w3.org/TR/xml/
  4. HWP File Format, https://www.hancom.com/etc/hwpDownload.do
  5. Portable document format - Part 1: PDF 1.7, https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf
  6. Jeewon Jang, "A Recovery Technique of PDF File in the Unit of Page," http://www.ndsl.kr/ndsl/commons/util/ndslOriginalView.do?cn=JAKO201710758143462&dbt=JAKO&koi=KISTI1.1003%2FJNL.JAKO201710758143462
  7. [MS-PPT]: PowerPoint (.ppt) Binary File Format, https://msdn.microsoft.com/en-us/library/office/cc313106(v=office.12).aspx
  8. Karl Wust, "Force Open: Lightweight black box file repair," Proceedings of the Fourth Annual DFRWS Europe, pp. 75-82, Jan. 2017.