• Title/Summary/Keyword: document file format

Search Result 54

Selection Strategy for the File Format of Official Documents in a Cloud Environment (클라우드 환경에서 공문서 파일포맷의 선택 전략)

  • Yim, Jin Hee
    • The Korean Journal of Archival Studies / no.66 / pp.5-35 / 2020
  • In the digital age, the most basic unit of records management is a document file. Depending on whether each document file is produced in an open standard format or an exclusive proprietary format, there is a cost difference in the process of use and long-term preservation. As government agencies moved to cloud-based business management systems, web-based document editing software was newly adopted. They had the opportunity to select a new file format for web-based document editing software. This paper deals with strategies to maximize the advantages of open standard formats while considering the legacy public document production practices. It introduces ISO/IEC 26300:2006 ODF, ISO/IEC 29500:2008 OOXML, KS X 6101 OWPML, etc. as open standard formats, and looks at considerations when choosing a format and web-based document editing software. In addition, it proposes a strategy to maintain the format of document files shared in cloud storage as an open standard format.

A Study on Edit Order of Text Cells on the MS Excel Files (MS 엑셀 파일의 텍스트 셀 입력 순서에 관한 연구)

  • Lee, Yoonmi;Chung, Hyunji;Lee, Sangjin
    • Journal of the Korea Institute of Information Security & Cryptology / v.24 no.2 / pp.319-325 / 2014
  • With the recent spread of smartphones and tablet PCs, users can create and edit documents anywhere in real time. If the input and editing flow of a document can be traced, it can serve as evidence in a digital forensic investigation. The most typical document application suite is MS (Microsoft) Office, which has used two file formats: the Compound Document File Format, used from version 97 to 2003, and the OOXML (Office Open XML) File Format, used from version 2007 onward. Studies on MS Office files have so far focused on determining whether a file has been tampered with, by detecting concealed items or analyzing document properties. This paper analyzes the input order of text cells in MS Excel files and shows how to identify the last edited cell from a digital forensic perspective.
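The abstract does not say which internal structure reveals the input order. The sketch below is an illustration under an explicit assumption, not the authors' method: it assumes that the index of a string in the OOXML shared string table (`xl/sharedStrings.xml`) approximates the order in which text was first entered. That assumption may fail after deduplication or re-saving. The function name and the sample package are hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

NS = {"m": "http://schemas.openxmlformats.org/spreadsheetml/2006/main"}
T_TAG = "{%s}t" % NS["m"]


def text_cells_by_string_index(xlsx_bytes, sheet_part="xl/worksheets/sheet1.xml"):
    """List (shared-string index, cell reference, text), sorted by index."""
    with zipfile.ZipFile(io.BytesIO(xlsx_bytes)) as zf:
        shared = ET.fromstring(zf.read("xl/sharedStrings.xml"))
        sheet = ET.fromstring(zf.read(sheet_part))
    # Each <si> is one shared string; join its <t> runs into plain text.
    strings = ["".join(t.text or "" for t in si.iter(T_TAG))
               for si in shared.findall("m:si", NS)]
    cells = []
    # Cells with t="s" store an index into the shared string table in <v>.
    for c in sheet.iterfind(".//m:c[@t='s']", NS):
        v = c.find("m:v", NS)
        if v is not None and v.text is not None:
            idx = int(v.text)
            cells.append((idx, c.get("r"), strings[idx]))
    # ASSUMPTION: a higher index means the text was entered later.
    return sorted(cells)


# Hypothetical minimal package containing only the two parts read above.
_MAIN = NS["m"]
_shared = ('<sst xmlns="%s"><si><t>first</t></si><si><t>second</t></si>'
           '<si><t>third</t></si></sst>' % _MAIN)
_sheet = ('<worksheet xmlns="%s"><sheetData>'
          '<row r="1"><c r="A1" t="s"><v>2</v></c><c r="B1" t="s"><v>0</v></c></row>'
          '<row r="2"><c r="A2" t="s"><v>1</v></c><c r="B2"><v>42</v></c></row>'
          '</sheetData></worksheet>' % _MAIN)
_buf = io.BytesIO()
with zipfile.ZipFile(_buf, "w") as _zf:
    _zf.writestr("xl/sharedStrings.xml", _shared)
    _zf.writestr("xl/worksheets/sheet1.xml", _sheet)

ordered = text_cells_by_string_index(_buf.getvalue())
print(ordered)
```

Under the stated assumption, the last tuple in `ordered` would be a candidate for the most recently entered text cell. A real investigation would need to corroborate this against other artifacts.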

A Study on the online of PDF Electronic Documents System (인터넷 원거리출판의 응용과 PDF의 인쇄활용에 관한 연구)

  • 유영수;강영립;김병현;이광수
    • Proceedings of the Korean Printing Society Conference / 2001.06a / pp.63-77 / 2001
  • PDF (Portable Document Format) is a file format that Adobe developed by advancing PostScript technology, and it is used for managing document information and for electronic publishing (Internet, CD-ROM, DVD). PDF was devised as a document type that can be read and printed anywhere, independent of the OS, printer type, resolution, and kind of computer. Because it includes a compression function, documents can be transferred as small files over the Internet or an intranet. The format also has various advantages for sharing information and transferring documents in online and offline environments. In this paper, we developed an electronic document system using the PDF format. The system consists of a filter, automatic indexing, a special search system, and a web server. The information used in this paper is a database built with Zwon's DocuCom. The filter recognizes various kinds of document structure and produces ASCII output according to the properties of the document. In addition to processing various document formats, the filter can extract keywords from MS Word, Excel, PowerPoint, PDF, and CAD documents. It uses the structure of a Windows printer driver and can extract text, page, font type, and font size information from the relevant document. The automatic indexing recognizes the formatting tags in the ASCII text produced by the filter and extracts keywords appropriate to the structure and properties of the document. The PDF electronic document system proposed in this paper can be used over the Internet and PC communication services. Users can choose and read electronic documents in two ways. First, they can choose and read relevant books through the PDF electronic document homepage. Second, they can use the PDF integrated search system, entering a keyword and choosing the reference field and type of data. At present, however, Adobe's PDF products cannot support Korean characters. If this problem is resolved, we think that PDF application systems will become more active. Although Zwon's DocuCom, used in this study, has limited functionality, we think that there is no great difficulty in producing electronic documents and building a digital database.

  • PDF

An effective detection method for hiding data in compound-document files (복합문서 파일에 은닉된 데이터 탐지 기법에 대한 연구)

  • Kim, EunKwang;Jeon, SangJun;Han, JaeHyeok;Lee, MinWook;Lee, Sangjin
    • Journal of the Korea Institute of Information Security & Cryptology / v.25 no.6 / pp.1485-1494 / 2015
  • Traditionally, data hiding has mainly been done by inserting data into large-capacity multimedia files. However, document files from Microsoft Office 2003 and earlier versions have also been used as cover files, because their structure is so similar to a file system that it is easy to hide data in them. When a compound-document file containing a hidden secret message is opened with an MS Office application, users who do not know that the message exists are unlikely to detect it. This paper presents an analysis of the Compound File Binary Format features exploited to hide data, and algorithms to detect data hidden with these exploits. Studying methods used to hide data in unused areas, unallocated areas, reserved areas, and inserted streams led us to develop an algorithm that aids in the detection and examination of hidden data.
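As a rough illustration of the "reserved area" case only, the sketch below inspects two fields of the 512-byte compound file header that the Compound File Binary Format requires to be all zeros: the 16-byte header CLSID at offset 8 and the 6 reserved bytes at offset 34. It is not the detection algorithm from the paper. The function name `check_cfb_header` and the sample headers are hypothetical, and unused areas, unallocated sectors, and inserted streams are out of scope.

```python
import struct

CFB_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")


def check_cfb_header(header: bytes) -> list:
    """Return findings for header fields that should be zero but are not."""
    if len(header) < 512:
        raise ValueError("need at least the 512-byte header")
    if header[:8] != CFB_SIGNATURE:
        raise ValueError("not a compound file (bad signature)")
    findings = []
    # Offset 8, 16 bytes: header CLSID, expected to be all zeros.
    if any(header[8:24]):
        findings.append("header CLSID is not all zeros")
    # Offset 34, 6 bytes: reserved field, expected to be all zeros.
    if any(header[34:40]):
        findings.append("reserved header bytes are not all zeros")
    return findings


# Hypothetical sample header: signature, CLSID, minor version, major version,
# byte order, sector shift, mini sector shift, reserved; then zero padding.
_fixed = struct.pack("<8s16sHHHHH6s", CFB_SIGNATURE, b"\x00" * 16,
                     0x003E, 3, 0xFFFE, 9, 6, b"\x00" * 6)
clean_header = _fixed + b"\x00" * (512 - len(_fixed))

# Simulate data hidden in the reserved field.
tampered_header = clean_header[:34] + b"secret" + clean_header[40:]

print(check_cfb_header(clean_header))
print(check_cfb_header(tampered_header))
```

A fuller check would also walk the FAT and directory entries to find sectors that no stream references, which is where larger payloads would fit.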

The Design and Implementation of the ODIF Data Stream Translation System for Document Interchange (문서교환을 위한 ODIF 테이터 스트림 변환 시스템 설계 및 구현)

  • Jeong, Hoe-Gyeong;Kim, Yong-Ho;Kim, Jin-Su
    • The Journal of Engineering Research / v.1 no.1 / pp.71-80 / 1997
  • This paper describes a conversion system that translates a word processor file into an ODIF (Open Document Interchange Format) data stream for document interchange between heterogeneous systems and applications. We design the translator according to ODA, the international base standard, and FOD26, the international functional standard, using a BosukGul V file as the source document file. The system is implemented with Microsoft Windows and the ODAC (ODA Consortium) toolkit in a PC environment.

  • PDF

A Study on Extracting the Document Text for Unallocated Areas of Data Fragments (비할당 영역 데이터 파편의 문서 텍스트 추출 방안에 관한 연구)

  • Yoo, Byeong-Yeong;Park, Jung-Heum;Bang, Je-Wan;Lee, Sang-Jin
    • Journal of the Korea Institute of Information Security & Cryptology / v.20 no.6 / pp.43-51 / 2010
  • Investigating data in unallocated space is meaningful because it allows deleted data to be examined. File carving can recover files that are stored contiguously and completely in unallocated space, but it cannot recover noncontiguous or incomplete data. Analysis of data fragments is therefore needed, because they may contain large amounts of information. The text of Microsoft Word, Excel, PowerPoint, and PDF document files is stored using compression or a specific document format. If part of such a document file is stored in an unallocated data fragment, its text can be extracted using knowledge of that document format. In this paper, we suggest a method for extracting the text of particular document files from unallocated data fragments.

A methodology for XML documentation of the structural calculation document to build database supporting safety management of infrastructures (사회기반시설물 안전관리 지원 데이터베이스 구축을 위한 구조계산서의 XML 문서화 방법론)

  • Kim, Bong-Geun;Park, Sang-Il;Lee, Jin-Hoon;Lee, Sang-Ho
    • Proceedings of the Korean Society of Hazard Mitigation Conference (한국방재학회:학술대회논문집) / 2007.02a / pp.414-417 / 2007
  • A methodology for XML documentation of structural calculation documents is presented to support manipulation of design information on the Internet. The text file format is chosen as a neutral format because office documents generated in engineering practice can easily be converted to it. The first word of each line is compared with the reserved numbering groups, and relative levels among the lines are defined to generate a hierarchically structured XML document from the text file. A demonstration on sample general documents and structural calculation documents shows that the prototype application module based on the developed methodology can be adopted to build a database of design information that supports the safety management of infrastructures.

  • PDF
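The line-by-line procedure described in the abstract above can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' module. The numbering groups (`1.`, `1)`, `(1)`) and their fixed levels are hypothetical, as are the element names `document`, `section`, and `text`. The paper derives relative levels among lines, which this sketch simplifies to a fixed mapping.

```python
import re
import xml.etree.ElementTree as ET

# ASSUMPTION: three hypothetical numbering groups with fixed nesting levels.
NUMBERING_GROUPS = [
    (re.compile(r"^\d+\.$"), 1),     # e.g. "1."
    (re.compile(r"^\d+\)$"), 2),     # e.g. "1)"
    (re.compile(r"^\(\d+\)$"), 3),   # e.g. "(1)"
]


def level_of(first_word):
    """Return the level of a numbering token, or None for body text."""
    for pattern, level in NUMBERING_GROUPS:
        if pattern.match(first_word):
            return level
    return None


def text_to_xml(text):
    """Build a nested XML tree from numbered plain-text lines."""
    root = ET.Element("document")
    stack = [(0, root)]  # (level, element) pairs for the currently open sections
    for line in text.splitlines():
        words = line.split(maxsplit=1)
        if not words:
            continue  # skip blank lines
        level = level_of(words[0])
        if level is None:
            # Body text is attached to the most recently opened section.
            ET.SubElement(stack[-1][1], "text").text = line.strip()
            continue
        # Close sections at the same or a deeper level before opening a new one.
        while stack[-1][0] >= level:
            stack.pop()
        title = words[1].strip() if len(words) > 1 else ""
        section = ET.SubElement(stack[-1][1], "section",
                                number=words[0], title=title)
        stack.append((level, section))
    return root


sample = """1. Design loads
1) Dead load
Self weight of the deck
2) Live load
2. Section check
"""
root = text_to_xml(sample)
print(ET.tostring(root, encoding="unicode"))
```

The stack keeps one open element per level, so a line with a shallower numbering token closes every deeper section before it is attached.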

A Study on the Selection of Preservation Format for Long-Term Preservation of Electronic Records (전자기록물의 장기보존을 위한 보존포맷 선정 방안에 관한 연구)

  • Han, Hui-Jeong;Oh, Hyo-Jung;Yang, Dongmin
    • Journal of Korean Society of Archives and Records Management / v.20 no.1 / pp.69-87 / 2020
  • For the long-term preservation of document-type electronic records, the National Archives of Korea has chosen PDF/A-1 as the preservation format, designated as the document file format, and established it as a public standard. Having PDF/A-1 as the only option restricts the use of the various electronic file formats that can or must be applied to actual work as IT advances and tasks change. Moreover, it is difficult to apply PDF/A-1 to other types of electronic records (administrative information datasets, audiovisual records, web records, etc.). Therefore, it is necessary to diversify the preservation formats of electronic records. We suggest a framework for selecting various preservation formats. Furthermore, we propose common criteria and evaluation methods that are commonly applied to all electronic records when selecting a preservation format, and introduce a methodology for deriving intrinsic criteria applied to each type of electronic record.

Methods for Investigating of Edit History about MS PowerPoint Files That Using the OOXML Formats (OOXML형식을 사용하는 MS 파워포인트 파일에 대한 편집 이력 조사 방법)

  • Youn, Ji-Hye;Park, Jung-Heum;Lee, Sang-Jin
    • The KIPS Transactions:PartC / v.19C no.4 / pp.215-224 / 2012
  • Today, individuals and businesses do a great deal of paperwork on computers, so many document files are created in digital form. These digital files are copied and moved through various media such as USB drives and e-mail. Careful analysis of such digital material can reveal the history of work performed during document editing. Existing research addresses the compound document file format, but for the new OOXML format there has been no study of how to analyze linkages between different document files, track the internal order, or find unsaved files in order to identify how a file was created. As the use of OOXML digital documents is expected to increase further, the ability to trace document work history would be a great help in digital forensic investigations. Therefore, this paper examines the new OOXML format from a forensic viewpoint and shows how to track the internal order and analyze linkages between files.

Effects of Adopting the Open Document Format in Public Records Management (공문서 컴포넌트 오픈포맷 채택이 기록관리에 미치는 영향 분석)

  • Jung, Mi Ri;Oh, Seh-La;Yim, Jin Hee
    • Journal of Korean Society of Archives and Records Management / v.16 no.2 / pp.29-55 / 2016
  • Korean public organizations create electronic documents through electronic document management systems under the e-Government scheme. A majority of these public documents are saved in vendor-dependent file formats, mainly HWP. Vendor-dependent formats can be opened only with specific software, which must be purchased. As the license does not guarantee compatibility between past and future versions, interoperability problems occur in long-term preservation and need to be solved. Any error during conversion from vendor-dependent formats to the XML-based standard exchange format, such as the elimination of styles or elements with no match in the document definition, leads to file open failure or modification of the original documents. This study introduces the Open Document Format (ODF) and investigates the effects of adopting ODF in the creation, exchange, management, and preservation of public records.