• Title/Summary/Keyword: Electronic document

Search Result 485, Processing Time 0.03 seconds

Digital Forensics of Microsoft Office 2007-2013 Documents to Prevent Covert Communication

  • Fu, Zhangjie;Sun, Xingming;Xi, Jie
    • Journal of Communications and Networks
    • /
    • v.17 no.5
    • /
    • pp.525-533
    • /
    • 2015
  • MS Office suit software is the most widely used electronic documents by a large number of users in the world, which has absolute predominance in office software market. MS Office 2007-2013 documents, which use new office open extensible markup language (OOXML) format, could be illegally used as cover mediums to transmit secret information by offenders, because they do not easily arouse others suspicion. This paper proposes nine forensic methods and an integrated forensic tool for OOXML format documents on the basis of researching the potential information hiding methods. The proposed forensic methods and tool cover three categories; document structure, document content, and document format. The aim is to prevent covert communication and provide security detection technology for electronic documents downloaded by users. The proposed methods can prevent the damage of secret information embedded by offenders. Extensive experiments based on real data set demonstrate the effectiveness of the proposed methods.

XML Repository System Using DBMS and IRS

  • Kang, Hyung-Il;Yoo, Jae-Soo;Lee, Byoung-Yup
    • International Journal of Contents
    • /
    • v.3 no.3
    • /
    • pp.6-14
    • /
    • 2007
  • In this paper, we design and implement a XML Repository System(XRS) that exploits the advantages of DBMSs and IRSs. Our scheme uses BRS to support full text indexing and content-based queries efficiently, and ORACLE to store XML documents, multimedia data, DTD and structure information. We design databases to manage XML documents including audio, video, images as well as text. We employ the non-composition model when storing XML documents into ORACLE. We represent structured information as ETID(Element Type Id), SORD(Sibling ORDer) and SSORD(Same Sibling ORDer). ETID is a unique value assigned to each element of DTD. SORD and SSORD represent an order information between sibling nodes and an order information among the sibling nodes with the same element respectively. In order to show superiority of our XRS, we perform various experiments in terms of the document loading time, document extracting time and contents retrieval time. It is shown through experiments that our XRS outperforms the existing XML document management systems. We also show that it supports various types of queries through performance experiments.

Design and Implementation of BADA-IV/XML Query Processor Supporting Efficient Structure Querying (효율적 구조 질의를 지원하는 바다-IV/XML 질의처리기의 설계 및 구현)

  • 이명철;김상균;손덕주;김명준;이규철
    • The Journal of Information Technology and Database
    • /
    • v.7 no.2
    • /
    • pp.17-32
    • /
    • 2000
  • As XML emerging as the Internet electronic document language standard of the next generation, the number of XML documents which contain vast amount of Information is increasing substantially through the transformation of existing documents to XML documents or the appearance of new XML documents. Consequently, XML document retrieval system becomes extremely essential for searching through a large quantity of XML documents that are storied in and managed by DBMS. In this paper we describe the design and implementation of BADA-IV/XML query processor that supports content-based, structure-based and attribute-based retrieval. We design XML query language based upon XQL (XML Query Language) of W3C and tightly-coupled with OQL (a query language for object-oriented database). XML document is stored and maintained in BADA-IV, which is an object-oriented database management system developed by ETRI (Electronics and Telecommunications Research Institute) The storage data model is based on DOM (Document Object Model), therefore the retrieval of XML documents is executed basically using DOM tree traversal. We improve the search performance using Node ID which represents node's hierarchy information in an XML document. Assuming that DOW tree is a complete k-ary tree, we show that Node ID technique is superior to DOM tree traversal from the viewpoint of node fetch counts.

  • PDF

XML Schema Document Editing System (XML 스키마 문서편집 시스템)

  • 차원준;최일선;김창수;정회경
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.05a
    • /
    • pp.285-288
    • /
    • 2003
  • Electronic commerce that is constructed in existing and XML document that is used in e-Business field made out because based to DTD. However, XML applications that XML Schema is much after become Recommendation in W3C May, 2001 XML Schema real condition that is developed to base be. Selected XML Schema in ebXML Registry 2.0 by representative thing connected with this. If develop XML Schema, compare when develop using existent DTD and complexity by namespace or object-oriented concept etc. was increasing, and could programming by various method. XML Schema document that is used by ebXML Framework in treatise that see hereupon study about XML Schema document editing system that offer mastication and user interface that can edit efficiently do.

  • PDF

Application Plan of Document Databases in the Big Data Environment (빅데이터환경에서의 문서데이터베이스 활용방안)

  • Park, Sungbum;Lee, Sangwon;Ahn, Hyunsup;Jung, In-Hwan
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.230-232
    • /
    • 2013
  • For Many enterprises are creating and handling huge amount of data in their business administration. However, it would be impossible for general databases such as Relational Databases, hierarchical databases, and network databases to manage and analyze this large amount of document data efficiently and effectively. So, in this paper, we define document databases and check out their characteristics such as consistency and transaction. And we propose appropriate or inappropriate subjects for application of document databases.

  • PDF

A Study on Service of Certified e-Document Authority System (공인전자문서보관소 서비스에 관한 연구)

  • Nam, Tae-Woo;Kim, Eun-Jeong
    • Journal of Information Management
    • /
    • v.40 no.2
    • /
    • pp.25-45
    • /
    • 2009
  • This study deals with a certified e-document authority system to securely archive e-documents by a government authorized third party. For this study, I have looked at three cases of certified e-document authority systems and compared then with the similar cases in the USA and Japan. In my conclusion, I suggest four ideas to improve the service of certified e-document authority system. First, the company providing the certified e-document authority system needs to expand to special services. Second, they need to concentrate on subject area of their business. Third, they should provide a consulting service for business archives with business customers. Finally, they can support additional services like document risk management and other digital contents archiving.

Overview and Future Plan on Electronic Document Handling(EDH) of ITU-T (ITU-T 전자정보유통시스템의 현황과 과제)

  • Gu, Gyeong-Cheol;Park, Gi-Sik
    • Electronics and Telecommunications Trends
    • /
    • v.12 no.2 s.44
    • /
    • pp.103-118
    • /
    • 1997
  • 최근 국제표준화기구인 ITU를 비롯해 ETSI, T1 Committee, TTA, TTC, ATSC, TSACC 등 각 지역 표준화 기구 (Participating Standardization Organization: PSO)들은 빠른 기술개발에 따른 적기의 표준공급 및 전자적인 표준화문서유통을 통한 신속한 표준제정을 위해 EDH(Electronic Document Handling)라는 전자정보유통시스템을 구축하고 기고서 및 표준문서 등 표준화 진행에 관련된 각종 정보를 전자적으로 검색하고 처리할 수 있는 환경을 구축하는데 많은 노력을 기울이고 있다. 이와 관련하여, 본 고에서는 제2차 세계전기통신표준총회(WTSC-96)에서 가장 활발하게 논의된 사항 중의 하나인 ITU-T/TSAG/EDH 관련 표준화 활동 현황을 고찰하고, 향후 EDH의 방향을 소개하고자 한다.

A Comparison of Electronic book metadata formats and Development of Electronic Book of Korea Standard metadata (eBook 메타데이터 비교 및 한국전자책표준의 메타데이터 개발)

  • 김경옥;김성혁;임순범;최윤철
    • Proceedings of the CALSEC Conference
    • /
    • 2001.08a
    • /
    • pp.511-521
    • /
    • 2001
  • This paper is to develop metadata format for eBook document standard at Korea. Metadata formats of OEBF, JepaX and AAP were compared and analyzed on the criteria such as purpose, basic elements, characteristics, compatibility, extensibility and convertibility. EBKS metadata format based on Dublin Core was developed in terms of easy to use, resources descriptions and discovery, extensibility and compatibility between other metadata formats such as MARC and Dublin Core. Finally, research and revision direction of the eBook document standard were proposed for the future study.

  • PDF

A Preliminary Study on Clinical Decision Support System based on Classification Learning of Electronic Medical Records

  • Shin, Yang-Kyu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.817-824
    • /
    • 2003
  • We employed a hierarchical document classification method to classify a massive collection of electronic medical records(EMR) written in both Korean and English. Our experimental system has been learned from 5,000 records of EMR text data and predicted a newly given set of EMR text data over 68% correctly. We expect the accuracy rate can be improved greatly provided a dictionary of medical terms or a suitable medical thesaurus. The classification system might play a key role in some clinical decision support systems and various interpretation systems for clinical data.

  • PDF

Study on History Tracking Technique of the Document File through RSID Analysis in MS Word (MS 워드의 RSID 분석을 통한 문서파일 이력 추적 기법 연구)

  • Joun, Jihun;Han, Jaehyeok;Jung, Doowon;Lee, Sangjin
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.28 no.6
    • /
    • pp.1439-1448
    • /
    • 2018
  • Many electronic document files, including Microsoft Office Word (MS Word), have become a major issue in various legal disputes such as privacy, contract forgery, and trade secret leakage. The internal metadata of OOXML (Office Open XML) format, which is used since MS Word 2007, stores the unique Revision Identifier (RSID). The RSID is a distinct value assigned to a corresponding word, sentence, or paragraph that has been created/modified/deleted after a document is saved. Also, document history, such as addition/correction/deletion of contents or the order of creation, can be tracked using the RSID. In this paper, we propose a methodology to investigate discrimination between the original document and copy as well as possible document file leakage by utilizing the changes of the RSID according to the user's behavior.