• Title/Summary/Keyword: document storing

Search Result 72, Processing Time 0.025 seconds

XML Documents Clustering Technique Based on Bit Vector (비트벡터에 기반한 XML 문서 군집화 기법)

  • Kim, Woo-Saeng
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.47 no.5
    • /
    • pp.10-16
    • /
    • 2010
  • XML is increasingly important in data exchange and information management. A large amount of efforts have been spent in developing efficient techniques for accessing, querying, and storing XML documents. In this paper, we propose a new method to cluster XML documents efficiently. A bit vector which represents a XML document is proposed to cluster the XML documents. The similarity between two XML documents is measured by a bit-wise AND operation between two corresponding bit vectors. The experiment shows that the clusters are formed well and efficiently when a bit vector is used for the feature of a XML document.

Text Document Categorization using FP-Tree (FP-Tree를 이용한 문서 분류 방법)

  • Park, Yong-Ki;Kim, Hwang-Soo
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.11
    • /
    • pp.984-990
    • /
    • 2007
  • As the amount of electronic documents increases explosively, automatic text categorization methods are needed to identify those of interest. Most methods use machine learning techniques based on a word set. This paper introduces a new method, called FPTC (FP-Tree based Text Classifier). FP-Tree is a data structure used in data-mining. In this paper, a method of storing text sentence patterns in the FP-Tree structure and classifying text using the patterns is presented. In the experiments conducted, we use our algorithm with a #Mutual Information and Entropy# approach to improve performance. We also present an analysis of the algorithm via an ordinary differential categorization method.

Management of the Structure Information of HyTime Documents using Object-Oriented Database (객체 지향 데이타베이스를 이용한 HyTime 문서의 구조 정보 관리)

  • 박인호;강현석
    • Journal of Korea Multimedia Society
    • /
    • v.5 no.4
    • /
    • pp.351-360
    • /
    • 2002
  • HyTime(Hypermedia/Time-based Structuring Language), an international standard language to describe hypermedia electronic documents, is used to support the synchronization between various multimedia data for hypermedia applications. To manage the HyTime documents efficiently for shared environment, the logical structure information of them should be managed by database in a systematic way. In this Paper, we design a meta-database schema of HyTime DTDs(Document Type Definition) which define the logical structure of hypermedia documents and show how to manage the meta-database schema for storing the HyTime DTDs in the object-oriented database.

  • PDF

The Algorithm For Spatial XQuery2SQL Converter (Spatial XQuery2SQL Converter를 위한 알고리즘)

  • Choi, Young Nn;Seo, Hyun-Ho
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2004.11a
    • /
    • pp.442-447
    • /
    • 2004
  • XML is normalized text form that is designed to transmit structured document in web as that propose in W3C (World Wide Web Consortium) in 1996. Function that this can overcome HTML's limit that use in existing in Internet and user define new tag to HTML by way to solve SGML's complexity added. There is many efforts to use storing this XML document in RDBMS but to relation style DB because XML document is tree structure structurally data SQL and perfect disaster caused by things that is language to ask a question accomplish XQuery that so it is W3C's XML standard query appear. After store XML informations including space information to RDBMS in this paper, Spatial XQuery through converter that is Sqatial XQuery2SQL through Spatial operator, Spatial function SQL of by Sqatial XQuery2SQL conversion algorithm that draw information in RDBMS after change embody wish to.

  • PDF

DEDMS : Distributed Environment Document Management System Model based on the XML-RPC (XML-RPC 기반의 분산환경 문서관리 시스템 모델)

  • 고혁준;김정희;곽호영
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.2
    • /
    • pp.394-406
    • /
    • 2004
  • Even the document resources offered from web server can be represented in the form of URL/URI, it can not necessarily be guaranteed that corresponding resources exist due to a dynamic change of sewer environment In this paper, integrated document administration system is therefore proposed and modeled using the XML-RPC technology which guarantees the reliance of resources, and handles a dynamic server resource management and request of clients. The proposed system is composed of middleware and server systems. The former system manages dynamic server resources, and the latter reports the updated information of documentations stored in server by client from the server to middleware system. As a result, effective storing management of dynamic resource in distributed server could be archived and building cost of a new web server could be reduced due to an applicability to current web sewer. In addition platform independent and efficient data management was obtained by using the XML-RPC protocol.

A study on the development of Korean Indigenous beverages in a research on GAL-Soo (한국 고유음료류 개발에 관한 연구중 갈수에 관한 연구)

  • 오승희
    • Journal of the Korean Professional Engineers Association
    • /
    • v.15 no.1
    • /
    • pp.14-23
    • /
    • 1982
  • 1) The characteristic of the recipe for GAL-SOO is to make odorous, pharmaceutical cereal into plaster by honey-mixed boiling so as to be drunken whenever it needed. But the cereal which included sugar was boiled without adding sugar. 2) The part of cereal which was used mostly in making GAL-SOO was seeds of cereals. 3) The history of GAL-SOO was derived from GU GA PIL YONG, a document of WON dynasty of GHINA, but it was developped according to our taste in this county. 4) The pharmaceutical action of GAL-SOO was mainly to strengthen stomach, digestion and appeasing thirst. 5) GAL-SOO tastes so sour and sweet that it was splendid to drink especially in summer as a beverage. 6) The Value of developping GAL-SOO as a beverage is highly approved because of its expediency of recipe and tough endurance storing under ordinary temperature.

  • PDF

A Study on Document Storage and Information Retrieval for Educational Informatization (교육정보화를 위한 문서저장 및 정보검색에 관한 연구)

  • Kang, Mu-Yeong;Lee, Sang-Gu
    • Journal of The Korean Association of Information Education
    • /
    • v.3 no.1
    • /
    • pp.1-12
    • /
    • 1999
  • By recent advances in the information management tools, Education Softwares and Study Wares.are rapidly developed and distributed over the Internet very fast. As Education Administrative Information systems are developed, information generated from the education field is growing large and fast. In this paper, we suggest a method of efficiently storing and retrieving such information in the large data basis, in real time. We also demonstrate an information system by the proposed method.

  • PDF

A study on storing a XML/EDI document with XLink (XLink를 이용한 XML/EDI 문서의 저장에 관한 연구)

  • 김수영;윤용익
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10c
    • /
    • pp.703-705
    • /
    • 2001
  • 전통 EDI(Electronic Data Interchange) 문서는 VAN(Value Added Network) 전용망을 통하여 EDI 서비스를 하였다. 하지만, 이것은 실시간(real-time)으로 문서를 처리하는 방식보다 주로 배치(batch)방식으로 한꺼번에 문서를 처리하였으며 전용 소프트웨 어를 사용함으로써 새로운 문서를 처리할때마다 새 문서에 대한 정보를 등록하고 소프트웨어를 다시 설치 해야하는 불편함도 있었다. 기존의 전통 EDI 문서는 VAN을 통하여 처리하는 방식이었으나 현재는 인터넷에서 EDI 문서를 볼 수 있도록 하기 위해 XML(extensible Markup Language)을 이용하고 있다. 인터넷기반의 웹 브라우저 상에서 볼 수 있는 XML/EDI 구현에 힘입어 여러 문서로 분리되어진 EDI 문서를 XLink의 개념을 이용하여 문서의 삽입, 삭제 기능과 이러한 문서를 통합하여 하나의 문서로 데이터베이스에 저장할 수 있는 방법에 관하여 연구하고자 한다.

  • PDF

A Transformation of XML DTD to Relational Database Schema Using Functional Dependency (함수적 종속관계를 이용한 XML DTD의 관계형 스키마 변환)

  • Lee Jung-hwa;Lee Man-sik;Yun Hong-won
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.7
    • /
    • pp.1604-1609
    • /
    • 2004
  • We have to convert XML DTD into relational database schema for storing XML Document at relational database. Hybrid inlining algorithm are used for converting XML DTD to relational database schema. But this method have some problem. That is the relational database schema have N:N relationship are created according this method are not satisfied with third normal from. Therefore, We proposed Extended Hybrid inlining algorithm for solving this problem in this paper.

XML Document Clustering Technique by K-means algorithm through PCA (주성분 분석의 K 평균 알고리즘을 통한 XML 문서 군집화 기법)

  • Kim, Woo-Saeng
    • The KIPS Transactions:PartD
    • /
    • v.18D no.5
    • /
    • pp.339-342
    • /
    • 2011
  • Recently, researches are studied in developing efficient techniques for accessing, querying, and storing XML documents which are frequently used in the Internet. In this paper, we propose a new method to cluster XML documents efficiently. We use a K-means algorithm with a Principal Component Analysis(PCA) to cluster XML documents after they are represented by vectors in the feature vector space by transferring them as names and levels of the elements of the corresponding trees. The experiment shows that our proposed method has a good result.