• Title/Summary/Keyword: XML retrieval

Design of an Information Retrieval Indexing Method using XML Links (XML 링크정보를 이용한 정보 검색 색인 기법의 설계)

  • Kim, Eun-Jeong;Bae, Jong-Min
    • The Transactions of the Korea Information Processing Society / v.7 no.7 / pp.2020-2027 / 2000
  • Hypertext documents are used for information exchange in Web environments. Their structure can be viewed as a graph connected by links, which makes nonlinear processing of documents possible. This paper proposes an indexing method for information retrieval systems that uses XML links. We define new attributes that control links to remote documents and assign a unique identifier to the attribute of each link. Each identifier carries a different weight depending on whether it occurs in the local document or in a remote document. We index a word not only from the local document but also from remote documents, based on the given weight. Experimental results show that the proposed method outperforms conventional retrieval systems that ignore links.
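
The link-weighted indexing idea in this abstract can be sketched roughly as follows. This is only an illustration, not the authors' method: the weight values, the `build_link_index` function, and the dictionary-based document and link representation are all assumptions, since the abstract does not give the attribute definitions or the actual weights.

```python
from collections import defaultdict

# Assumed weights: the abstract only says that local and remote
# occurrences are weighted differently, not what the values are.
LOCAL_WEIGHT = 1.0
REMOTE_WEIGHT = 0.5


def build_link_index(docs, links):
    """Return {word: {doc_id: score}} from local text plus linked text.

    docs  maps a document id to its text.
    links maps a document id to the ids of the documents it links to.
    """
    index = defaultdict(lambda: defaultdict(float))
    for doc_id, text in docs.items():
        # Words that occur in the document itself.
        for word in text.lower().split():
            index[word][doc_id] += LOCAL_WEIGHT
        # Words that occur in documents reached through a link.
        for target in links.get(doc_id, []):
            for word in docs.get(target, "").lower().split():
                index[word][doc_id] += REMOTE_WEIGHT
    return index


docs = {"a": "xml link index", "b": "remote xml retrieval"}
links = {"a": ["b"]}
index = build_link_index(docs, links)
print(dict(index["xml"]))        # {'a': 1.5, 'b': 1.0}
print(dict(index["retrieval"]))  # {'a': 0.5, 'b': 1.0}
```

Document "a" never contains the word "retrieval", yet it becomes retrievable for that word with a reduced score because it links to "b".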

Path Signatures : Path-oriented Query Processing System for XML document Retrieval (경로 서명 : XML문서 검색을 위한 경로-지향 질의처리 시스템)

  • Park, Hee-Sook;Park, Ju-Hyun;Cho, Woo-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.11 no.7 / pp.1311-1317 / 2007
  • With the popularity and explosive growth of the Internet, information exchange over the Internet is increasing rapidly. XML is becoming a standard and a major tool for data exchange on the Internet. We therefore propose a new indexing technique for evaluating path-oriented queries, and we design and implement a Path-oriented Query Processing System for users. The proposed indexing technique combines a binary trie structure with a path signature file to improve the performance of XML document retrieval.
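
As a rough illustration of how a path signature file can prune candidates for a path-oriented query, the sketch below uses generic superimposed coding: each element name is hashed to a few bits, a path signature is the OR of its names, and a stored path is a candidate only if it covers every bit of the query signature. The signature width, the number of bits per name, and all function names are assumptions, and the binary trie that the paper combines with the signature file is omitted.

```python
import hashlib

SIG_BITS = 64      # assumed signature width
BITS_PER_NAME = 3  # assumed (maximum) number of bits set per element name


def name_signature(name):
    """Hash one element name to a bit pattern (SHA-256 keeps runs repeatable)."""
    digest = hashlib.sha256(name.encode("utf-8")).digest()
    sig = 0
    for byte in digest[:BITS_PER_NAME]:
        sig |= 1 << (byte % SIG_BITS)
    return sig


def path_signature(path):
    """Superimpose (OR together) the signatures of all names on the path."""
    sig = 0
    for name in path.strip("/").split("/"):
        sig |= name_signature(name)
    return sig


def search(paths, query):
    """Filter by signature first, then verify the surviving candidates exactly."""
    query_sig = path_signature(query)
    candidates = [p for p in paths if (path_signature(p) & query_sig) == query_sig]
    # Signatures can give false positives, so each candidate is checked again.
    needle = "/" + query.strip("/") + "/"
    return [p for p in candidates if needle in p + "/"]


paths = ["/library/book/title", "/library/book/author", "/library/magazine/title"]
matches = search(paths, "book/title")
print(matches)  # ['/library/book/title']
```

The signature test can never reject a true match, because a matching path contains every name of the query; it only saves the cost of verifying paths that clearly cannot match.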

Storage and Retrieval of XML Documents Without Redundant Path Information (경로정보의 중복을 제거한 XML 문서의 저장 및 질의처리 기법)

  • Lee Hiye-Ja;Jeong Byeong-Soo;Kim Dae-Ho;Lee Young-Koo
    • The KIPS Transactions:PartD / v.12D no.5 s.101 / pp.663-672 / 2005
  • This paper proposes an approach that removes the redundancy of path information and uses an inverted index as an efficient way to store a large volume of XML documents and retrieve the desired information from them. An XML document is decomposed into nodes based on its tree structure and stored in relational tables according to node type, together with the path from the root to each node. Existing methods that use path information store data for all element paths, which degrades retrieval performance as the data volume grows. Our approach stores data only for leaf element paths, excluding internal element paths. Because the inverted index is built from leaf element paths only, the number of posting lists for keywords is smaller than in existing methods. For the storage and retrieval of XML data, our approach requires neither the XML schema information of the documents nor any extension of the relational database. Within the scope of our experiments, we demonstrate that our approach performs better than existing approaches.
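
A minimal in-memory sketch of indexing only leaf element paths is shown below. It is not the paper's implementation: the paper stores nodes in relational tables, whereas this example keeps a plain dictionary, and the function name `leaf_path_index` and the sample document are made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def leaf_path_index(xml_text):
    """Map each keyword to the set of leaf element paths that contain it."""
    root = ET.fromstring(xml_text)
    index = defaultdict(set)

    def walk(elem, path):
        current = f"{path}/{elem.tag}"
        children = list(elem)
        if children:
            # Internal element: descend, but store nothing for its own path.
            for child in children:
                walk(child, current)
        elif elem.text and elem.text.strip():
            # Leaf element: index its words under the full root-to-leaf path.
            for word in elem.text.lower().split():
                index[word].add(current)

    walk(root, "")
    return index


sample = """<library>
  <book><title>XML Retrieval</title><author>Kim</author></book>
  <book><title>Path Index</title><author>Lee</author></book>
</library>"""
index = leaf_path_index(sample)
print(sorted(index["xml"]))  # ['/library/book/title']
```

Only `/library/book/title` and `/library/book/author` appear in the index; the internal paths `/library` and `/library/book` are never stored.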

Implementation of an XML-Based Editor/Transformer for Large Volume of Similar Documents (XML 기반의 대용량 유사 문서 편집기/변환기 구현)

  • 황인준
    • The Journal of Society for e-Business Studies / v.9 no.1 / pp.21-38 / 2004
  • With its recent popularity, the Web is now considered a huge repository of information. Most documents on the Web have been created using HTML (HyperText Markup Language). Even though HTML is simple and easy to learn, it has several features that are obstacles to efficient information retrieval. XML (eXtensible Markup Language) can provide a solution to such problems and has in fact already been used in many applications. XML is a standard markup language for exchanging data on the Web. It can describe a document structure freely by defining its DTD, which enables efficient integration and retrieval of data on the Web. In this paper, we propose a versatile and efficient XML document manager. Its features include (i) a form-based XML editor that enables easy creation of new XML documents, (ii) an automatic document converter that transforms HTML documents with similar structure into XML documents, and (iii) a GUI-based DTD editor.

A Path Combining Strategy for Efficient Storing of XML Documents (XML 문서의 효율적인 저장을 위한 경로 통합 기법)

  • Lee, Bum-Suk;Hwang, Byung-Yeon
    • Journal of Korea Multimedia Society / v.9 no.10 / pp.1257-1265 / 2006
  • As XML is increasingly used, the need for XML-related research in various fields is also growing. Many XML document management systems have been actively developed, especially for the storage, processing, and retrieval of XML documents. BitCube is a three-dimensional bitmap index system that can be manipulated efficiently and improves the performance of document retrieval. However, the size of the index increases rapidly when a new bit is added to an axis. This problem is caused by its three-dimensional memory structure of document, path, and word. In this paper, we suggest a path combining strategy for XML documents to solve this problem of BitCube. To reduce the size of the index, our approach combines sibling nodes that have the same ancestor path and transforms the word axis into a value axis. The method reduces the size of the index when the system composes the three-dimensional bitmap index. It also improves retrieval speed and uses storage space more efficiently.

Multimedia Learning Contents Retrieval Based on XML/RDF and SMIL (XML/RDF와 SMIL에 기반한 멀티미디어 교육 컨텐츠 검색)

  • Choi, Byung-Uk;Ryu, Jung-Woo;Cho, Jung-Won
    • The Journal of Korean Association of Computer Education / v.5 no.3 / pp.45-58 / 2002
  • In this paper, we propose a new approach that lets users retrieve from a massive volume of learning contents in a multimedia learning system. To secure the compatibility of learning contents, we apply SMIL, which is based on XML, so that the integration and synchronization of multimedia components can be realized in a standardized way. We also implement the multimedia learning contents represented in RDF on the basis of IEEE LOM. We present a two-step retrieval method to get precise results. In the first step, the user can quickly and easily find the desired contents through metadata in the system. In the second step, by using the time information of SMIL, the user can retrieve the synchronized parts of interest from the results of the first step. This retrieval approach is expected to contribute to implementing the principles of self-directed learning in learning environments, where users can use and revise the retrieval results for their own learning purposes and further engage in active knowledge reconstruction.

The Design and Implementation of Item pool System using XML (XML을 이용한 문제은행 시스템 설계 및 구현)

  • 하명희;박남숙
    • KSCI Review / v.8 no.2 / pp.33-42 / 2001
  • The purpose of this study was to help retrieve and assess only what the learner wants. Multiple-choice and short-answer item types were selected, and a question bank was organized in consideration of the degree of difficulty and the frequency with which items had been asked, so that it has discriminating power. For item retrieval, the stored information was converted into XML data instead of simply being searched in the database, and that data was retrieved through XPath. The system is designed to show the retrieval output in a browser using XML. For item evaluation, evaluation items were produced by inputting the degree of difficulty and question frequency of the subject and unit the learner wants, and then inputting the number of items of each type. The learning outcome was offered to the learner in real time, and learners could repeatedly drill the items they had answered incorrectly.
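
Retrieving items through XPath, as this abstract describes, might look roughly like the sketch below. The element and attribute names (`itembank`, `item`, `type`, `difficulty`) are invented for illustration, since the abstract does not give the actual schema, and the example relies on the limited XPath subset that Python's standard `xml.etree.ElementTree` supports.

```python
import xml.etree.ElementTree as ET

# Hypothetical item bank; the real schema is not given in the abstract.
ITEM_BANK = """<itembank>
  <item id="1" type="multiple-choice" difficulty="easy"><question>2 + 2 = ?</question></item>
  <item id="2" type="short-answer" difficulty="hard"><question>Define XPath.</question></item>
  <item id="3" type="multiple-choice" difficulty="hard"><question>Which is a markup language?</question></item>
</itembank>"""

root = ET.fromstring(ITEM_BANK)

# Select the hard multiple-choice items with two attribute predicates.
hard_mc = root.findall(".//item[@type='multiple-choice'][@difficulty='hard']")
ids = [item.get("id") for item in hard_mc]
print(ids)  # ['3']
```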

XML-Based Tourism Information System Using Mobile Agent under Distributed Environment (이동 에이전트를 이용한 분산환경 하에서의 XML-기반 관광정보시스템)

  • Lee Dong-Cheol;Choi Doug W.
    • Journal of the Korea Institute of Information and Communication Engineering / v.9 no.3 / pp.654-660 / 2005
  • The Internet is comprised of various users with diverse hardware and software platforms. This paper presents a tourism information system that enables stable and reliable transmission of information across dispersed, heterogeneous, and/or mobile platforms. The proposed system adopts XML as the basic document format since it has been accepted by the W3C as the standard for information exchange on the Internet. This paper exploits the characteristics of Java and XML, as they provide software applications independent of the platform. The proposed system also deploys Aglet, a mobile agent developed by IBM, to ensure dynamic and flexible performance of the system over the Internet. The system provides user-oriented search and retrieval of tourism information and also enables the reservation of various services and facilities from mobile devices.

Implementation of an Automatic Tool Generating a XML Document from Database Retrieval (데이터베이스 질의 결과로부터 XML 문서 자동 생성 도구 구현)

  • 조승호;이원진
    • Proceedings of the Korea Multimedia Society Conference / 2003.11a / pp.396-399 / 2003
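  • In this study, we implemented a tool that automatically extracts XML documents from a relational database, using objects as an intermediate medium. The system applies object-relational mapping to map between XML documents and the database, and provides functions such as database configuration, relational-object schema mapping, and XML generation. The results of this study can be used to provide consistent information to users of wired and wireless content by generating XML documents from database contents, and can be useful for information exchange between enterprises.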
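
The pipeline described here (database rows, then intermediate objects, then an XML document) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' tool: the `product` table, the `Product` class, and the helper names are hypothetical, and an in-memory SQLite database stands in for whatever relational database the tool targets.

```python
import sqlite3
import xml.etree.ElementTree as ET
from dataclasses import dataclass, fields


@dataclass
class Product:  # hypothetical object mapped from a hypothetical table
    id: int
    name: str
    price: float


def rows_to_objects(conn):
    """Relational-to-object step: one Product per result row."""
    cur = conn.execute("SELECT id, name, price FROM product ORDER BY id")
    return [Product(*row) for row in cur]


def objects_to_xml(objects, root_tag="products"):
    """Object-to-XML step: one element per object, one child per field."""
    root = ET.Element(root_tag)
    for obj in objects:
        elem = ET.SubElement(root, type(obj).__name__.lower())
        for f in fields(obj):
            ET.SubElement(elem, f.name).text = str(getattr(obj, f.name))
    return ET.tostring(root, encoding="unicode")


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (id INTEGER, name TEXT, price REAL)")
conn.executemany(
    "INSERT INTO product VALUES (?, ?, ?)",
    [(1, "pen", 1.5), (2, "book", 12.0)],
)
xml_text = objects_to_xml(rows_to_objects(conn))
print(xml_text)
```

Keeping the objects as a separate middle layer means the XML generator depends only on the object fields, not on the SQL schema.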

Information Retrieval from XML Documents based on Contents (내용기반 XML 문서의 검색)

  • 김수희;조명찬;한예지
    • Proceedings of the Korean Information Science Society Conference / 2003.10b / pp.73-75 / 2003
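  • For efficient retrieval of XML documents, this study extracts index terms from XML data, assigns weights to them, and builds a content-based index, then presents users with the documents most similar to the query. In this way it aims to extend existing XML document retrieval, which is path-centric or based on pattern matching. We design an XML document retrieval system that supports content-based retrieval and discuss issues related to content-based retrieval. Using a research prototype system that is under development, we briefly present content-based retrieval results for queries.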
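
A content-based index of weighted terms with query-document similarity could be sketched as below. The abstract does not say which weighting or similarity measure the system uses, so TF-IDF weights and cosine similarity are substituted here as a common choice. All function names are hypothetical, and the documents are assumed to be plain text already extracted from XML.

```python
import math
from collections import Counter


def weight(words, idf):
    """TF-IDF weights for the words that are known to the index."""
    tf = Counter(w for w in words if w in idf)
    return {w: count * idf[w] for w, count in tf.items()}


def build_index(docs):
    """Return (idf, vectors) for docs given as {doc_id: text}."""
    tokenized = {d: text.lower().split() for d, text in docs.items()}
    n = len(tokenized)
    df = Counter(w for words in tokenized.values() for w in set(words))
    # Smoothed IDF so that every weight stays positive.
    idf = {w: math.log((1 + n) / (1 + c)) + 1.0 for w, c in df.items()}
    vectors = {d: weight(words, idf) for d, words in tokenized.items()}
    return idf, vectors


def cosine(a, b):
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(query, idf, vectors):
    """Rank documents by similarity to the query, dropping zero scores."""
    q = weight(query.lower().split(), idf)
    scored = [(cosine(q, vec), d) for d, vec in vectors.items()]
    return [d for score, d in sorted(scored, reverse=True) if score > 0]


docs = {
    "d1": "xml index retrieval",
    "d2": "tourism mobile agent",
    "d3": "xml editor",
}
idf, vectors = build_index(docs)
ranking = search("xml retrieval", idf, vectors)
print(ranking)  # ['d1', 'd3']
```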
