Search | Korea Science

Document Filtering Algorithm for Efficient Preprocessing of XML Information Retrieval (XML 정보검색의 효율적 전처리를 위한 문서여과 알고리즘)

Kong Yong-Hae;Kim Myung-Sook
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.6 no.1
- /
- pp.1-11
- /
- 2005
The paper proposes a preprocessing method for efficient processing of XML queries in information retrieval with a large amount of XML documents. The conventional preprocessing methods filter out XML documents by parsing XML document for keyword of query or by comparing query signatures with signatures of XML document to be generated. But these methods are dependent on a query and are very in efficient for a large amount of XML documents. For this, we generate a universal DTD based on ontology of a domain. The universal DTD is applicable to the XML documents when they contain information of a same domain even when they have different structures and attributes. Then, using the universal DTD, we filter out the XML documents that are not bounded in the domain. We evaluate the performance of this method through experiments.
PDF

Design of XML DTDs for Content-based Retrieval of Web Image (웹 이미지 내용 기반 검색을 위한 XML DTD 설계)

김형근;홍성용;나연묵
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10a
- /
- pp.232-234
- /
- 2001
인터넷의 발달과 사용의 확산에 따라 멀티미디어 데이터의 양이 급격히 증가하고 있다. 특히 멀티미디어 정보 가운데에서도 이미지 양은 대규모이므로 사용자가 원하는 이미지를 찾기가 쉽지 않았으며, 이에 따라 이미지 데이타를 검색하기 위한 여러 가지 방법들이 계속해서 제안되고 있다. 본 논문에서는 XML을 활용하여 웹상의 이미지 데이터에 대한 특징 정보를 구조적으로 표현해 웹 이미지에 대한 내용 기반 검색 능력을 개선한다. 관계 테이터베이스에 저장된 색상, 질감, 키워드 등 이미지 데이터에 대한 특징 정보들을 XML 문서로 자동 변환하기 위하여 이들 각각의 대한 DTD를 설계하고, 이들을 통합하여 검색할 수 있도록 통합 DTD를 설계한다. 통합 DTD를 XML 데이터 서버를 이용하여 구현에 실제 웹 상의 상품이미지를 검색하는데 적용함으로써 제안한 결과의 유용성을 보인다.
PDF

XML-based Modeling for Semantic Retrieval of Syslog Data (Syslog 데이터의 의미론적 검색을 위한 XML 기반의 모델링)

Lee Seok-Joon;Shin Dong-Cheon;Park Sei-Kwon
- The KIPS Transactions:PartD
- /
- v.13D no.2 s.105
- /
- pp.147-156
- /
- 2006
Event logging plays increasingly an important role in system and network management, and syslog is a de-facto standard for logging system events. However, due to the semi-structured features of Common Log Format data most studies on log analysis focus on the frequent patterns. The extensible Markup Language can provide a nice representation scheme for structure and search of formatted data found in syslog messages. However, previous XML-formatted schemes and applications for system logging are not suitable for semantic approach such as ranking based search or similarity measurement for log data. In this paper, based on ranked keyword search techniques over XML document, we propose an XML tree structure through a new data modeling approach for syslog data. Finally, we show suitability of proposed structure for semantic retrieval.
https://doi.org/10.3745/KIPSTD.2006.13D.2.147 인용 PDF KSCI

A Signature Method for Efficient Preprocessing of XML Queries (XML 질의의 효율적인 전처리를 위한 시그너처 방법)

정연돈;김종욱;김명호
- Journal of KIISE:Databases
- /
- v.30 no.5
- /
- pp.532-539
- /
- 2003
The paper proposes a pre-processing method for efficient processing of XML queries in information retrieval systems with a large amount of XML documents. For the pre-processing, we use a signature-based approach. In the conventional (flat document-based) information retrieval systems, user queries consist of keywords and boolean operators, and thus signatures are structured in a flat manner. However, in XML-based information retrieval systems, the user queries have the form of path query. Therefore, the flat signature cannot be effective for XML documents. In the paper, we propose a structured signature for XML documents. Through experiments, we evaluate the performance of the proposed method.
PDF KSCI

Lesson Plan System for Teacher-Student Based on XML (XML 기반 교수-학생 학습지도 시스템)

최문경;김지영;김행곤
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.10d
- /
- pp.406-408
- /
- 2002
컴퓨터 기술의 발전과 네트워크의 급속한 확산으로 사회전반에 걸쳐 특허, 기업뿐 아니라 교육 현장의 효율화를 지원하기 위한 분야에서도 웹이 응용되고 있다. 교육 현장에서 작성되어지고 있는 문서 중 학습 지도안 작성은 교육 정보의 체계적인 제공이 미흡하고, 많은 시간과 노력이 요구되는 활동이므로 교수 개인이 모든 교수 활동에 필요한 지도안을 작성하는데는 어려움이 있다. 이를 위해, 웹에서 정보를 공유하여 문서의 재사용성을 높일 수 있는 시스템이 필요하게 되었다. 웹에서 표준화된 XML을 이용하여 문서의 생성과 검색, 그리고 재사용이 가능하도록 제공함으로써 교수자의 다양한 요구사항을 융통성 있게 수용할 수 있다. 본 논문에서는 학습지도안 시스템을 분석하여 공통DTD(Document Type Definition)를 생성하고 공통 DTD를 통해 표준화된 XML 문서를 제공한다. 좀더 효율적인 수업을 위해 학습지도안 작성이 용이하도록 학습지도안 작성용 에디터를 제공하며, 또한 XML DOM(Document Object Model)을 이용하여 검색기에서는 구조기반, 패싯, 키워드 검색 방법을 제시하고, 등록기에서는 DOM을 이용하여 해당 데이터를 추출하고 DB에 등록한다. 이는 문서의 재사용성을 높일 수 있다. 따라서, XML을 학교 현장에서 이용함으로써 웹에서 정보의 공유를 원활히 하고, 문서 작성의 효율성을 높이고자 한다.
PDF

An Efficient Keyword Search Method on RDF Data (RDF 데이타에 대한 효율적인 검색 기법)

Kim, Jin-Ha;Song, In-Chul;Kim, Myoung-Ho
- Journal of KIISE:Databases
- /
- v.35 no.6
- /
- pp.495-504
- /
- 2008
Recently, there has been much work on supporting keyword search not only for text documents, but a]so for structured data such as relational data, XML data, and RDF data. In this paper, we propose an efficient keyword search method for RDF data. The proposed method first groups related nodes and edges in RDF data graphs to reduce data sizes for efficient keyword search and to allow relevant information to be returned together in the query answers. The proposed method also utilizes the semantics in RDF data to measure the relevancy of nodes and edges with respect to keywords for search result ranking. The experimental results based on real RDF data show that the proposed method reduces RDF data about in half and is at most 5 times faster than the previous methods.
PDF KSCI

Service-centric Object Fragmentation Model for Efficient Retrieval and Management of XML Documents (XML 문서의 효율적인 검색과 관리를 위한 SCOF 모델)

Jeong, Chang-Hoo
- Proceedings of the Korea Contents Association Conference
- /
- 2007.11a
- /
- pp.595-598
- /
- 2007
Vast amount of XML documents raise interests in how they will be used and how far their usage can be expanded. This paper has two central goals: 1) easy and fast retrieval of XML documents or relevant elements; and 2) efficient and stable management of large-size XML documents. The keys to develop such a practical system are how to segment a large XML document to smaller fragments and how to store them. In order to achieve these goals, we designed SCOF(Service-centric Object Fragmentation) model, which is a semi-decomposition method based on conversion rules provided by XML database managers. Keyword-based search using SCOF model then retrieves the specific elements or attributes of XML documents, just as typical XML query language does. Even though this approach needs the wisdom of managers in XML document collection, SCOF model makes it efficient both retrieval and management of massive XML documents.
PDF

Classification and Retrieval of XML Document for Teacher Support System based on Web (웹 기반의 교수 지원 시스템을 위한 XML 문서의 분류 및 검색)

Kim, Haeng-Kon;Kim, Ji-Young;Choi, Mun-Kyoung;Kim, Soung-Won
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.10b
- /
- pp.1615-1618
- /
- 2001
최근 인터넷이 급속히 성장함에 따라 웹을 기반으로 한 학습이 활발히 진행되고 있고, 또한 학교 업무의 효율화를 지원하기 위한 분야에서도 웹이 응용되고 있다. 특히 웹에서 교수를 위한 복잡한 학교 업무의 관리와 학습자료 및 업무 자료를 지원하기 위해서는 확장성과 호환성, 편의성을 가진 XML 형태의 문서가 제공되어져야 한다. 따라서 교수 업무 지원을 위해 XML 문서의 정보들을 효율적이고 정확하게 이용하기 위해 이들 문서를 적절하게 분류하고 저장, 검색하기 위한 방법이 필요하다. 본 논문에서는 XML로 작성된 교수 업무 지원 문서의 저장과 검색을 위한 선행작업으로서, 일반적인 메타 데이터와 DTD 데이터를 정의하고, 이렇게 정의된 데이터를 이용하여 패싯 검색과 구조기반 검색, 키워드 검색을 제공함으로써 사용자는 원하는 문서를 쉽게 검색한 수 있다. 따라서 이를 통해 교수 업무 지원 문서들을 웹 상에서 효율적이고 정확하게 저장하며, 사용자가 원하는 문서를 정확하고 신속하게 검색할 수 있게 하고자 한다.
PDF

XML Based Multimedia Retrieval System supporting Scene Search (장면 검색을 지원하는 XML 기반 멀티미디어 검색 시스템)

Joung, Mi-Ra;Hwang, Bu-Hyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2001.10a
- /
- pp.133-136
- /
- 2001
오디오 비디오 데이터의 활용이 증가함에 따라 멀티미디어 데이터의 내용에 대해 표현하려는 연구와 함께 멀티미디어 데이터의 내용이나 메타데이터를 저장하고, 검색하고, 조작하는 연구의 필요성이 증가하였다. 멀티미디어 데이터의 표현은 사용자가 원하는 내용만을 쉽게 검색하고, 접근한 수 있도록 표현되고 저장되어야 한다. 그러나 기존의 멀티미디어 검색 시스템들은 특정 객체에 중점을 두고 색상, 위치, 모양 등의 정보를 가지고 유사 객체를 찾는 방식을 취하고 있으므로 특정 사건이나 구체적인 인물 정보나 에피소드의 정보를 검색하고자 한 때는 키워드에 의한 검색을 해야하므로 불필요한 정보가 다량으로 검색되며 여러 번의 검색이 이루어져야 하는 단점이 있다. 또한 일반 사용자들은 주로 특정 장면에서 특정 객체의 특징이나 행동, 장소, 사건 등의 정보에 대해 관심을 갖고, 이에 따른 질의를 하는 경향이 있다. 따라서 본 논문에서는 "장면"이라는 계층 구조에 중점을 두고 멀티미디어 데이터의 내용 정보와 구조 정보를 표현 및 저장을 하며, 사용자는 특정 사건이나 객체들의 특징 정보를 가지고 장면이나 전체 구조를 검색찬 수 있는 시스템을 설계하고 구현한다. 멀티미디어 데이터의 표현 및 저장 검색의 모든 과정은 데이터의 재사용성과 접근 용이성을 위해 XML을 기반으로 하여 처리된다. 이렇게 XML로 표현된 데이터는 사용자들에게 구조 정보나 내용 정보에 있어서 다양한 검색 결과를 제공할 수 있는 장점이 있다.
PDF

Semantic based Research-Paper Searching System (시맨틱을 이용한 연구 논문 검색 시스템)

Kim Young-Min;Lee Sang-Joon
- Journal of Internet Computing and Services
- /
- v.4 no.3
- /
- pp.15-22
- /
- 2003
Many information storage systems, such as database system, were needed to integrate much informations into one system and to provide mare voluminous lump of informations. But as the size of information system becomes larger, the responded result size of existing keyword based searching system might be too large and couldn't do the exact search which the user intends to. In this study, we proposed a paper searching method which uses RDF semantic. For this, we analyzed the structural forms of the tit1e of research papers and reconstructed them into RDF/XML. When we use these RDF descriptions of the titles to search papers, we could get more precise and accurate results than keyword based searching method.
PDF

Search Result 53, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)