• Title/Summary/Keyword: XML Keyword Retrieval


Service-centric Object Fragmentation Model for Efficient Retrieval and Management of Huge XML Documents (대용량 XML 문서의 효율적인 검색과 관리를 위한 SCOF 모델)

  • Jeong, Chang-Hoo;Choi, Yun-Soo;Jin, Du-Seok;Kim, Jin-Suk;Yoon, Hwa-Mook
    • Journal of Internet Computing and Services, v.9 no.1, pp.103-113, 2008
  • The vast number of XML documents raises interest in how they will be used and how far their usage can be expanded. This paper has two central goals: 1) easy and fast retrieval of XML documents or relevant elements; and 2) efficient and stable management of large XML documents. The keys to developing such a practical system are how to segment a large XML document into smaller fragments and how to store them. To achieve these goals, we designed the SCOF (Service-centric Object Fragmentation) model, a semi-decomposition method based on conversion rules provided by XML database managers. Keyword-based search using the SCOF model then retrieves specific elements or attributes of XML documents, just as a typical XML query language does. Although this approach relies on the expertise of the managers of the XML document collection, the SCOF model makes both retrieval and management of massive XML documents efficient.


XML Fulltext Retrieval System by Extracting Navigation Information (네비게이션 정보추출에 의한 XML 본문검색시스템)

  • 강남규;이응봉;이석형
    • Journal of the Korean Society for Information Management, v.19 no.3, pp.91-110, 2002
  • Recently, retrieval based on structured documents has been studied to overcome the limits of keyword-based retrieval systems. However, structured retrieval systems are hard to adapt to a real service. In this paper, we propose a retrieval mechanism for the full text of XML documents. We explain the DTD of XML-based reports, the extraction of navigation information, and a plan to adapt the retrieval system for article retrieval. Using the full-text retrieval scheme, the suggested system can be an alternative to specialized structure-based retrieval systems.

Development of a XML Web Services Retrieval Engine (XML 웹 서비스 검색 엔진의 개발)

  • Sohn, Seung-Beom;Oh, Il-Jin;Hwang, Yun-Young;Lee, Kyong-Ha;Lee, Kyu-Chul
    • Journal of Information Technology Applications and Management, v.13 no.4, pp.121-140, 2006
  • A UDDI (Universal Description, Discovery and Integration) registry is used for Web service registration and search. UDDI returns search results for keyword-based queries. It supports WSDL registration but does not support WSDL search. Content-based search and ranking that use the names and descriptions in both the UDDI registration information and the WSDL are therefore required. This paper proposes a retrieval engine that considers the contents of services registered in UDDI and WSDL. It uses the Vector Space Model to compare the similarity of those contents. The UDDI registry information hierarchy and the WSDL hierarchy are considered during the search process. The engine supports two discovery methods: keyword-based search, and template-based search that supports ranking for the user's query. Template-based search shows how service interfaces correspond to the query for WSDL documents. The proposed retrieval engine can return more accurate search results than UDDI itself, and it can retrieve in detail the WSDL documents registered in UDDI.
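
The Vector Space Model mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' engine: it assumes each service is reduced to a flat text description, uses a smoothed TF-IDF weighting chosen only for illustration, and the names `tokenize`, `tfidf_vectors`, `cosine`, and `rank` are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; real engines do more)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return sparse TF-IDF vectors (dicts) for docs, plus the IDF table."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF, an illustrative choice rather than the paper's formula.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(toks).items()}
               for toks in tokenized]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs):
    """Return (doc_index, score) pairs, best match first."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms unseen in the collection get weight 0.
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(tokenize(query)).items()}
    scored = [(i, cosine(q, v)) for i, v in enumerate(vectors)]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


if __name__ == "__main__":
    # Hypothetical service descriptions, not data from the paper.
    services = [
        "weather forecast service returns temperature",
        "currency exchange rate service",
        "stock quote lookup",
    ]
    print(rank("currency exchange", services))
```

Unlike a plain keyword match, this scoring yields an ordering of all services, which is the kind of ranking the abstract says a UDDI keyword query lacks.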


Ranked Web Service Retrieval by Keyword Search (키워드 질의를 이용한 순위화된 웹 서비스 검색 기법)

  • Lee, Kyong-Ha;Lee, Kyu-Chul;Kim, Kyong-Ok
    • The Journal of Society for e-Business Studies, v.13 no.2, pp.213-223, 2008
  • The efficient discovery of services from a large-scale collection of services has become an important issue [7, 24]. We studied a syntactic method for Web service discovery rather than a semantic one. We regarded service discovery as a retrieval problem over the proprietary XML formats of the service descriptions in a registry DB. We modeled services and queries as probabilistic values and devised similarity-based retrieval techniques. The benefits of our approach are as follows. First, our system supports ranked service retrieval by keyword search. Second, we consider both the UDDI data and the WSDL definitions of services at query evaluation time. Last, our technique can be easily implemented on an off-the-shelf DBMS and can also use the DBMS's maintenance features.


A Study for the Effective Classification and Retrieval of Software Component (효과적인 소프트웨어 컴포넌트 분류 및 검색에 관한 연구)

  • Cho, Byung-Ho
    • Journal of Internet Computing and Services, v.7 no.6, pp.1-10, 2006
  • Software development based on component reuse is a useful way to reduce development cost. However, retrieval by keyword and category classification makes it difficult to find an exactly matching component because of the complexity of components. Therefore, after examining and analyzing existing methods, we present an effective classification and retrieval method using XML specifications, together with a system architecture for integrated component management based on it. In component retrieval, many elements of the DTD, which is the meta-expression of a component, do not match. To compensate for this, the retrieval method uses estimates of precision and concision, which is effective for finding components with a considerable matching preference. The method makes it possible to retrieve suitable components with better priority by searching for similar matching components, which is difficult with existing keyword matching.


XML-based Modeling for Semantic Retrieval of Syslog Data (Syslog 데이터의 의미론적 검색을 위한 XML 기반의 모델링)

  • Lee Seok-Joon;Shin Dong-Cheon;Park Sei-Kwon
    • The KIPS Transactions:PartD, v.13D no.2 s.105, pp.147-156, 2006
  • Event logging plays an increasingly important role in system and network management, and syslog is a de facto standard for logging system events. However, due to the semi-structured nature of Common Log Format data, most studies on log analysis focus on frequent patterns. The Extensible Markup Language (XML) can provide a good representation scheme for structuring and searching the formatted data found in syslog messages. However, previous XML-formatted schemes and applications for system logging are not suitable for semantic approaches such as ranking-based search or similarity measurement over log data. In this paper, based on ranked keyword search techniques over XML documents, we propose an XML tree structure for syslog data through a new data modeling approach. Finally, we show the suitability of the proposed structure for semantic retrieval.

The Path Inverted Index Technique for XML Document Retrieval (XML 문서 검색을 위한 경로 역 색인 기법)

  • Moon, Kyung-Won;Hwang, Byung-Yeon
    • The KIPS Transactions:PartD, v.17D no.2, pp.103-110, 2010
  • Recently, many XML document management systems that build on the advantages of RDBMSs have been actively developed for the storage, processing, and retrieval of XML documents. However, partial pattern-matching queries such as LIKE operations cannot take advantage of RDBMS indexes, and their inefficient comparison processing degrades retrieval performance. This paper proposes a hierarchical XML storage technique, which stores XML documents in an RDBMS efficiently, and the path inverted index technique. The latter regards each element of an XML document as a keyword and organizes a posting file with path identifiers and sequences to reduce the retrieval time of path-based queries. In simulations, our methods performed about 60% better in search than the conventional RDBMS-based method.
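
As a rough illustration of the idea in this abstract, the sketch below treats each element name as a keyword and stores postings of (path identifier, sequence) pairs. The abstract does not give the paper's actual posting-file layout, so the in-memory dictionaries and the names `build_path_index` and `lookup` are assumptions made only for this example.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_path_index(xml_text):
    """Build a toy path inverted index for one XML document.

    Returns:
      path_ids: maps each distinct root-to-element path to an integer ID.
      postings: maps each element name to a list of (path_id, sequence),
                where sequence is the element's position in document order.
    """
    root = ET.fromstring(xml_text)
    path_ids = {}
    postings = defaultdict(list)
    seq = 0

    def walk(elem, prefix):
        nonlocal seq
        path = prefix + "/" + elem.tag
        # Reuse the ID if this path was seen before, else assign the next one.
        pid = path_ids.setdefault(path, len(path_ids) + 1)
        seq += 1
        postings[elem.tag].append((pid, seq))
        for child in elem:
            walk(child, path)

    walk(root, "")
    return path_ids, postings


def lookup(path_ids, postings, path):
    """Return the sequences of all elements reached by an exact path."""
    pid = path_ids.get(path)
    if pid is None:
        return []
    tag = path.rsplit("/", 1)[-1]
    return [s for p, s in postings.get(tag, []) if p == pid]


if __name__ == "__main__":
    # Hypothetical document, not data from the paper.
    doc = "<lib><book><title>A</title></book><book><title>B</title></book></lib>"
    ids, post = build_path_index(doc)
    print(lookup(ids, post, "/lib/book/title"))
```

A path query then becomes an exact lookup of a path identifier followed by a scan of one element's postings, rather than a string comparison against every stored path, which is the cost the abstract attributes to LIKE-style matching.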

Document Filtering Algorithm for Efficient Preprocessing of XML Information Retrieval (XML 정보검색의 효율적 전처리를 위한 문서여과 알고리즘)

  • Kong Yong-Hae;Kim Myung-Sook
    • Journal of the Korea Academia-Industrial Cooperation Society, v.6 no.1, pp.1-11, 2005
  • This paper proposes a preprocessing method for efficiently processing XML queries in information retrieval over a large number of XML documents. Conventional preprocessing methods filter out XML documents either by parsing each document for the query keywords or by comparing query signatures with signatures generated from the XML documents. However, these methods depend on the query and are very inefficient for a large number of XML documents. To address this, we generate a universal DTD based on the ontology of a domain. The universal DTD is applicable to XML documents that contain information from the same domain, even when they have different structures and attributes. Using the universal DTD, we then filter out the XML documents that do not belong to the domain. We evaluate the performance of this method through experiments.


An Implementation of Retrieval System for Medical Image Management (의료영상 관리를 위한 검색시스템 구현)

  • Kim, Kyung Soo
    • Journal of Korea Society of Digital Industry and Information Management, v.5 no.4, pp.61-67, 2009
  • PACS and medical image systems currently use only high-level metadata to retrieve a desired image. To retrieve medical image data more efficiently, similarity retrieval using low-level metadata is needed in addition to keyword retrieval using high-level metadata. This paper therefore presents an implementation of similarity retrieval using low-level metadata based on MPEG-7 and keyword retrieval using high-level metadata based on DICOM. By integrating the two kinds of metadata, medical image data can be examined in various ways and the right image can be read promptly for diagnosis and treatment.

XML-based Retrieval System for SCORM-based Virtual Learning Contents (SCORM 기반의 XML 학습 컨텐츠 검색 시스템)

  • Choi, Byung-Uk;Song, Mi-Sook;Cho, Jung-Won
    • The Journal of Korean Association of Computer Education, v.6 no.1, pp.9-17, 2003
  • XML (eXtensible Markup Language), a next-generation Internet standard language, is easy to reuse and restructure in other computing environments because it separates data, presentation, and structure. In this paper, we implement an efficient retrieval system for general users by restricting the XML documents to multimedia learning contents for a virtual education system. The system design is based on the SCO metadata unit defined in SCORM, a proposed virtual education standard. Each XML document has three indexes: keyword, element, and attribute. The system also makes it possible to retrieve data without prior knowledge of the DTD by providing an element retrieval screen in the user interface. It gives the user result screens in various formats such as XML and HTML by restructuring the retrieval result through XML-QL and XSL, respectively.
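
The three indexes named in this abstract (keyword, element, and attribute) might be sketched as follows. This is a simplified assumption about what such indexes hold, not the system described in the paper; `build_indexes` and the sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_indexes(docs):
    """Build keyword, element, and attribute indexes over XML documents.

    docs maps a document ID to its XML text. Each index maps a term
    (word, element name, or attribute name) to the set of document IDs
    in which it occurs.
    """
    keyword_idx = defaultdict(set)
    element_idx = defaultdict(set)
    attribute_idx = defaultdict(set)
    for doc_id, xml_text in docs.items():
        for elem in ET.fromstring(xml_text).iter():
            element_idx[elem.tag].add(doc_id)
            for name in elem.attrib:
                attribute_idx[name].add(doc_id)
            # Only leading element text is indexed; tail text is ignored
            # to keep the sketch short.
            for word in (elem.text or "").lower().split():
                keyword_idx[word].add(doc_id)
    return keyword_idx, element_idx, attribute_idx


if __name__ == "__main__":
    # Hypothetical learning-content metadata, not data from the paper.
    docs = {
        "d1": '<sco id="1"><title>Intro to XML</title></sco>',
        "d2": "<sco><keyword>SCORM basics</keyword></sco>",
    }
    kw, el, at = build_indexes(docs)
    print(sorted(el))  # element names a search form could list
```

Listing the keys of the element index is one way a user interface could offer element names for searching without the user knowing the DTD in advance, which is the capability the abstract describes.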
