XML Vicw Indexing

XML 뷰 인덱싱

  • 김영성 (중앙대학교 컴퓨터공학과) ;
  • 강현철 (중앙대학교 컴퓨터공학과)
  • Published : 2003.06.01

Abstract

The view mechanism provides users with appropriate portions of database through data filtering and integration. In the Web era where information proliferates, the view concept is also useful for XML, a future standard for data exchange on the Web. This paper proposes a method of implementing XML views called XML view indexing, whereby XML view xv is represented as an XML view index(XVI) which is a structure containing the identifiers of xv's underlying XML elements as well as the information on xv. Since XVI for xv stores just the identifiers of the XML elements but not the elements themselves, when a user requests to retrieve xv, its XVI should be materialized against xv's underlying XML documents. Also an efficient algorithm to incrementally maintain consistency of XVI given a update of xv's underlying XML documents is required. This paper proposes and implements data structures and algorithms for XML view indexing. The performance experiments on XML view indexing reveal that it outperforms view recomputation for repeated accesses to the view, and requires as much as about 30 times less storage space compared to XML view materialization though the latter takes less time for repeated accesses to the view due to no need of materialization.

뷰는 이질적인 데이타의 통합 및 여과(filtering) 기능을 통해서 데이타베이스의 필요한 부분을 제공한다. 많은 정보가 쏟아지고 있는 웹 환경의 데이타 교환 표준인 XML에 대해서도 뷰의 개념은 유용하다. 본 논문은 XML 뷰 인덱싱이라고 명명한 XML 뷰를 구현하는 기법을 제안한다. XML 뷰는 XML 뷰에 대한 정보와 더불어 뷰를 구성하는 하부 XML 엘리먼트에 대한 식별자를 저장하는 구조인 XML 뷰 인덱스로 표현된다. XML 뷰 인덱스가 XML 엘리먼트 자체가 아닌 식별자만을 저장하므로, 사용자가 XML 뷰를 요청하면 하부 XML 문서를 기반으로 XML 뷰를 실체화해야 한다. 또한, 하부 XML 문서에 대한 변경에 대하여 XML 뷰 인덱스의 일관성을 유지하기 위한 효율적인 점진적 갱신 기법이 필요하다. 본 논문에서는 XML 뷰 인덱싱을 위한 자료구조와 알고리즘을 제안하고 구현하였다. 성능 평가 결과 XML 뷰 인덱싱을 사용하는 것이 매번 뷰를 재생성하는 경우보다 질의 재수행 시간이 적게 걸렸다. XML 뷰 인덱싱 기법이 실체화 시간으로 인해 XML 실체뷰 기법보다 질의 재수행 시간은 많이 걸리지만, 저장 공간 면에서는 약 30배 정도 효율적인 것으로 나타났다.

Keywords

References

  1. S. Abiteboul, 'On Views and XML,' Proc. ACM Symp, on Principles of Database System, 1999, pp. 1-9
  2. S. Abiteboul et al., 'Active Views for Electronic Commerce,' Proc. Int'l Conf. on VLDB, 1999, pp. 138-149
  3. S. Kim and H. Kang, 'XML Query Processing Using Materialized Views,' Proc, Int 'I Conf. on Internet Computing, Jun. 2001, pp. 111-117
  4. Y. Papakonstantinou and V. Vianu, 'DTD Inference for Views of XML Data,' Proc. of 19th ACM SIGACT-SIGMOD-SIGART Symp. on PODS, 2000
  5. Lucie Xyleme, 'A Dynamic Warehouse for XML Data of the Web,' IEEE Data Eng. Bulletin, Vol. 24, No.2, Jun. 2001, pp. 40-47
  6. httpv/www.xylerne.com
  7. M. Fernandez et al., 'Silkltoute: Trading between Relations and XML,' Proc, the 9-th WWW Conf., 2000, pp. 723-746
  8. J. Shanmugasundaram et al., 'Efficiently Publishing Relational Data as XML Documents,' Proc. Int'l Conf. on VLDB, 2000, pp. 65-76
  9. J. Chen et al., 'NiagaraCQ: A Scalable Continous Query System for Internet Databases,' Proc, ACM SIGMOD Int'l Conf. on Management of Data, 2000, pp. 379-390
  10. M. Stonebraker, 'Implementation of Integrity Constraints and Views by Query Modification,' Proc, ACM SIGMOD Int'l Conf. on Management of Data, 1975, pp, 65-78 https://doi.org/10.1145/500080.500091
  11. A. Gupta and I. Mumick, 'Materialized Views: Techniques, Implementations, and Applications,' MIT Press, 1999
  12. N. Roussopoulos, 'An Incremental Access Method for ViewCache: Concept, Algorithms, and Cost Analysis,' ACM Trans. on Database Systems, Vol. 16, No.3, Sep. 1991, pp, 535-563 https://doi.org/10.1145/111197.111215
  13. P. Valduriez, 'Join Indices,' ACM Trans. on Database Systems, Vol. 12, No.2, Jun. 1987, pp. 218-246 https://doi.org/10.1145/22952.22955
  14. S. Abiteboul et al., 'Incremental Maintenance for Materialized Views over Semistructured Data,' Proc. Int'l Conf. on VLDB, 1998, pp. 38-49
  15. D. Suciu, 'Query Decomposition and View Maintenance for Query Languages for Unstructured Data,' Proc. Int'l Conf. on VLDB, 1996, pp. 227-238
  16. Y. Zhuge and H Garcia-Molina, 'Graph Structured Views and Their Incremental Maintenance,' Proc. Int'l Conf.. on Data Engineering, 1998, pp, 116-125 https://doi.org/10.1109/ICDE.1998.655767
  17. P. Buneman et al, 'Programming Constructs for Unstructured Data,' Proc. DBPL, 1995
  18. R. Cattell et al., 'The Object Database Standard: ODMG-93,' Morgan Kaufmann, 1994
  19. Y. Papakonstantinou et al., 'Object Exchange across Heterogeneous Information Sources,' Proc. Int'l Conf. on Data Engineering, 1995, pp. 251-260 https://doi.org/10.1109/ICDE.1995.380386
  20. J. McHugh et al., 'Lore: A database Management System for Semistructured Data,' SIGMOD Record, Vol. 26, No.3, Sep. 1997, pp. 54-66 https://doi.org/10.1145/262762.262770
  21. S. Abiteboul et al, 'The Lorel Query Language for Semistructured Data,' J of Digital Libraries, Vol. 1, No. 1, Nov. 1996 https://doi.org/10.1007/s007990050005
  22. R. Goldman and J Widom, 'DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases,' Proc. VLDB, 1997, pp. 436-445
  23. J McHugh et al., 'Indexing Semistructured Data,' Tech. Report, Dept. of Computer Science, Stanford Univ., 1998
  24. T. Milo and D. Suciu, 'Index Structures for Path Expressions,' Proc. ICDT, 1999
  25. V. Apparao et al., 'Document Object Model Levell (Second Edition) (W3C Working Draft),' http://www.w3.org/TR/REC-DOM-Level-1/, 1998
  26. D. Chamberlin et al., 'XQuery 1.0: An XML Query Language,' http://www.w3.org/R/xquery, 2001
  27. D. Chamberlin et al., 'Quilt: an XML Query Language for Heterogeneous Data Sources,' In Lecture Notes in Computer Science, Springer-Verlag, Dec. 2000
  28. World Wide Web Consortium. 'XML Path Language (Xf'ath) Version 2.0,' W3C Working Draft, December, 2001. See http://www.w3.org/TR/xpath20/
  29. J. Robie et al., 'XML Query Lanaguage(XQL),' http://www.w3.org/TandS/QL/QL98/pp/xql.html, 1998
  30. A. Deutsch et al., 'XML-QL: A Query Language for XML,' http://www.w3.org/TR/1998/NOTE-xml-ql/, 1998
  31. SourceForge, 'Kweelt,' http://kweelt.sourceforge.net/, 2000