A Tuning Algorithm for the Multidimensional Type Inheritance Index of XML Databases

  • Jong-Hak Lee (School of Computer and Information Communication Engineering, Catholic University of Daegu)
  • Received : 2010.08.09
  • Accepted : 2010.12.21
  • Published : 2011.02.28

Abstract

This paper presents an index tuning algorithm for the MD-TIX (multidimensional type inheritance index), which supports query processing over type inheritance in XML databases. The algorithm improves XML query processing performance by adapting the index to the query pattern. The MD-TIX uses a multidimensional index structure to support complex XML queries involving both nested elements and type inheritance hierarchies. The tuning algorithm first determines the shape of the index page regions from information about the user's query pattern, and then constructs an optimal MD-TIX by applying a region splitting strategy that drives the page regions toward that predetermined shape. The performance evaluation indicates that the proposed algorithm builds an optimal MD-TIX for a given query pattern. For three-dimensional query regions, which arise from nested predicates of path length 2, query processing performance improves substantially, with the gain depending on how skewed the shape of the query region is.
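The two steps described in the abstract (deriving a page-region shape from the query pattern, then splitting regions toward that shape) can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes that the target shape is simply the average side length of the observed query regions in each dimension, and that each split halves the region along the axis that is currently longest relative to that target. The function names `target_shape`, `choose_split_axis`, and `split_toward_target` are hypothetical and introduced only for illustration.

```python
def target_shape(query_regions):
    """Average side length per dimension over the observed query regions.

    Assumption: the desired page-region shape is proportional to this average.
    """
    dims = len(query_regions[0])
    count = len(query_regions)
    return [sum(q[d] for q in query_regions) / count for d in range(dims)]


def choose_split_axis(region_sides, target):
    """Return the axis whose side is longest relative to the target shape."""
    ratios = [side / t for side, t in zip(region_sides, target)]
    return max(range(len(ratios)), key=lambda d: ratios[d])


def split_toward_target(region_sides, target, n_splits):
    """Halve the region n_splits times, each time along the chosen axis."""
    sides = list(region_sides)
    for _ in range(n_splits):
        axis = choose_split_axis(sides, target)
        sides[axis] /= 2  # a binary split halves the interval on that axis
    return sides


# Hypothetical query pattern: three-dimensional query regions that are
# four times longer in the first dimension than in the other two.
queries = [(0.5, 0.125, 0.125), (0.5, 0.125, 0.125)]
shape = target_shape(queries)
page_region = split_toward_target([1.0, 1.0, 1.0], shape, 4)
print(page_region)
```

In this sketch the unit cube is never split along the first dimension, because the skewed query pattern makes the other two dimensions relatively too long at every step, so the resulting page region has the same 4:1:1 proportions as the query regions.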
