An Efficient BitmapInvert Index based on Relative Position Coordinate for Retrieval of XML documents

효율적인 XML검색을 위한 상대 위치 좌표 기반의 BitmapInvert Index 기법

  • 김택곤 (광운대학교 컴퓨터과학과) ;
  • 김우생 (광운대학교 컴퓨터과학과)
  • Published : 2006.01.01

Abstract

Recently, a lot of index techniques for storing and querying XML document have been studied so far and many researches of them used coordinate-based methods. But update operation and query processing to express structural relations among elements, attributes and texts make a large burden. In this paper, we propose an efficient BitmapInvert index technique based on Relative Position Coordinate (RPC). RPC has good preformance even if there are frequent update operations because it represents relationship among parent node and left, right sibling nodes. BitmapInvert index supports tort query with bitwise operations and does not casue serious performance degradations on update operations using PostUpdate algerian. Overall, the performance could be improved by reduction of the number of times for traversing nodes.

최근 XML 문서의 저장 및 관리에 대한 연구가 활발히 이루어지고 있다. XML 문서를 위한 현재까지 연구된 대부분의 색인기법의 경우 절대좌표를 이용하여 표현하는 방법을 사용하므로 갱신연산이 커다란 부담으로 받아들여지고 있다. 본 논문에서는 이를 개선하기 위하여 상대위치좌표에 기반한 BitmapInvert 색인 기법을 제안하였다. 상대위치좌표는 부모 노드와 좌, 우 형제 노드들 간의 관계만을 표현하므로 갱신연산지 자주 발생하더라도 좋은 성능을 보인다. BitmapInvert 색인 기법은 bitwise 연산을 지원하는 텍스트 검색을 지원하고, PostUpdate 알고리즘을 적용하여 갱신에 따른 성능 저하를 줄이도록 하였다. 따라서, 제안하는 기법은 검색이나 갱신에 대해서 접근하는 노드를 줄임으로써 전체적인 성능을 향상시킬 수 있다.

Keywords

References

  1. ISO, 'Information Processing-Text and Office System-Standard Generalized Markup Language (SGML),' ISO/IEC 8879, Oct. 15, 1986
  2. T. Bray, et al, 'Extensible Markup Language (XML) 1.0 (Second Edition),' http://www.w3.org/TR/2000/REC-xml-20001006
  3. Ron Sacks-Davis, Tuong Dao, James A. Thom, and Justin Zobel, 'Indexing documents for queries on structure, content and attributes,' In International Symposium on Digital Media Information Base (DMIB'97), Nov. 1997
  4. Charles L. A. Clarke, G. V. Cormack, F. J. Burkowski, 'An algebra for structured text search and a framework for its implementation,' The Computer Journal, 38(1), pp.43-56, 1995 https://doi.org/10.1093/comjnl/38.1.43
  5. Dao Dinh Kha, Masatoshi Yoshikawa, Shunsuke Uernura, 'An XML indexing structure with relative region coordinate,' Proceedings. 17th International Conference on Data Engineering (ICDE'2001), April 2001 https://doi.org/10.1109/ICDE.2001.914843
  6. Jong P. Yoon, Vijay Raghavan, Venu Chakilam, 'BitCube : a three-dimensional bitmap indexing for XML documents,' Proceedings. Thirteenth International Conference on Scientific and Statistical Database Management (SDBM'2001), July 2001 https://doi.org/10.1109/SSDM.2001.938548
  7. J. Yoon, V. Rahgavan, and V. Chakilam, 'BitCube: Clustering and Statistical Analysis for XML Documents,' 13th International Conference on Scientific and Statistical Database Management, Virginia, July 2001
  8. C. Cheng, J. Naughton, D. DeWitt, Q. Luo, and G. Lohman, 'On supporting containment queries in relational database management system', ACM SIGMOD, pp.425-436, 2001 https://doi.org/10.1145/376284.375722
  9. Hector Garcia-Molina, Jeffrey D.Ullman, Jennifer Wiom, 'Database Systems: The Complete Book', Prentice Hall
  10. Silberschatz, Korth, Sudarshan, 'Database System Concepts 4th Edition', Mc Graw Hill
  11. Jungsuk Song, Woosaeng Kim, 'Extensible index technique for storing and retrieving XML documents', The 4th International Conference on Computer and Information Technology, pp 280-287, Sep. 2004 https://doi.org/10.1109/CIT.2004.1357209
  12. Tackgon Kim, Woosaeng Kim, 'A XML Index Technique with Relative Position Coordinate for Storing and Retrieving XML documents', International Tecnical Conference on Cirrcuits/Systems, Computers and Communications (ITC-CSCC), 20th Conference, vol 2, pp.787-788, July 2005
  13. Igor Tatarinov, Zachary G. Ives, Alan Y. Halevy, Daniel S. Weld, 'Updating XML', ACM SIGMOD, pp413-424, May 2001 https://doi.org/10.1145/376284.375720