Spatial Join based on the Transform-Space View

변환공간 뷰를 기반으로한 공간 조인

  • 이민재 (한국과학기술원 전자전산학과) ;
  • 한욱신 (경북대학교 컴퓨터공학과) ;
  • 황규영 (한국과학기술원 전자전산학과)
  • Published : 2003.10.01

Abstract

Spatial joins find pairs of objects that overlap with each other. In spatial joins using indexes, original-space indexes such as the R-tree are widely used. An original-space index is the one that indexes objects as represented in the original space. Since original-space indexes deal with sizes of objects, it is difficult to develop a formal algorithm without relying on heuristics. On the other hand, transform-space indexes, which transform objects in the original space into points in the transform space and index them, deal only with points but no sites. Thus, spatial join algorithms using these indexes are relatively simple and can be formally developed. However, the disadvantage of transform-space join algorithms is that they cannot be applied to original-space indexes such as the R-tree containing original-space objects. In this paper, we present a novel mechanism for achieving the best of these two types of algorithms. Specifically, we propose a new notion of the transform-space view and present the transform-space view join algorithm(TSVJ). A transform-space view is a virtual transform-space index based on an original-space index. It allows us to interpret on-the-fly a pre-built original-space index as a transform-space index without incurring any overhead and without actually modifying the structure of the original-space index or changing object representation. The experimental result shows that, compared to existing spatial join algorithms that use R-trees in the original space, the TSVJ improves the number of disk accesses by up to 43.1% The most important contribution of this paper is to show that we can use original-space indexes, such as the R-tree, in the transform space by interpreting them through the notion of the transform-space view. We believe that this new notion provides a framework for developing various new spatial query processing algorithms in the transform space.

공간 조인이란 서로 겹치는 관계를 가지는 공간 객체의 쌍들을 찾는 질의이다. 색인 기반 공간 조인에는 원공간 색인인 R 트리가 널리 사용된다. 원공간 색인이란 원공간상에서 표현된 공간 객체를 색인하는 구조로, 이를 활용한 조인은 크기를 가지는 공간 객체를 다루기 때문에 정형적인 방법이 아닌 휴리스틱에 의존하는 단점을 가진다. 반면, 변환공간 색인은 원공간 상의 공간 객체를 변환공간 상의 크기가 없는 점 객체로 변환하여 색인한 후에 이들을 다루기 때문에, 이를 활용한 공간 조인은 상대적으로 단순하고 정형적인 방법을 사용하는 장점을 가진다. 그러나, 이 방법은 R 트리와 같이 원공간 객체를 색인하는 원공간 색인에는 적용될 수 없는 문제점을 가진다. 본 논문에서는 이 두 방법의 장점만을 취하는 새로운 방법을 제안한다. 즉, 변환공간 뷰(transform-space view)라는 새로운 개념과 이를 사용한 공간 조인 알고리즘인 변환공간 뷰 조인 알고리즘(transform-space view join algorithm)을 제안한다. 변환공간 뷰란 원공간 색인에 대한 가상의 변환공간 색인으로서, 이미 구축된 원공간 색인을 구조적으로 변경하지 않고서 별도의 추가비용 없이 가상의 변환공간 색인으로 해석할 수 있게 한다. 실험 결과, 변환공간 뷰 조인알고리즘은 R 트리를 원공간에서 조인하는 알고리즘들과 비교하여 디스크 액세스 횟수 측면에서 최대 43.1%까지 더 좋은 성능을 보인다. 본 논문의 가장 중요한 공헌은 R 트리와 같이 널리 사용되는 원공간 색인을 변환공간 뷰라는 새로운 개념을 통하여 변환공간에서 해석하여 사용할 수 있음을 보인 것이다. 우리는 이 새로운 개념이 다양한 공간 질의 처리 알고리즘들이 변환공간에서 새롭게 개발될 수 있는 프레임워크를 마련했다고 믿는다.

Keywords

References

  1. T. Brinkhoff, H.-P. Kriegel, and B. Seeger, 'Efficient Processing of Spatial Joins Using R-Trees,' In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, pp. 237-246, May 1993 https://doi.org/10.1145/170035.170075
  2. O. Gunther, 'Efficient Computation of Spatial Joins,' In Proc. the Ninth Int'l Conf. on Data Engineering, pp. 50-59, 1993 https://doi.org/10.1109/ICDE.1993.344078
  3. Y.-W. Huang and N. Jing, 'Spatial Joins Using R-trees: Breadth-First Traversal with Global Optimizations,' In Proc. the 23rd Int'l Conf. on Very Large Data Bases, pp. 396-405, 1997
  4. M. L. La and C. V. Ravishankar, 'Spatial joins using seeded trees,' In Proceedings of ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, pp. 209-220, May, 1994 https://doi.org/10.1145/191839.191881
  5. M.-L. Lo and C. V. Ravishankar, 'Spatial Hash-Joins,' In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, pp. 247-258, June, 1996 https://doi.org/10.1145/233269.233337
  6. J. M. Patel and D. J. DeWitt, 'Partition Based Spatial-Merge Join,' In Proceedings of ACM SIGMOD International Conference on Management of Data, pp. 259-270, Jun. 1996 https://doi.org/10.1145/233269.233338
  7. J. Song, K. Whang, Y. Lee, M. Lee, and S. Kim, 'Spatial Join Processing Using Comer Transformation,' IEEE Trans. on Knowledge and Data Engineering, Vol. 11, No. 4, pp. 688-695, 1999 https://doi.org/10.1109/69.790844
  8. N. Mamoulis and D. Papadias, 'Slot Index Spatial Join' IEEE Transactions on Knowledge and Data Engineering, Vol.15, No.1, Jan/Feb., 2003 https://doi.org/10.1109/TKDE.2003.1161591
  9. N. Beckmann et al., 'The $R^{*}$-tree : An Efficient and Robust Access Method for Points and Rectangles,' In Proc. Int'l, Conf. on Management of Data, ACM SIGMOD, pp.322-331, May, 1990 https://doi.org/10.1145/93597.98741
  10. A. Guttman, 'R-Trees: A Dynamic Index Structure for Spatial Searching,' In Proceedings of ACM SIGMOD International Conference on Management of Data, pp.47-57, Jun., 1984 https://doi.org/10.1145/602259.602266
  11. J. Lee, Y. Lee, K. Whang, and I. Song, 'A Region Splitting Strategy for Physical Database Design of Multidimensional File Organizations,' In Proc. the 23rd Int'l Conf. on Very Large Data Bases, pp. 416-425, 1997
  12. K. Whang and R. Krishnamurthy, Multilevel Grid Files, IBM Research Report RC 11516, 1985
  13. S. Berchtold, D. Keirn, and H.-P. Kriegel, 'The X-tree: An Index Structure for High-Dimensional Data,' In Proc. the 22nd Int'l Conf. on Very Large Data Bases, pp. 28-39, 1996
  14. B. C. Ooi, K. J. Mcdonell, and R. Sacks-Davis, 'Spatial kd-tree: An Indexing Mechanism for Spatial Databases,' In Proc. IEEE Computer Software and Applications Conference, pp. 433-438, 1987
  15. Y. Ohsawa and M. Sakauchi, 'A New Tree Type Data Structure with Homogeneous Node Suitable for a Very Large Spatial Database,' In Proc. the Sixth IEEE Int'l Conf. on Data Engineering, pp. 296-303. 1990 https://doi.org/10.1109/ICDE.1990.113481
  16. H. V. Jagadish, 'Spatial Search with Polyhedra,' In Proc. the 16th IEEE Int'l Conf. on Data Engineering, pp. 311-319, 1990 https://doi.org/10.1109/ICDE.1990.113483
  17. O. Gunther, 'The Cell Tree: An Object-Oriented Index Structure for Geometric Databases,' In Proc. the 15th IEEE Int'l Conf. on Data Engineering, pp. 598-605, 1989 https://doi.org/10.1109/ICDE.1989.47267
  18. K. Hinrichs and J. Nievergelt, 'The Grid File: A Data Structure Designed to Support Proximity Queries on Spatial Objects,' In Proc. Int'l Workshop on Graph Theoretic Concepts in Computer Science, pp. 100-113, 1983
  19. B. Seeger and H.-P. Kriegel, 'Techniques for Design and Implementation of Efficient Spatial Access Methods,' In Proc. the 14th Int'l Conf. on Very Large Data Bases, pp. 360-371, 1988
  20. H. Samet, The Design and Analysis of Spatial Data Structures, Addision-Wesley Publishing Company, Inc., 1990
  21. V. Gaede, O. Gunther, 'Multidimensional Access Methods,' ACM Computing Surveys, 30(2), pp.170-231, 1998 https://doi.org/10.1145/280277.280279
  22. H. V. Jagadish, Linear Clustering of Objects with Multiple Attributes, In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, pp. 332-342, May 1990 https://doi.org/10.1145/93597.98742
  23. J. K. Lawder and P. J. H. King, Querying Multi-Dimensional Data Indexed Using the Hilbert Space-filling Curve, SIGMOD Record, Vol.30, No.1, pp. 19-24, 2001 https://doi.org/10.1145/373626.373678