Spatial Selectivity Estimation for Intersection region Information Using Cumulative Density Histogram

  • Kim byung Cheol (Database Laboratory, Chungbuk National University) ;
  • Moon Kyung Do (Database Laboratory, Chungbuk National University) ;
  • Ryu Keun Ho (Database Laboratory, Chungbuk National University)
  • Published : 2004.10.01

Abstract

Multiple-count problem is occurred when rectangle objects span across several buckets. The Cumulative Density (CD) histogram is a technique which solves multiple-count problem by keeping four sub-histograms corresponding to the four points of rectangle. Although it provides exact results with constant response time, there is still a considerable issue. Since it is based on a query window which aligns with a given grid, a number of errors may be occurred when it is applied to real applications. In this paper, we proposed selectivity estimation techniques using the generalized cumulative density histogram based on two probabilistic models: (1) probabilistic model which considers the query window area ratio, (2) probabilistic model which considers intersection area between a given grid and objects. In order to evaluate the proposed methods, we experimented with real dataset and experimental results showed that the proposed technique was superior to the existing selectivity estimation techniques. The proposed techniques can be used to accurately quantify the selectivity of the spatial range query on rectangle objects.

Keywords