• Title/Summary/Keyword: range sum queries

Search Result 6, Processing Time 0.02 seconds

An Indexing Technique for Range Sum Queries in Spatio - Temporal Databases (시공간 데이타베이스에서 영역 합 질의를 위한 색인 기법)

  • Cho Hyung-Ju;Choi Yong-Jin;Min Jun-Ki;Chung Chin-Wan
    • Journal of KIISE:Databases
    • /
    • v.32 no.2
    • /
    • pp.129-141
    • /
    • 2005
  • Although spatio-temporal databases have received considerable attention recently, there has been little work on processing range sum queries on the historical records of moving objects despite their importance. Since to answer range sum queries, the direct access to a huge amount of data incurs prohibitive computation cost, materialization techniques based on existing index structures are recently suggested. A simple but effective solution is to apply the materialization technique to the MVR-tree known as the most efficient structure for window queries with spatio-temporal conditions. However, the MVR-tree has a difficulty in maintaining pre-aggregated results inside its internal nodes due to cyclic paths between nodes. Aggregate structures based on other index structures such as the HR-tree and the 3DR-tree do not provide satisfactory query performance. In this paper, we propose a new indexing technique called the Adaptive Partitioned Aggregate R-Tree (APART) and query processing algorithms to efficiently process range sum queries in many situations. Experimental results show that the performance of the APART is typically above 2 times better than existing aggregate structures in a wide range of scenarios.

An Algorithm for Computing Range-Groupby Queries (영역-그룹화 질의 계산 알고리즘)

  • Lee, Yeong-Gu;Mun, Yang-Se;Hwang, Gyu-Yeong
    • Journal of KIISE:Databases
    • /
    • v.29 no.4
    • /
    • pp.247-261
    • /
    • 2002
  • Aggregation is an important operation that affects the performance of OLAP systems. In this paper we define a new class of aggregation queries, called range-groupby queries, and present a method for processing them. A range-groupby query is defined as a query that, for an arbitrarily specified region of an n-dimensional cube, computes aggregations for each combination of values of the grouping attributes. Range-groupby queries are used very frequently in analyzing information in MOLAP since they allow us to summarize various trends in an arbitrarily specified subregion of the domain space. In MOLAP applications, in order to improve the performance of query processing, a method of maintaining precomputed aggregation results, called the prefix-sum array, is widely used. For the case of range-groupby queries, however, maintaining precomputed aggregation results for each combination of the grouping attributes incurs enormous storage overhead. Here, we propose a fast algorithm that can compute range-groupby queries with minimal storage overhead. Our algorithm maintains only one prefix-sum away and still effectively processes range-groupby queries for all possible combinations of the grouping attributes. Compared with the method that maintains a prefix-sum array for each combination of the grouping attributes in an n-dimensional cube, our algorithm reduces the space overhead by (equation omitted), while accessing a similar number of cells.

Efficient Processing of MAX-of-SUM Queries in OLAP (OLAP에서 MAX-of-SUM 질의의 효율적인 처리 기법)

  • Cheong, Hee-Jeong;Kim, Dong-Wook;Kim, Jong-Soo;Lee, Yoon-Joon;Kim, Myoung-Ho
    • Journal of KIISE:Databases
    • /
    • v.27 no.2
    • /
    • pp.165-174
    • /
    • 2000
  • Recent researches about range queries in OLAP are only concerned with applying an aggregation operator over a certain region. However, data analysts in real world need not only the simple range query pattern but also an extended range query pattern that finds ranges which satisfy a special condition specified by using several aggregation operators. In this work, we define the general form of the extended range query and propose an efficient processing method for the 'MAX -of-SUM' query, which is the representative form of the extended range query pattern. The MAX-of-SUM query finds the range which has the maximum range sum value in data cube where the size of the range is given. The proposed query processing method is based on the prediction of the scope of the range sum values. That is, the search space on the query processing can be reduced by using the result of the prediction, and hence, the query response time is also reduced.

  • PDF

Efficient Processing method of OLAP Range-Sum Queries in a dynamic warehouse environment (다이나믹 데이터 웨어하우스 환경에서 OLAP 영역-합 질의의 효율적인 처리 방법)

  • Chun, Seok-Ju;Lee, Ju-Hong
    • The KIPS Transactions:PartD
    • /
    • v.10D no.3
    • /
    • pp.427-438
    • /
    • 2003
  • In a data warehouse, users typically search for trends, patterns, or unusual data behaviors by issuing queries interactively. The OLAP range-sum query is widely used in finding trends and in discovering relationships among attributes in the data warehouse. In a recent environment of enterprises, data elements in a data cube are frequently changed. The problem is that the cost of updating a prefix sum cube is very high. In this paper, we propose a novel algorithm which reduces the update cost significantly by an index structure called the Δ-tree. Also, we propose a hybrid method to provide either approximate or precise results to reduce the overall cost of queries. It is highly beneficial for various applications that need quick approximate answers rather than time consuming accurate ones, such as decision support systems. An extensive experiment shows that our method performs very efficiently on diverse dimensionalities, compared to other methods.

Overlapped-Subcube: A Lossless Compression Method for Prefix-Sun Cubes (중첩된-서브큐브: 전위-합 큐브를 위한 손실 없는 압축 방법)

  • 강흠근;민준기;전석주;정진완
    • Journal of KIISE:Databases
    • /
    • v.30 no.6
    • /
    • pp.553-560
    • /
    • 2003
  • A range-sum query is very popular and becomes important in finding trends and in discovering relationships between attributes in diverse database applications. It sums over the selected cells of an OLAP data cube where target cells are decided by specified query ranges. The direct method to access the data cube itself forces too many cells to be accessed, therefore it incurs severe overheads. The prefix-sum cube was proposed for the efficient processing of range-sum queries in OLAP environments. However, the prefix-sum cube has been criticized due to its space requirement. In this paper, we propose a lossless compression method called the overlapped-subcube that is developed for the purpose of compressing prefix-sum cubes. A distinguished feature of the overlapped-subcube is that searches can be done without decompressing. The overlapped-subcube reduces the space requirement for storing prefix-sum cubes, and improves the query performance.

Error Estimation about Selectivity of Approximate Range Queries in Multi-Dimensional Histogram (다차원 히스토그램에서 범위 질의의 선택도에 대한 오차 추정)

  • 정지훈;홍석진;배진욱;안성준;송병호;이석호
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10a
    • /
    • pp.211-213
    • /
    • 2001
  • 히스토그램은 질의 최적화글 위해 사용되는 튿-계 정또 중 하나이다. 최근에는 방대한 데이타에 대한 범위 질의의 선택도 추정 방법의 하나로 사용되기도 한다. 히스토그램을 통한 범위 질의의 선택도 추정 결과는 항상 오차를 포함한다. 따라서 결과의 신뢰성을 보장하기 위해 선택도에 대한 오차를 추정하는 방법이 요구된다. 추정된 선택도의 오차 추정에 대한 기존 방법은 1차원 히스토그램만을 고려하여 하나의 애트리뷰트의 값에 따라 빈도의 분포를 반영하므로 애트리뷰트가 많은 다차원 히스토그램에 바로 적용시키는데 문제가 있다. 이 논문에서는 기존의 추정된 선택도에 대한 오차 추정 기법들을 다차원에 적용할 수 있게 확장한 M-Max, M-Sum 기법을 제안하고, 두 기법을 합친 하이브리드 기법을 제안한다. 실험을 통해 M-Sum 기법과 하이브리드 기법이 M-Max 기법보다 정확한 오차 추정 기법임을 보이고, 또한 작은 기억 공간에서도 두 기법이 오차를 보다 정확하게 추정함을 보인다.

  • PDF