Efficient Processin of Queries with Joints and Aggregate Functions in ROLAP Data Warehousing Environment

관계형 OLAP 데이터 웨어하우징 환경에서 조인과 집계함수를 포함하는 질의의 효율적인 처리

  • Kim, Jin-Ho (Department of Computer Science Kangwon National University) ;
  • Kim, Yun-Ho (Department of Computer, Information, and Communications Engineering, Kangwon National University) ;
  • Kim, Sang-Wook (Department of Computer, Information, and Communications Engineering, Kangwon National University)
  • 김진호 (江原大學敎 電子計算學科) ;
  • 김윤호 (江原大學敎 컴퓨터 情報通信工學部) ;
  • 김상욱 (江原大學敎 컴퓨터 情報通信工學部)
  • Published : 2002.09.01

Abstract

Efficient processing of expensive queries that include joins and/or aggregate functions is crucial in data warehousing environment since there reside enormous volume of data. In this paper, we propose a new method for processing of queries that have both of joins and aggregate functions. The proposed method first performs grouping of the dimension table and then processes join by using the bitmap join index. This makes only the fact table accessed for processing aggregate functions, and thus resolves the serious performance degradation of the existing method. For showing the superiority of the proposed method, we suggest the cost models for the proposed and existing ones, and perform extensive simulations based on the TPC-H benchmark.

대용량의 데이터가 저장되는 데이터 웨어하우징 환경에서 조인이나 집계 함수와 같은 고비용의 연산의 효율적인 처리는 매우 중요하다. 본 논문에서는 집계 함수(aggregate function)와 조인(join)이 모두 포함된 질의를 처리하는 새로운 기법을 제안한다. 제안하는 기법은 먼저 차원 테이블(dimension table)을 미리 그루핑한 후, 비트맵 조인 인덱스(bitmap join index)를 이용하여 조인을 처리하는 방식을 사용한다. 이 결과, 사실 테이블(fact table)만을 접근하여 집계 함수를 처리함으로써 기존 기법이 가지는 성능 저하의 문제점을 해결할 수 있다. 기존 기법과 제안하는 기법에 대한 비용 모델(cost model)을 정립하고, TPC-H 벤치마크를 기반으로 하는 다양한 시뮬레이션을 수행함으로써 제안된 기법의 우수성을 규명한다.

Keywords

References

  1. C.Y. Chan and Y.E. Ioannidis, 'Bitmap Index Design and Evaluation,' In Proc. Int'l. Conf. on Management of Data, ACM SIGMOD, pp. 355-366, Seattle, Washington, USA, June 1998 https://doi.org/10.1145/276305.276336
  2. S. Chaudhuri and K. Shim, 'Optimizing Queries with Aggregate Views,' In Proc. Int'l. Conf. on Extending Database Technology, pp. 167-182, Avignon, France, March 1996
  3. W.D. Frazer and C.K. Wong, 'Sorting by Natural Selection,' Communications of the ACM, Vol. 15, No. 10, pp. 910-913, October 1972 https://doi.org/10.1145/355604.361597
  4. A. Gupta, V. Harinarayan, and D. Quass, 'Aggregate-Query Processing in Data Warehousing Environments,' In Proc. Int'l. Conf. on Very Large Data Bases, pp. 358-369, Zurich, Switzerland, September 1995
  5. Goetz Graefe, 'Query Evaluation Techniques for Large Databases,' ACM Computing Surveys, Vol. 25, No. 2, pp. 73-170, 1993 https://doi.org/10.1145/152610.152611
  6. W.H. Inmon and R.D. Hackathorn, Using the Data Warehouse, John Wiley & Sons, 1994
  7. Informix White Paper, Informix Decision Support Indexing for the Enterprise Data Warehouse, 1998
  8. W.H. Inmon, Building the Data Warehouse, John Wiley & Sons, March 1996
  9. R. Kimball, The Data Warehouse Toolkit, John Wiley & Sons, 1996
  10. M. Muralikrishna, 'Improved Unnesting Algorithms for Join Aggregate SQL Queries,' In Proc. Int'l. Conf. on Very Large Data Bases, pp. 91-102, Vancouver, Canada, August 1992
  11. P. O'Neil and D. Quass, 'Improved Query Performance with Variant Indexes,' In Proc. Int'l. Conf. on Management of Data, ACM SIGMOD, pp. 38-49, Tucson, Arizona, USA, May 1997 https://doi.org/10.1145/253260.253268
  12. Transaction Processing Performance Council (TPC), TPC Benchmark H (Decision Support), Standard Specification Revision 1.2.1, 1999
  13. M.C. Wu and A. Buchmann, 'Encoded Bitmap Indexing for Data Warehouses,' In Proc. Int'l. Conf. on Data Engineering, IEEE, pp. 220-230, Orlando, Florida, USA, February 1998
  14. M.C. Wu, 'Query Optimization for Selections using Bitmaps,' In Proc. Int'l. Conf. on Minagement of Data, ACM SIGMOD, pp. 227-238, Philadephia, Pennsylvania, USA, June 1999
  15. W.P. Yan and P.A. Larson, 'Eager Aggregation and Lazy Aggregation,' In Proc. Int'l Conf. on Very Large Data Bases, pp. 345-357, Zurich, Switzerland, September 1995