Efficient Storage Techniques for Materialized Views Using Multi-Zoned Disks in OLAP Environment

OLAP 환경에서 다중 존 디스크를 활용한 실체뷰의 효율적 저장 기법

  • 장재영 (한성대학교 컴퓨터공학과)
  • Published : 2009.02.28

Abstract

In determining the performance of OLAP database applications, the structure and the effective access methods to the underlying disk system is a significant factor. In recent years, hard disks are designed with multiple physical zones where seek times and data transfer rates vary across the zones. However, there is little consideration of multi-zone disks in previous works. Instead, they assumed a traditional disk model that comes with many simplifying assumptions such as an average seek-time and a single data transfer rate. In this paper, we propose a technique storing a set of materialized views into the multi-zoned disks in OLAP environment dealing with large sets of data. We first present the disk zoning algorithm of materialized views according to the access probabilities of each views. Also, we address the problem of storing views in the dynamic environment where data are updated continuously. Finally, through experiments, we prove the performance improvement of the proposed algorithm against the conventional methods.

대용량의 데이타를 다루는 OLAP 데이타베이스 환경에서는 기반 디스크 시스템의 구조와 효율적 접근방법이 전체적인 성능을 좌우하는 중요한 요소가 된다. 최근 들어 하드 디스크들은 여러 개의 물리적 존을 갖는 구조로 설계되고 있는데, 각 존들은 그 위치에 따라 다양한 탐색시간과 데이타 전송률을 갖는 특징을 갖고 있다. 그러나 기존 연구에서는 다중 존을 고려하지 않은 하나의 탐색시간과 데이타 전송률을 갖는 단순한 디스크 모델에 기반을 두고 진행되어 왔다. 본 논문에서는 대용량의 데이타를 다루는 OLAP 환경에서 주어진 실체뷰 집합을 다중 존에 효율적으로 저장하는 기법을 제안한다. 이를 위해 각 실체뷰를 접근확률에 따라 디스크 존에 배치하는 알고리즘을 제시하고, 데이타 지속적으로 갱신되는 동적 환경에서의 저장 방법에 대해서도 살펴본다. 마지막으로 실험을 통하여 본 논문에서 제시된 알고리즘을 효율성을 증명한다.

Keywords

References

  1. V. Harinarayan, Rajaraman, A., and Ullman, J., “Implementing Data Cubes Efficiently,” Proc. of ACM SIGMOD, 1996.
  2. H. Gupta, “Selection of Views to Materialize in a Data Warehouse,” Proc. of ICDT, 1997, pp. 98-112.
  3. H. Gupta, Harinarayan, V., Rajaraman, A., and Ullman, J., “Index Selection for OLAP,” Proc. of ICDE, 1997, pp. 208-219.
  4. Y. Kotidis and Roussopoulos, N., “Dynamat: A Dynamic View Management System for Data Warehouses,” Proc. of ACM SIGMOD, 1999, pp. 371-382.
  5. H. Mistry, P. Roy, S. Sudarshan, K. Ramamritham, “Materialized View Selection and Maintenance Using Multi-Query Optimization,” Proc. of ACM SIGMOD, 2001, pp. 310-318.
  6. A. Shukla, Deshpande, P. M., and Nau ghton, J. F., “Materialized View Selection for Multidimensional DataSet,” Proc. of VLDB, 1998, pp. 488-499,
  7. S. Agrawal, Chaudhuri, S., and Narasayya, V., “Automated Selection of Materialized Views and Indexes for SQL databases,” Proc. of VLDB, 2001, pp. 59-68.
  8. C. A. Dhote and ALi, M. S., “Materialized View Selection in Data Warehouse,” Proc. of International Conference on Information Technology, 2007.
  9. C. Ruemmler. and Wilkes, J., “An Introduction to Disk Drive Modeling,” IEEE Computer, March 1994.
  10. S. W. Ng, “Advances in Disk Technology: Performance Issues,” IEEE Computer Magazine, 1998, pp. 75-81.
  11. Oracle and 3PAR. “Simplified database storage management that lowers management costs and yields high storage utilization”. White paper, http://www.oracle.com/technology/products/database/asm/pdf/oracle_ 3par_wp_final.pdf 2004.
  12. Yu, B. and Kim, S.-H. “An Efficient Zoning Technique for Multidimensional Access Methods,” Proc. the VLDB Workshop on Trends in Enterprise Application Architecture, LNCS 3888, 2005, pp. 129-143.
  13. Yu, B. and Kim, S.-H. “Zoning Multidimensional Access Methods for Analytical Database Applications,” Proc. the 3rd International Conference on Computer Science and its Applications, 2005, pp. 191-196.
  14. Kim, S.-H., Yu, B., and Chang, J.-Y., “Zoned-partitioning of tree-like access methods,” Information Systems, Vol. 33, 2008, pp. 315-331. https://doi.org/10.1016/j.is.2007.11.001
  15. Seagate, Seagate Cheetah X15 FC disk drive ST318451FC/FCV product manual, Vol. 1, Document number 83329486, June 2000.
  16. Seagate, Barracuda 7200.7 Plus Serial ATA Publication number:100270024, Publication number:100270024, Rev. N, September 2005.