File Access Pattern Collection Scheme based on Repetitiveness

반복성을 고려한 파일 액세스 패턴 수집 기법

  • 황보준형 (경북대학교 전자공학과) ;
  • 석성우 (한국전자통신연구원 컴퓨터 소프트웨어 연구소) ;
  • 서대화 (경북대학교 전자전기공학부)
  • Published : 2001.12.01

Abstract

This paper presents the SIC(Size-Interval-Count) prefetching scheme that can record the file access patterns of applications within a relatively small space of memory based on the repetitiveness of the file access patterns. Several knowledge-based prefetching methods were recently introduced, which includes high correctness in predicting future accesses of applications. They records the access patterns of applications and uses recorded access pattern information to predict which blocks will be requested next. Yet, these methods require to much memory space. Accordingly, the proposed method then uses the recorded file access patterns, referred to as "SIC access pattern information", to correctly predict the future accesses of the applications. The proposed prefetching method improved the response time by about 40% compared to the general file system and showed remarkable memory efficiency compared to the previously knowledge-based prefetching methods.

본 논문에서는 액세스 패턴의 반복성을 이용하여 비교적 적은 메모리 공간을 사용하는 SIC (SiZe-Interval-Count) 선반입 기법을 제안한다. 최근에 연구되어진 지식기반의 선반입 기법은 응용프로그램의 액세스를 예측하여 정확한 선반입을 수행하는 기법이다. 이들 기법은 응용프로그램의 액세스 패턴을 기록하고, 기록된 액세스 패턴정보를 이용하여 다음에 요청될 블록을 예측하게 된다. 하지만 이 기법은 많은 메모리 공간의 사용을 필요로 한다. 따라서 제안된 선반입 기법에서는 "SIC 액세스 패턴 정보"를 이용하여 반복적인 액세스 패턴을 효율적으로 저장하고, 이를 이용하여 응용프로그램의 다음에 요청될 블록을 정확하게 예측한다. 본 논문의 선반입 기법은 일반 파일시스템에 비해 최고 40%의 응답속도 향상을 가져오며, 기존의 지식기반 선반입 기법에 비해 뛰어난 메모리 효율성을 보여준다.

Keywords

References

  1. L. Breslau, P. Cao, L. Fan, G. Phillips and S. Shenker, 'Web Caching and Zipf-like Distributions: Evidence and Implications.' In Proc. of IEEE Infocom'99, pp. 126-134, March 1999 https://doi.org/10.1109/INFCOM.1999.749260
  2. T. M. Madhyastha, 'Automatic Classification of Input/Output Access Patterns. Tech. Rep.,' University of Illinois at Urbana-Champaign, Department of Computer Science, August 1997
  3. J. K. Ousterhout, H. Da Costa, D. Harrison, J. A. Kunze, M. Kupfer, and J. G. Thompson, 'A Trace-Driven Analysis of the UNIX 4.2 BSD File System,' In Proc. of the 10th Symposium on Operating System Principles, pp. 15-24, December 1985 https://doi.org/10.1145/323627.323631
  4. N. Nieuwejaar, D. Kotz, A. Purakayastha, C. S. Ellis, and M. Best, 'File-access characteristics of parallel scientific workloads,' IEEE Transactions on Parallel and Distributed Systems, 7(10):1075-1089, October 1996 https://doi.org/10.1109/71.539739
  5. A. Purakayastha, C. S. Ellis, D. Kotz, N. Nieuwejaar, and M. Best, 'Characterizing Parallel File-Access Patterns on a Large-Scale Multiprocessor,' I Proc. of the Ninth International Parallel Processing Symposium, pages 165-172, April, 1995 https://doi.org/10.1109/IPPS.1995.395928
  6. D. Kotz and C.S. Ellis, 'Practical prefetching techniques for multiprocessor file systems,' Journal of Distributed and Parallel Databases, 1(1):33-51, January 1993 https://doi.org/10.1007/BF01277519
  7. T. M. Madhyastha and D. A. Reed, 'Exploiting Global Input/Output Access Pattern Classification,' In Proc. of SC'97, November 1997, CD-ROM https://doi.org/10.1109/SC.1997.10016
  8. T. M. Madhyastha and D. A. Reed, 'Input/Output Access Pattern Classification Using Hidden Markov Models,' In Proc. of the Workshop on Input/Output in Parallel and Distributed Systems(IOPADS), November 1997 https://doi.org/10.1145/266220.266226
  9. R.H. Patterson, G.A. Gibson, E. Ginting, D. Stodolsky,and J. Zelenka, 'Informed prefeching and caching,' In Proc. of the Fifteenth ACM Symposium on Operating Systems Principles, pp. 79-95, December 1995
  10. A. Tomkins, R.H. Patterson and G.A. Gibson, 'Informed Multi-Process Prefetching and Caching,' I Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems(SIGMETRICS), June 1997 https://doi.org/10.1145/224057.224064