Improving Flash Translation Layer for Hybrid Flash-Disk Storage through Sequential Pattern Mining based 2-Level Prefetching Technique

A Sequential Pattern Mining-Based Two-Level Prefetching Technique for Improving the Performance of the Flash Translation Layer for Hybrid Flash-Disk Storage

  • Received : 2010.09.27
  • Accepted : 2010.11.05
  • Published : 2010.11.30

Abstract

This paper presents an intelligent prefetching technique that significantly improves the performance of hybrid flash-disk storage, a device that combines flash memory and a hard disk. Because the flash memory embedded in a hybrid device is much faster than the hard disk for I/O operations, it can be used as a cache to improve system performance. The basic prefetching strategy is sequential pattern mining, which extracts recurring object access patterns from historical access sequences. We use two techniques to enhance the performance of hybrid storage with prefetching. The first modifies the FAST algorithm used for flash memory mapping. The second extends the unit of prefetching to the block level in addition to the file level, so that the limited flash memory space is used effectively. To evaluate the proposed technique, we conduct experiments on synthetic data and real UCC data and demonstrate its usefulness.

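This paper proposes a prefetching technique for improving the performance of hybrid storage devices composed of flash memory and a hard disk. Because the flash memory in a hybrid storage device performs read and write operations relatively faster than the hard disk, it can be used like a cache space to raise performance. The basic strategy for prefetching is to use sequential pattern mining, which extracts repeated object access patterns from temporally ordered sequences of past object references. To maximize the performance of hybrid storage with prefetching, this paper uses two methods. First, the existing FAST algorithm is improved for flash memory mapping. Second, both the file level and the block level are considered as prefetching units in order to use the limited flash memory space efficiently. To evaluate the usefulness of the proposed technique, experiments were conducted using synthetic data with locality of reference and UCC data, and the results demonstrate the superiority of the proposed method.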
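
The idea of predicting the next access from mined patterns can be illustrated with a minimal sketch. It is not the algorithm of this paper: it replaces full sequential pattern mining with a simpler count of contiguous access subsequences, and the function names `mine_sequential_rules` and `predict_next`, the `min_support` and `max_len` parameters, and the sample history are all hypothetical. The items in the history may stand for file identifiers or block identifiers, so the same routine could in principle be run once per prefetching level.

```python
from collections import Counter, defaultdict


def mine_sequential_rules(history, min_support=2, max_len=3):
    """Count contiguous subsequences of length 2..max_len in an access
    history and keep the frequent ones as prefix -> successor counts.

    Simplifying assumption: only contiguous patterns are counted, whereas
    general sequential pattern mining also allows gaps between items.
    """
    counts = Counter()
    for n in range(2, max_len + 1):
        for i in range(len(history) - n + 1):
            counts[tuple(history[i:i + n])] += 1

    rules = defaultdict(Counter)
    for pattern, support in counts.items():
        if support >= min_support:
            # Split the pattern into its prefix and its final item.
            rules[pattern[:-1]][pattern[-1]] = support
    return rules


def predict_next(rules, recent, max_len=3):
    """Return the most frequent successor of the longest suffix of
    `recent` that matches a mined prefix, or None if nothing matches."""
    for n in range(min(max_len - 1, len(recent)), 0, -1):
        prefix = tuple(recent[-n:])
        if prefix in rules:
            return rules[prefix].most_common(1)[0][0]
    return None


# Hypothetical access history of object identifiers (files or blocks).
history = ["A", "B", "C", "A", "B", "C", "A", "B", "D"]
rules = mine_sequential_rules(history)

# The object returned here is the candidate to copy from disk to flash.
candidate = predict_next(rules, ["A", "B"])
print(candidate)  # -> C
```

In such a scheme the predicted object would be copied from the hard disk into flash memory before it is requested. How the copied data is mapped in flash, which the paper handles with a modified FAST algorithm, is outside the scope of this sketch.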
