Web Caching Strategy based on Documents Popularity

Yoo, Hae-Young; Park, Chel

Journal of KIISE:Computer Systems and Theory (한국정보과학회논문지:시스템및이론)

Volume 29, Issue 9 / Pages 530-538 / 2002 / 1229-683X (pISSN)

Korean Institute of Information Scientists and Engineers (한국정보과학회)

Yoo, Hae-Young (Dept. of Information Computer, Dankook University);
Park, Chel (Dept. of Computer Science Statistics, Graduate School of Dankook University)


Published : 2002.10.01

Abstract

In this paper, we propose a new caching strategy for web servers. When a request arrives, the proposed algorithm only collects statistics about the requested file, such as its popularity. At intervals, the files with the highest popularity are then cached all at once. Because the cache remains unchanged until it is rebuilt, the web server can use a very efficient data structure to determine whether a file is in the cache, which greatly increases the efficiency of cache manipulation. Furthermore, experiments with real log files produced by web servers show that both the cache hit ratio and the reuse ratio of cached data are better than those obtained with LRU. The proposed algorithm has a drawback: the cache hit ratio may decrease when the popularity of files that are not in the cache rises suddenly. In our opinion, however, such surges are infrequent, and a web server can easily be implemented to adapt to these unusual cases.
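
The strategy described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the class name `PopularityLazyCache`, the use of cumulative request counts as the popularity measure, a rebuild period counted in requests, and a capacity counted in files are all hypothetical choices that the abstract does not specify.

```python
from collections import Counter


class PopularityLazyCache:
    """Sketch of a popularity-based lazy cache (all names are hypothetical)."""

    def __init__(self, capacity, rebuild_every):
        self.capacity = capacity            # max number of cached files (assumption)
        self.rebuild_every = rebuild_every  # requests between rebuilds (assumption)
        self.popularity = Counter()         # cumulative request count per file (assumption)
        self.cached = frozenset()           # never modified between rebuilds
        self.requests_seen = 0
        self.hits = 0

    def request(self, path):
        """Record one request and return True on a cache hit."""
        hit = path in self.cached           # constant-time membership test
        self.hits += hit
        self.popularity[path] += 1          # a request only updates statistics
        self.requests_seen += 1
        if self.requests_seen % self.rebuild_every == 0:
            self.rebuild()
        return hit

    def rebuild(self):
        """Replace the whole cache with the currently most popular files."""
        top = self.popularity.most_common(self.capacity)
        self.cached = frozenset(path for path, _ in top)
```

Because `cached` is replaced wholesale and never mutated in between, a lookup needs no bookkeeping such as the recency-list updates that LRU performs on every request. The same sketch also makes the stated drawback visible: a file whose popularity surges after a rebuild keeps missing until the next rebuild.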

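This paper introduces a new caching strategy that only examines a file's popularity at the moment the file is requested and, after a certain period has elapsed, caches the most popular files in a batch. Because the state of the cache does not change during that period, the strategy permits a very efficient data structure whose cache manipulation cost is almost zero. In addition, experiments on log files collected from five web servers in Korea and abroad show that the cache hit ratio is higher than with LRU and that the reuse ratio of cached data also increases. The Popularity Based Lazy Caching Strategy proposed in this paper does perform worse when the popularity of files not included in the cache increases sharply. However, such cases do not occur often and can be handled by implementing the web server adaptively.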
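
A comparison against LRU of the kind the abstract reports can be set up with a small trace-replay harness. The sketch below is illustrative only: the function names, the trace format (a list of requested file paths), the cumulative request counts, and a capacity counted in files rather than bytes are assumptions, and it makes no claim about the figures obtained in the paper.

```python
from collections import Counter, OrderedDict


def lru_hit_ratio(trace, capacity):
    """Replay a trace through a plain LRU cache and return the hit ratio."""
    cache = OrderedDict()
    hits = 0
    for path in trace:
        if path in cache:
            hits += 1
            cache.move_to_end(path)        # mark as most recently used
        else:
            cache[path] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used file
    return hits / len(trace) if trace else 0.0


def lazy_hit_ratio(trace, capacity, period):
    """Replay a trace through a cache rebuilt in batches by popularity."""
    cached = frozenset()                   # unchanged between rebuilds
    counts = Counter()                     # cumulative request counts (assumption)
    hits = 0
    for i, path in enumerate(trace, start=1):
        hits += path in cached
        counts[path] += 1
        if i % period == 0:                # rebuild every `period` requests
            cached = frozenset(p for p, _ in counts.most_common(capacity))
    return hits / len(trace) if trace else 0.0
```

Both functions take the same trace and capacity, so the two policies can be evaluated side by side on any server log that has been reduced to a sequence of requested paths.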

Keywords

web caching strategy


