DOI QR코드

DOI QR Code

QoS Guarantee in Partial Failure of Clustered VOD Server

클러스터 VOD 서버의 부분적 장애에서 QoS 보장

  • 이좌형 (강원대학교 컴퓨터정보통신공학과) ;
  • 정인범 (강원대학교 컴퓨터정보통신공학)
  • Published : 2009.06.30

Abstract

For large scale VOD service, cluster servers are spotlighted to their high performance and low cost. A cluster server usually consists of a front-end node and multiple back-end nodes. Though increasing the number of back-end nodes can result in the more QoS streams for clients, the possibility of failures in back-end nodes is proportionally increased. The failure causes not only the stop of all streaming service but also the loss of the current playing positions. In this paper, when a back-end node becomes a failed state, the recovery mechanisms are studied to support the unceasing streaming service. For the actual VOD service environment, we implement a cluster-based VOD servers composed of general PCs and adopt the parallel processing for MPEG movies. From the implemented VOD server, a video block recovery mechanism is designed on parity algorithms. However, without considering the architecture of cluster-based VOD server, the application of the basic technique causes the performance bottleneck of the internal network for recovery and also results in the inefficiency CPU usage of back-end nodes. To address these problems, we propose a new failure recovery mechanism based on the pipeline computing concept.

대용량 VOD 서비스를 위한 서버로 높은 성능과 낮은 가격의 클러스터 서버가 주목받고 있다. 일반적으로 클러스터 서버는 하나의 front-end 노드와 여러 back-end 노드로 구성된다. back-end 노드 수를 증가시키면 더 많은 클라이언트들에게 QoS를 보장하는 스트리밍 서비스를 할 수 있지만, back-end 노드의 오류 가능성도 이와 비례하여 증가한다. 서버의 장애는 모든 스트리밍 서비스를 중단시킬 뿐 아니라 현재 재생 위치 정보도 잃어버린다. 본 논문에서는 back-end 노드가 오류 상태가 될 때, 끊이지 않는 스트리밍 서비스를 지원하기 위한 복구 방법을 제안한다. 실제 VOD 서비스 환경을 위해, 일반 PC로 구성된 클러스터 기반의 VOD 서버를 구현하였으며, MPEG 영화를 위한 병렬 처리 기법을 사용하였다. 구현된 VOD 서버에 패리티 연산을 이용한 비디오 블록 복구 방법을 설계하였다. 하지만, 클러스터 기반의 VOD 서버 구조를 고려하지 않으면 복구를 위한 내부 네트워크 성능의 병목현상과 back-end 노드들의 비효율적인 CPU 사용을 야기시킨다. 본 논문에서는 이러한 문제를 해결하기 위해, 파이프라인 개념을 이용한 새로운 장애 복구 방법을 제안한다.

Keywords

References

  1. Dinkar Sitaram, Asit Dan, 'Multimedia Servers: Applications, Environments, and Design,' Morgan Kaufmann Publishers, 2000
  2. http://www.mpeg.org
  3. Armando Fox, David Patterson, 'Approaches to Recovery Oriented Computing,' IEEE Internet Computing, Vol.9, No.2, pp.14-16, 2005 https://doi.org/10.1109/MIC.2005.39
  4. Dong Tang, Ji Zhu, Roy Andrada, 'Automatic Generation of Availability Models in RAScard,' IEEE International Conference of Dependable Systems and Networks, June, 23-26, pp.488-494, 2002
  5. T. Chang, S. Shim, and D. Du, 'The Designs of RAID with XOR Engines on Disks for Mass Storage Systems,' IEEE Mass Storage Conference, March, 23-26, pp.181-186, 1998
  6. Prashant J. Shenoy, Harrick M. Vin, 'Failure recovery algorithms for multimedia servers,' Multimedia Systems, 8: pp.1-19, Springer-Verlag, 2000 https://doi.org/10.1007/s005300050001
  7. Jack Y.B. Lee, 'Supporting Server-Level Fault Tolerance in Concurrent-Push-Based Parallel Video Servers,' IEEE transactions on Circuits and Systems for Video Technology, Vol.11, No.1, pp.25-39, January, 2001 https://doi.org/10.1109/76.894280
  8. Jamel Gafsi, Ernst W. Biersack, 'Modeling and Performance Comparison of Reliability Strategies for Distributed Video Servers,' IEEE Transactions on Parallel and Distributed Systems, Vol.11, No.4, pp.412-430, 2000 https://doi.org/10.1109/71.850836
  9. 서동만, 방철석, 이좌형, 김병길, 정인범, '리눅스 기반의 클러스터 VOD 서버와 내장형에 클라이언트의 구현', 정보과학회논문지 제10권 제6호 pp.435-447, 2004
  10. Jung-Min Choi, Seung-Won Lee, Ki-Dong Chung, 'A Muticast Delivery Scheme for VCR Operations in a Large VOD System,' 8th IEEE International Conference on Parallel and Distributed Systems, pp.555-561, June, 26-29, 2001 https://doi.org/10.1109/ICPADS.2001.934866
  11. D.A. Patterson, G. Gibson, and R. H. Katz, 'A Case for Redundant Arrays of Inexpensive Disks(RAID),' Proceedings of the 1988 ACM Conferences on Management of Data, pp.109-116, June, 1988 https://doi.org/10.1145/50202.50214
  12. M. Holland, G.Gibson, and D. Siewiorek, 'Architectures and algorithms for on-line failure recovery in redundant disk arrays,' Journal of Distributed and Parallel Databases, Vol.2, pp.295-335, 1994 https://doi.org/10.1007/BF01266332
  13. Brian K. Schmidt, Monica S. Lam, J. Duane Northcutt, 'The interactive performance of SLIM: a stateless, thin-client architecture,' ACM SOSP'99, pp.31-47, 1999 https://doi.org/10.1145/319151.319154
  14. W.C. Feng and M. Lie, 'Critical Bandwidth Allocation Techniques for Stored Video Delivery Across Best-Effort Networks,' 20th International Conference on Distributed Computing Systems, pp.201-207, April, 2000 https://doi.org/10.1109/ICDCS.2000.840907
  15. Jung-Min Choi, Seung-Won Lee, Ki-Dong Chung, 'A Muticast Delivery Scheme for VCR Operations in a Large VOD System,' 8th IEEE International Conference on Parallel and Distributed Systems, pp.555-561, June, 26-29, 2001 https://doi.org/10.1109/ICPADS.2001.934866
  16. David A. Patterson and John L. Hennessy, 'Computer Organization & Design,' pp.392-490, Morgan Kaufmann, 1998
  17. Nabil J. Sarhan, Chita R. Das, 'Caching and Scheduling in NAD-Based Multimedia Servers,' IEEE Transactions on PARALLEL AND DISTRIBUTED SYSTEMS, Vol.15, No.10, pp.921-933, 2004 https://doi.org/10.1109/TPDS.2004.49
  18. Sang-Ho Lee, Kyu-Young Whang, Yang-Sae Moon, Wook-Shin Han, 'Dynamic Buffer Allocation in Video-on- Demand Systems,' IEEE Transactions on PARALLEL AND DISTRIBUTED SYSTEMS, Vol.15, No.6, pp.1535-1551, 2003 https://doi.org/10.1109/TKDE.2003.1245291
  19. Sooyong Kang, Heon Y. Yeom, 'Modeling the Caching Effect in Continuous Media Servers,' Multimedia Tools and Applications, 23(3), pp 203-224, 2003 https://doi.org/10.1023/A:1025702332314
  20. J. Gafsi and E.W. Biersack, 'Data Striping and Reliablity Aspects in Distributed Video Servers,' In Cluster Computing: Networks, Software Tools, and Applications, 2(1): pp.75-91, February, 1999 https://doi.org/10.1023/A:1019054003646
  21. W.J. Bolosky, R.P. Pitzgerald, J.H. Draves, 'Distributed schedule management in the Tiger video fileserver,' Proceedings of the sixteenth ACM symposium on Operating systems principles, Saint Malo France, October, 05-08, pp.212-223, 1997 https://doi.org/10.1145/269005.266692