• Title/Summary/Keyword: fault tolerant system


Intrusion Fault-Tolerant System Model in Grid Computing (그리드 컴퓨팅에서 침입감내기술 적용 방안)

  • 이진성;최창열;박기진
    • Proceedings of the Korean Information Science Society Conference / 2004.04a / pp.925-927 / 2004
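  • Intrusion tolerance refers to technology that lets a system providing critical services keep delivering normal service even when intentional attacks or unintentional faults occur. In other words, it is a technology for preventing and minimizing the damage caused by attacks that exploit unknown vulnerabilities, which existing intrusion prevention and detection techniques could not address. A grid computing environment combines high-speed networks with heterogeneous resources and is therefore inherently vulnerable to security threats. For such an environment, this paper presents a grid system model that uses existing information security and fault tolerance techniques to continue providing services safely, and examines the validity of the proposed system.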

The Design of Fault Tolerant VoD System (결함 허용성을 고려한 VoD 시스템 설계)

  • 박서림;이승원;정기동
    • Proceedings of the Korean Information Science Society Conference / 2001.04a / pp.223-225 / 2001
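  • This paper proposes fault handling methods for a fault-tolerant VoD service in a distributed environment and measures their performance. We present fault checking and handling methods for disk faults, server faults, and system faults. We propose a server switching method that guarantees reliability and real-time delivery to users. The server switching method varies according to the spare load ratio of the servers on which no fault has occurred. We also present a flow control technique that keeps an adequate amount of data in the client's buffer while server switching takes place. With the proposed methods, the client can receive VoD service without interruption even when a fault occurs.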
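The abstract states only that server switching depends on the spare load ratio of the non-faulty servers. As a minimal sketch of one way such a rule could look, the following picks the healthy server with the largest spare ratio. The `Server` class, the `pick_switch_target` function, the capacity units, and the "largest spare ratio wins" policy are all assumptions made for illustration, not the authors' method.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass
class Server:
    """Hypothetical server record; capacity and load use the same arbitrary unit."""
    name: str
    capacity: float
    load: float
    healthy: bool = True

    @property
    def spare_ratio(self) -> float:
        """Fraction of the server's capacity that is still unused."""
        return (self.capacity - self.load) / self.capacity


def pick_switch_target(servers: list[Server], stream_cost: float) -> Optional[Server]:
    """Return the healthy server with the largest spare load ratio that can
    still absorb one more stream, or None if no server qualifies."""
    candidates = [
        s for s in servers
        if s.healthy and s.capacity - s.load >= stream_cost
    ]
    return max(candidates, key=lambda s: s.spare_ratio, default=None)


servers = [
    Server("A", capacity=100, load=90),
    Server("B", capacity=100, load=40),
    Server("C", capacity=100, load=10, healthy=False),  # the failed server
]
target = pick_switch_target(servers, stream_cost=5)
print(target.name if target else "no server available")
```

Returning `None` when no server has room makes the "cannot switch" case explicit, so a caller could fall back to degrading quality or queueing the client.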

How To Support Scalability in Causal Message Logging (인과적 메시지 로깅에서 확장성 지원 방법)

  • Kim, Ki-Bom;Hwang, Chung-Sun;Yu, Heon-Chang;Shon, Jin-Gon;Jung, Soon-Young
    • Journal of KIISE:Computer Systems and Theory / v.27 no.4 / pp.362-372 / 2000
  • Causal message logging is a low-cost technique for building a distributed system that can tolerate process crash failures. Previous research on causal message logging protocols assumes that the number of processes in a fault-tolerant system is fixed. This assumption forces all processes to modify their data structures whenever a new process is added or an existing process terminates. In contrast, the approach proposed in this paper allows each process to retain the identifiers of only the processes it communicates with, instead of all processes. This mechanism enables the fault-tolerant system to operate at many different scales. Using this mechanism, we develop a new algorithm that can be adapted for recovery in existing causal message logging protocols. Our recovery algorithm is 1) a distributed technique that does not require a recovery leader, 2) a nonblocking protocol that does not force live processes to block while recovery is in progress, and 3) a novel mechanism that can tolerate failures of an arbitrary number of processes. Earlier causal message logging protocols lack one or more of these properties.


A Novel Implementation of Fault-Tolerant Ethernet NIC (Network Interface Card) Using Single MAC (단일 MAC을 이용한 자동 고장 극복 Ethernet NIC (Network Interface Card) 장치 구현)

  • Kim, Se-Mog;Pham, Hoang-Anh;Lee, Dong-Ho;Rhee, Jong Myung
    • The Journal of Korean Institute of Communications and Information Sciences / v.37C no.11 / pp.1162-1169 / 2012
  • One of the important operational requirements for a mission-critical Ethernet networked system is fault-tolerant capability. Such capability can be obtained by equipping each node in the system with a multiport Network Interface Card (NIC). A conventional NIC uses two or more Media Access Controls (MACs) and a co-processor that switches MACs whenever an active port fails. Since the co-processor requires firmware, longer fail-over switching times and degraded throughput can generally be expected. Furthermore, a system upgrade that requires a firmware revision in each tactical node is costly. In this paper we propose a novel single-MAC-based NIC that does not use a co-processor but uses only general discrete building blocks such as a MAC chip and a switching chip, which results in better performance than the conventional method. Experimental results validate our scheme.

Techniques to Guarantee Real-Time Fault Recovery in Spark Streaming Based Cloud System (Spark Streaming 기반 클라우드 시스템에서 실시간 고장 복구를 지원하기 위한 기법들)

  • Kim, Jungho;Park, Daedong;Kim, Sangwook;Moon, Yongshik;Hong, Seongsoo
    • Journal of KIISE / v.44 no.5 / pp.460-468 / 2017
  • In a real-time cloud environment, the data analysis framework plays a pivotal role. Among existing frameworks, Spark Streaming meets most real-time requirements. However, it does not meet the requirement for second-scale real-time fault recovery. Spark Streaming's fault recovery time increases in proportion to the length of the transformation history, called the lineage, because it recovers the last state data from the cumulative lineage recorded during normal operation. Therefore, the fault recovery time is not bounded. In addition, a second-scale fault recovery time is impossible to achieve because reading the initial state data from fault-tolerant storage takes tens of seconds. In this paper, we propose two techniques to solve these problems. We apply the proposed techniques to Spark Streaming 1.6.2. Experimental results show that the fault recovery time is bounded and that the average fault recovery time is reduced by up to 41.57%.

Design of Scheduler Considering Real-Time Characteristic and Fault-Tolerant in Embedded System (임베디드 시스템에서 실시간성과 결함허용을 보장하는 스케줄러 설계)

  • Jeon, Tae-Gun;Kim, Chang-Soo
    • Journal of Korea Multimedia Society / v.14 no.1 / pp.76-84 / 2011
  • Embedded systems must guarantee real-time task response, depending on the field in which they are applied. In addition, tasks in real-time systems can become faulty for various reasons. In this paper, therefore, we design a task scheduler that guarantees the deadlines of periodic tasks and provides fault tolerance for faulty tasks in an embedded system with a single processor. To provide real-time behavior, we classify tasks as periodic or aperiodic, apply the RMS (Rate Monotonic Scheduling) method to schedule periodic tasks, and guarantee the execution of aperiodic tasks by managing the surplus time obtained after analyzing the execution time of the periodic tasks. To provide fault tolerance, we manage backup time and re-execute a faulty task to restore its state.
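The abstract does not say which schedulability analysis the authors use. As a rough illustration of how periodic tasks are commonly checked under RMS, the sketch below applies the classic Liu and Layland utilization bound, U ≤ n(2^(1/n) − 1), which is a sufficient but not necessary test. The function names and the example task set are hypothetical.

```python
def rms_utilization_bound(n: int) -> float:
    """Liu and Layland bound for n periodic tasks under rate monotonic scheduling."""
    return n * (2 ** (1 / n) - 1)


def rms_schedulable(tasks: list[tuple[float, float]]) -> tuple[bool, float]:
    """tasks is a list of (execution_time, period) pairs in the same time unit.

    Returns (passes_bound_test, total_utilization). Passing the test
    guarantees all deadlines are met; failing it is inconclusive.
    """
    utilization = sum(c / t for c, t in tasks)
    return utilization <= rms_utilization_bound(len(tasks)), utilization


# Hypothetical task set: (execution time, period)
tasks = [(1, 4), (2, 6), (1, 10)]
ok, u = rms_schedulable(tasks)
print(f"utilization={u:.3f}, bound={rms_utilization_bound(len(tasks)):.3f}, ok={ok}")
print(f"surplus processor share for aperiodic or backup work: {1 - u:.3f}")
```

The unused share `1 - u` is one simple reading of the "surplus time" the abstract mentions; the paper's own accounting of surplus and backup time may differ.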

Voting System Bus Protocol for a Highly-Reliable PLC with Redundant Modules (다중화 구조 고신뢰성 제어기기를 위한 보팅 시스템버스 프로토콜)

  • Jeong, Woohyuk;Park, Jaehyun
    • Journal of Institute of Control, Robotics and Systems / v.20 no.6 / pp.689-694 / 2014
  • An SPLC (Safety Programmable Logic Controller) must be designed to meet the highest safety standard, IEEE 1E, and should guarantee a level of fault tolerance and reliability that ensures error-free operation. To satisfy these criteria, the I/O modules, communication modules, processor modules, and bus modules of the SPLC are configured in triple or dual modular redundancy. The redundant modules receive the same data and determine the final data through voting logic. Currently, the processor of each rx module performs the voting by deciding on the final data. This paper aims to improve on the current system by developing a voting system for multiple data at the system bus level. The new system bus protocol is implemented on a TCN-MVB, a deterministic network with a master-slave structure. The test results show that the proposed system outperforms the present system in terms of utilization and the performance of data exchange and voting.
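To make the voting step concrete, here is a minimal sketch of a strict-majority voter over the copies delivered by redundant modules. It illustrates only the voting logic, not the bus-level protocol or the TCN-MVB implementation described in the paper. The function name `majority_vote` and the choice to return `None` when no strict majority exists are assumptions.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def majority_vote(values: Sequence[Hashable]) -> Optional[Hashable]:
    """Return the value reported by a strict majority of redundant modules,
    or None when no value reaches a strict majority."""
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count > len(values) / 2 else None


# Triple modular redundancy: one module delivers a corrupted word.
print(majority_vote([0x5A, 0x5A, 0x5B]))  # the two agreeing copies win
# All three disagree: the voter cannot decide.
print(majority_vote([1, 2, 3]))
```

With dual redundancy the same function accepts a result only when both copies agree, which detects a single fault but cannot mask it.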

Hardware and Software Dependability Analysis of Embedded AVTMR(All Voting Triple Modular Redundancy) System (내장형 AVTMR 시스템의 하드웨어 및 소프트웨어 신뢰성 분석)

  • Kim, Hyun-Ki
    • The Journal of Korean Institute of Communications and Information Sciences / v.34 no.7B / pp.744-750 / 2009
  • In this paper, unified Markov modeling of hardware and software for an AVTMR (All Voting Triple Modular Redundancy) system is proposed and its dependability is analyzed. For hardware, the failure rate is fixed as a time-invariant parameter, whereas for software the failure rate is treated as a time-varying parameter. In particular, the dependability (reliability, availability, maintainability, safety) of the software is analyzed with G-O/NHPP for the Markov modeling. The dependability of a single system and of the AVTMR system is analyzed and simulated with the unified Markov modeling method, and the characteristics of each system are compared according to failure rate. This kind of fault-tolerant system can be applied to aircraft and life-critical systems to meet their specific requirements.
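For intuition about comparing a single system with a triplicated one at a constant hardware failure rate, the sketch below uses the textbook TMR model rather than the paper's unified Markov model: each module has reliability R(t) = exp(−λt), the voter is assumed perfect, and the system survives while at least two of three modules work, giving R_TMR = 3R² − 2R³. The function names and the failure rate used are illustrative assumptions, and software failure rates (G-O/NHPP) are not modeled.

```python
import math


def single_reliability(lam: float, t: float) -> float:
    """Reliability of one module with constant failure rate lam at time t."""
    return math.exp(-lam * t)


def tmr_reliability(lam: float, t: float) -> float:
    """Textbook 2-out-of-3 reliability with a perfect voter and no repair."""
    r = single_reliability(lam, t)
    return 3 * r**2 - 2 * r**3


lam = 1e-3  # assumed failure rate per hour, for illustration only
for t in (100, math.log(2) / lam, 2000):
    print(f"t={t:8.1f} h  single={single_reliability(lam, t):.4f}  "
          f"tmr={tmr_reliability(lam, t):.4f}")
```

Under these assumptions TMR is more reliable than a single module only while R(t) > 0.5, that is, for mission times shorter than ln 2 / λ; beyond that point the redundancy works against the system unless repair is added.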

Improving Availability of Embedded Systems Using Memory Virtualization

  • Son, Sunghoon
    • Journal of the Korea Society of Computer and Information / v.27 no.5 / pp.11-19 / 2022
  • In this paper, we propose a fault-tolerant embedded system that uses memory redundancy on a full-virtualization-based virtual machine monitor. The proposed virtual machine monitor first virtualizes the main memory of the embedded system using an efficient shadow page table scheme, so that the embedded system runs as a virtual machine on the virtual machine monitor. The virtual machine monitor then runs a backup of the embedded system as another virtual machine by copying the memory contents of the embedded system into the memory space of the backup system according to predefined schedules. When an error occurs in the target virtual machine, the corresponding standby virtual machine takes over the role of the target virtual machine and continues its operation. Performance evaluation studies show that such backups and switches of virtual machines are performed with minor performance degradation.

An Adaptive Unknown Input Observer based Actuator Fault Diagnosis (적응 미지입력 관측기에 근거한 구동기 고장의 식별)

  • Park, Tae-Geon;Ryu, Ji-Su;Lee, Kee-Sang
    • Proceedings of the KIEE Conference / 1999.07b / pp.665-667 / 1999
  • An adaptive algorithm is presented for the diagnosis of actuator faults. The concept of unknown input decoupling is combined with an adaptive observer, leading to an adaptive diagnostic observer that is robust in the presence of unmeasurable terms such as uncertainties. The observation error equation for the adaptive diagnostic observer does not depend on the effect of uncertainties and is used to construct an adaptive diagnostic algorithm that provides estimates of the actuator gains, which can be obtained directly using the augmented error technique. The simulation results indicate that the proposed algorithm is more realistic in the sense that better robustness can be assured without knowledge of the uncertainties, and that it is potentially useful in the development of fault-tolerant control systems.
