• Title/Summary/Keyword: Fault-Tolerant System

Robust Backup Path Selection in Overlay Routing with Bloom Filters

  • Zhou, Xiaolei;Guo, Deke;Chen, Tao;Luo, Xueshan
    • KSII Transactions on Internet and Information Systems (TIIS) / v.7 no.8 / pp.1890-1910 / 2013
  • Routing overlays offer an ideal way to improve end-to-end communication performance by deriving a backup path for any node pair. This paper addresses the challenging problem of selecting, for any node pair, a backup path that bypasses failures on the default path with high probability. Our trace-driven evaluation of existing backup path selection approaches shows that the backup and default paths of a node pair overlap with high probability and hence usually fail simultaneously. Consequently, such approaches fail to derive a robust backup path that can take over when the default path fails. We propose a three-phase RBPS approach to identify a proper and robust backup path. It uses traceroute probing to obtain fine-grained topology information, and systematically employs the grid quorum system and the Bloom filter to reduce the resulting communication overhead. Two criteria for the backup path, delay and average fault-tolerant ability, are proposed to evaluate the RBPS approach. Extensive trace-driven evaluations show that replacing existing approaches with ours improves the fault-tolerant ability of the backup path by about 60%, while the delay gain ratio is concentrated around 14%. Our approach can therefore derive a more robust and available backup path for any node pair than existing approaches, which matters more than finding the backup path with the lowest delay relative to the default path.
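
As a rough illustration of how a Bloom filter can summarize the routers on a default path so that candidate backup paths can be checked for overlap without exchanging full hop lists, here is a minimal sketch. The filter size, the hashing scheme, the router names, and the `shared_hops` helper are assumptions made for illustration; they are not taken from the RBPS design described in the abstract.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter backed by a Python integer used as a bit array."""

    def __init__(self, num_bits=256, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        # May return a false positive, never a false negative.
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def shared_hops(default_filter, candidate_path):
    """Count hops of candidate_path that the filter reports as on the default path."""
    return sum(1 for hop in candidate_path if hop in default_filter)


# Hypothetical intermediate routers, e.g. as learned from traceroute probes.
default_path = ["r1", "r2", "r3", "r4"]
candidates = {
    "via_A": ["r1", "r2", "r9", "r4"],  # overlaps heavily with the default path
    "via_B": ["r5", "r6", "r7", "r8"],  # disjoint from the default path
}

path_filter = BloomFilter()
for hop in default_path:
    path_filter.add(hop)

# Prefer the candidate that appears to share the fewest hops with the default path.
best = min(candidates, key=lambda name: shared_hops(path_filter, candidates[name]))
print(best)
```

Because a Bloom filter can report false positives, the overlap count is an upper bound on the true number of shared hops; a disjoint candidate can look slightly worse than it is, but a truly overlapping one is never missed.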

Paper Duplication Method Supported by Task (태스크 기반 이중화 방안)

  • Lee, Jong-Chan;Park, Sang-Joon;Kang, Kwon-Il
    • The Journal of Korean Institute of Communications and Information Sciences / v.27 no.1C / pp.103-111 / 2002
  • In the RNC of IMT-2000, main control processors such as the ASP, ACP, and OMP are responsible for call control, so they must provide high reliability and real-time behavior. Real-time fault tolerance for these processors therefore needs to be studied. In this paper, we propose a Task-based duplication method in which Tasks on the active side operate on a per-message basis, send the updated data to the standby side after each operation, and log the message to the standby side for recovery during take-over. This scheme reduces dual-down failures and the complexity of the synchronization procedure, and it synchronizes more accurately because the Tasks themselves control system synchronization. This paper also proposes fault detection and fault handling methods for an effective implementation of Task-based duplication. These methods focus on increasing the fault detection rate and on preventing faulty data from being sent to the standby side in the first place.

Reliability Analysis of a System with Redundancy Management Based on Monte-Carlo Probability Model (다중구조관리자 특성이 반영된 확률모델 기반의 몬테카를로 신뢰도 해석 기법 연구)

  • Kim, Sung-Su;Park, Sang-Hyuk;Kim, Sung-Hwan;Choi, Kee-Young;Park, Choon-Bae;Ha, Cheol-Keun
    • Journal of Institute of Control, Robotics and Systems / v.17 no.11 / pp.1132-1137 / 2011
  • Critical systems that require high reliability feature fault-tolerant redundancy. Conventional analytical reliability analysis methods based on the Reliability Block Diagram do not adequately reflect the characteristics of the redundancy management system and are not suitable for such applications. This paper uses the Monte Carlo method to calculate the reliability of complicated redundant systems. The method was first validated on cases with analytical solutions. The tool was then successfully applied to analyze the reliability of flight control systems that use a voter as the redundancy management system.
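
The validation step mentioned above can be sketched for the simplest voted configuration. The example below assumes three independent channels with exponentially distributed lifetimes and a perfect 2-of-3 voter, for which the analytical reliability is R(t) = 3r^2 - 2r^3 with r = exp(-λt). The failure rate, mission time, and function names are illustrative assumptions, not values or code from the paper.

```python
import math
import random


def tmr_reliability_analytic(rate, t):
    """Closed-form reliability of a 2-of-3 system with a perfect voter."""
    r = math.exp(-rate * t)  # reliability of one channel at time t
    return 3 * r**2 - 2 * r**3


def tmr_reliability_monte_carlo(rate, t, trials=100_000, seed=1):
    """Estimate the same quantity by sampling channel lifetimes."""
    rng = random.Random(seed)
    survived = 0
    for _ in range(trials):
        lifetimes = sorted(rng.expovariate(rate) for _ in range(3))
        # The voted output is lost when the second channel fails,
        # so the system lifetime is the median of the three lifetimes.
        if lifetimes[1] > t:
            survived += 1
    return survived / trials


rate = 1e-3           # assumed failure rate per hour
mission_time = 500.0  # assumed mission time in hours

analytic = tmr_reliability_analytic(rate, mission_time)
estimate = tmr_reliability_monte_carlo(rate, mission_time)
print(f"analytic={analytic:.4f}  monte_carlo={estimate:.4f}")
```

Once the sampled estimate agrees with the closed form on a case like this, the same sampling loop can be extended with behavior that a block diagram does not capture easily, such as an imperfect voter or channel reconfiguration.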

Concurrency Control Method to Provide Transactional Processing for Cloud Data Management System

  • Choi, Dojin;Song, Seokil
    • International Journal of Contents / v.12 no.1 / pp.60-64 / 2016
  • As new applications of cloud data management systems (CDMS) such as online games, collaborative editing, and social networks increase, transaction processing capabilities are required of CDMS. Several transaction processing methods for CDMS have been proposed, but they have problems: some provide only limited transaction processing capabilities, and some are hard to integrate with existing CDMSs. In this paper, we propose a new concurrency control method that supports transaction processing for CDMS and solves these problems. The proposed method is designed and implemented on Spark, an in-memory distributed processing framework. It uses the RDD (Resilient Distributed Dataset) model to provide fault tolerance for data in main memory. In the proposed method, the database stored in the CDMS is loaded into main memory managed by Spark, and the loaded data set is then transformed into an RDD. In addition, we propose a multi-version concurrency control method that exploits the immutability of RDDs. Finally, we performed experiments to show the feasibility of the proposed method.
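
The idea of building multi-version concurrency control on immutable data can be illustrated without Spark. The sketch below is plain Python and does not use the Spark or RDD API; it only mimics immutability with read-only snapshots. The `VersionedStore` class and its first-committer-wins conflict rule are assumptions chosen for illustration, not the method proposed in the paper.

```python
from types import MappingProxyType


class VersionedStore:
    """Keeps every committed state as an immutable snapshot."""

    def __init__(self, initial=None):
        # Version 0 is the initial state; snapshots are read-only views.
        self._versions = [MappingProxyType(dict(initial or {}))]
        self._last_write = {}  # key -> version that last wrote it

    def latest_version(self):
        return len(self._versions) - 1

    def snapshot(self, version=None):
        """Return the read-only state as of the given version."""
        if version is None:
            version = self.latest_version()
        return self._versions[version]

    def commit(self, read_version, writes):
        """Commit writes made against read_version.

        Returns the new version number, or None if another transaction
        wrote one of the same keys after read_version (conflict).
        """
        for key in writes:
            if self._last_write.get(key, 0) > read_version:
                return None
        new_state = dict(self._versions[-1])
        new_state.update(writes)
        self._versions.append(MappingProxyType(new_state))
        new_version = self.latest_version()
        for key in writes:
            self._last_write[key] = new_version
        return new_version


store = VersionedStore({"x": 1, "y": 2})
v0 = store.latest_version()
old_view = store.snapshot(v0)            # a reader pins version 0

v1 = store.commit(v0, {"x": 10})         # first writer succeeds
conflict = store.commit(v0, {"x": 99})   # second writer of x is rejected
v2 = store.commit(v0, {"y": 5})          # disjoint key still commits

print(old_view["x"], store.snapshot()["x"], conflict)
```

Readers never block writers here because a pinned snapshot is never modified; each commit creates a new version instead of updating data in place.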

Design of AVTMR system and Evaluation of RAM (Reliability, Availability, Maintainability) (AVTMR 시스템의 설계 및 RAM 평가)

  • 김현기;이기서
    • The Journal of Korean Institute of Communications and Information Sciences / v.25 no.12B / pp.2016-2024 / 2000
  • In this paper, we developed an AVTMR (All Voting Triple Modular Redundancy) system that can operate without being affected by faults, and we compared and evaluated the AVTMR and SS (Single System) systems using failure rates calculated on the basis of MILSPEC-217F. The system was built on the MC68000 using a triplicated majority voter. We evaluated the reliability, availability, and maintainability of the system with a Markov model, and we also calculated the MTTF (Mean Time to Failure) to obtain the system lifetime. Simulation showed that the designed AVTMR system outperforms the SS in the overall system evaluation. Moreover, because the AVTMR system is fault tolerant, it can be applied to systems on which human lives depend, such as railway, ship, and aircraft systems.
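
To make the majority voting and the Markov-model MTTF concrete, here is a minimal sketch. It models a single ideal voter rather than the triplicated voters of AVTMR, and it uses a textbook three-state Markov chain (three good modules, two good modules, failed) with per-module failure rate λ and repair rate μ, for which MTTF = (5λ + μ) / (6λ²). The rates and function names are assumptions for illustration and do not come from the paper.

```python
from collections import Counter


def majority_vote(a, b, c):
    """Return the value produced by at least two of the three modules."""
    value, count = Counter([a, b, c]).most_common(1)[0]
    if count < 2:
        raise ValueError("no majority: all three modules disagree")
    return value


def mttf_single(failure_rate):
    """MTTF of one module with a constant failure rate."""
    return 1.0 / failure_rate


def mttf_tmr_markov(failure_rate, repair_rate=0.0):
    """MTTF of TMR from a three-state Markov chain.

    State 0 (3 good) -> state 1 (2 good) at rate 3*lambda.
    State 1 -> failed at rate 2*lambda, or back to state 0 at rate mu.
    Solving the mean-time-to-absorption equations gives
    (5*lambda + mu) / (6*lambda**2).
    """
    lam, mu = failure_rate, repair_rate
    return (5 * lam + mu) / (6 * lam**2)


lam = 1e-3  # assumed failures per hour
mu = 0.1    # assumed repairs per hour

print(majority_vote(1, 1, 0))
print(mttf_single(lam), mttf_tmr_markov(lam), mttf_tmr_markov(lam, mu))
```

With these assumed rates, the chain without repair has a shorter MTTF than a single module (5/(6λ) versus 1/λ), while repair of a failed module extends it well beyond the single-system value. This is one reason such evaluations use a Markov model with maintainability included rather than MTTF alone.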

Implementation of High-Reliable MVB Network for Safety System of Nuclear Power Plant (원자력발전소 안전계통용 고신뢰성 MVB 네트워크 구현)

  • Sul, Jae-Yoon;Kim, Ki-Chang;Kim, Yoo-Sung;Park, Jae-Hyun
    • The Transactions of The Korean Institute of Electrical Engineers / v.61 no.6 / pp.859-864 / 2012
  • The computer network plays an important role in modern digital controllers within the safety system of a nuclear power plant. For reliable, real-time data communication between controllers, this paper proposes a modified, highly reliable MVB (multi-function vehicle bus) as the main control network for a safety system of a nuclear power plant. Compared with the standard MVB, the proposed network supports state-based communication to ensure deterministic communication latency and provides very fast network recovery when the bus master fails. This paper also presents implementation results obtained on an FPGA-based testbed.

A study on the comparision of AVTMR (All Voting Triple Modular Redundancy) and Dual-Duplex system (AVTMR 과 듀얼 듀플렉스 시스템 비교에 관한 연구)

  • 김현기;신석균;이기서
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.6A / pp.1067-1077 / 2001
  • In this paper, we designed an AVTMR (All Voting Triple Modular Redundancy) system and a dual-duplex system, both of which can operate without being affected by faults, and compared their RAMS (Reliability, Availability, Maintainability, Safety) by evaluating each system. The AVTMR system was designed using a triplicated voter, and the dual-duplex system was designed using a comparator. Each system was designed to compare data at the bus level. For the evaluation, component failure rates were based on MILSPEC-217F and obtained with RELEX6.0, and the RAMS of each system was evaluated using a Markov model. Each system was designed on the MC68000, and the paper provides a basis, in terms of RAMS and MTTF (Mean Time To Failure), for weighing the cost of each system and choosing the areas in which each is preferable. As fault-tolerant systems, the AVTMR and dual-duplex systems can be applied to systems directly related to human life, such as high-speed railway and aircraft systems.

$H_{\infty}$ Controller Design for Electromagnetic Suspension System using LMIs (LMI를 이용한 자기부상 시스템의 $H_{\infty}$ 제어기 설계)

  • Jang, S.M.;Sung, S.Y.;Sung, H.K.;Kim, B.S.
    • Proceedings of the KIEE Conference / 2000.11b / pp.280-283 / 2000
  • In this paper, a fault-tolerant control problem is considered for a class of nonlinear systems formulated in a gain-scheduling form, using an LMI-based $H_{\infty}$ control technique. Key benefits of the proposed scheme are demonstrated in simulations of an electromagnetic suspension system with actuator and/or sensor failures, and the method is compared with conventional state-feedback and output-feedback controllers. The proposed control scheme clearly shows improved output performance in comparison with the conventional methods.

Synchronize Ethernet-based Fault Injection Algorithm Implementation for Intelligent Automotive Network (차량용 지능형 네트워크에서의 동기식 이더넷중심 오류 주입 알고리즘 구현)

  • Jang, Eunji;Kim, Inyoung;Lee, Woongjae
    • Journal of Internet Computing and Services / v.17 no.4 / pp.43-50 / 2016
  • In this paper, we propose an Ethernet protocol, which is expected to attract wide interest for intelligent automotive networks, together with a fault-tolerance algorithm for data transfer over it, and we implement and verify both through simulation and experiments. The results demonstrate the usefulness of the system for application to existing automotive communication systems. To stand in for actual real-time automotive data, we randomly generated payload data in a standard format for the experiments. Compared against the performance of the existing algorithms we implemented, the proposed algorithm was confirmed to be effective across the full range from single-type data to mixed (hybrid-type) data.

Efficient Fault-Tolerant Multicast on Hypercube Multicomputer System (하이퍼 큐브 컴퓨터에서 효과적인 오류 허용 다중전송기법)

  • 명훈주;김성천
    • Journal of KIISE:Computer Systems and Theory / v.30 no.5_6 / pp.273-279 / 2003
  • Hypercube multicomputers have drawn considerable attention from researchers because of their regular structure and short diameter. One key to hypercube performance is the efficiency of communication among processors. Among the various communication patterns, multicast is important and appears in a variety of applications such as data replication and signal processing. As the number of processors increases, so does the probability that faulty components occur, so an efficient scheme for multicasting messages in the presence of faulty components is desirable. Fault-tolerant routing and multicast schemes can be classified by the information they use as local-information-based, global-information-based, and limited-information-based. In general, limited information is easy to obtain and maintain because it is compressed into a concise format. In this paper, we propose a new routing scheme and a new multicast scheme that use the recently proposed full reachability information scheme together with a new local information scheme. The proposed multicast scheme increases the multicast success probability and reduces the number of deroute cases. Experiments show that the multicast success probability increases by at least 15% compared to the previous method.
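
To show what routing around faulty nodes and derouting mean in a hypercube, here is a minimal unicast sketch. It is a plain depth-first search that tries distance-reducing dimensions first and takes a deroute (a step along a dimension in which the current node already matches the destination) only when forced. It is not the reachability-information scheme proposed in the paper; the function names and the fault sets are assumptions for illustration.

```python
def route(src, dst, dim, faulty):
    """Return a fault-free path from src to dst in a dim-cube, or None.

    Nodes are integers in [0, 2**dim); neighbors differ in exactly one bit.
    """
    if src in faulty or dst in faulty:
        return None

    def visit(node, visited):
        if node == dst:
            return [node]
        diff = node ^ dst
        # Dimensions where node and dst differ shorten the Hamming distance.
        preferred = [d for d in range(dim) if (diff >> d) & 1]
        # The remaining dimensions are deroutes, used only as a fallback.
        spare = [d for d in range(dim) if not ((diff >> d) & 1)]
        for d in preferred + spare:
            nxt = node ^ (1 << d)
            if nxt in faulty or nxt in visited:
                continue
            visited.add(nxt)
            tail = visit(nxt, visited)
            if tail is not None:
                return [node] + tail
        return None

    return visit(src, {src})


def is_valid_path(path, faulty):
    """Check that consecutive nodes are neighbors and no node is faulty."""
    hops_ok = all(bin(a ^ b).count("1") == 1 for a, b in zip(path, path[1:]))
    return hops_ok and not any(node in faulty for node in path)


fault_free = route(0b000, 0b111, 3, faulty=set())
derouted = route(0b000, 0b011, 3, faulty={0b001, 0b010})
isolated = route(0b000, 0b111, 3, faulty={0b001, 0b010, 0b100})
print(fault_free, derouted, isolated)
```

A multicast can be approximated by calling `route` once per destination, although that gives up the message sharing that a dedicated multicast scheme provides.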