• Title/Summary/Keyword: Software fault tolerance

Search Result 89, Processing Time 0.031 seconds

A Study on the Application of Virtual Track Circuit by Considering Software Fault Tolerance Techniques in Depot (검수고에서 소프트웨어 결함허용기법을 고려한 가상궤도회로의 적용에 대한 연구)

  • Lee, Myoung-Chol;Ko, Young-Hwan;Kim, Min-Seok;Lee, Jong-Woo
    • Journal of the Korean Society for Railway
    • /
    • v.15 no.2
    • /
    • pp.122-128
    • /
    • 2012
  • Considering structure of depot, it is impossible to install the track circuit systems due to iron-beam. Because rails and earth are connected by the iron-beam, there is much leakage current. So, it is hard to apply the track circuit systems. Thus, when trains go to the depot, sign which indicates existence of trains is used manually. In case of wrong sign, accidents occur such as train crash, derailment etc. Currently, location of trains has been found by using optical sensor in the depot to prevent the accidents. However, it costs a great deal to install and maintain the optical sensor. Therefore, this method is hardly used in train operation institutes. In this paper, virtual track circuit systems are introduced by using software program in the depot. Also, algorithm of the virtual track circuit systems is proposed. In case that signal is handled to the depot which is occupied by the train, safety is ensured by indicating sign which means existence of trains and stop signal. Also, proper fault tolerance techniques are proposed to the software by analyzing reliability and availability.

A study on the Correlation Hazard Analysis for Signaling System Safety (안전성 확보를 위한 위험원 분석 기법간 상관관계에 대한 연구)

  • Han, Chan-Hee;Lee, Young-Soo;Ahn, Jin;Jo, Woo-Sic
    • Proceedings of the KSR Conference
    • /
    • 2007.11a
    • /
    • pp.638-645
    • /
    • 2007
  • Computers are increasingly being introduced into safety and reliability critical systems. The safe and reliable operation of these systems cannot be taken for granted. Malfunctions of these systems can have potentially catastrophic consequences and they have already been involved in serious accidents. Software fault prevention, fault tolerance, fault removal and fault forecasting are the techniques to be used, implemented and verified for embedded software in critical systems as the contributors to safety and reliability of the software. To use them when developing a software product, a relationship must be established between them and the development processes, the methods and techniques to be used to develop software, as well as with the different product architectures. Railroad signaling system software is a safety-critical embedded software with realtime and high reliability requirements. The primary purpose of the safety management is to prevent the loss of lives or physical damages arising from potential hazards in the railroad signaling system. This study provides a systematic approach to analysis of potential hazards for their management during the system life cycle to assure the identification and definition of the most appropriate hazards.

  • PDF

FAULT-TOLERANT DESIGN FOR ADVANCED DIVERSE PROTECTION SYSTEM

  • Oh, Yang Gyun;Jeong, Kin Kwon;Lee, Chang Jae;Lee, Yoon Hee;Baek, Seung Min;Lee, Sang Jeong
    • Nuclear Engineering and Technology
    • /
    • v.45 no.6
    • /
    • pp.795-802
    • /
    • 2013
  • For the improvement of APR1400 Diverse Protection System (DPS) design, the Advanced DPS (ADPS) has recently been developed to enhance the fault tolerance capability of the system. Major fault masking features of the ADPS compared with the APR1400 DPS are the changes to the channel configuration and reactor trip actuation equipment. To minimize the fault occurrences within the ADPS, and to mitigate the consequences of common-cause failures (CCF) within the safety I&C systems, several fault avoidance design features have been applied in the ADPS. The fault avoidance design features include the changes to the system software classification, communication methods, equipment platform, MMI equipment, etc. In addition, the fault detection, location, containment, and recovery processes have been incorporated in the ADPS design. Therefore, it is expected that the ADPS can provide an enhanced fault tolerance capability against the possible faults within the system and its input/output equipment, and the CCF of safety systems.

Fault Tolerant Operation of CHB Multilevel Inverters Based on the SVM Technique Using an Auxiliary Unit

  • Kumar, B. Hemanth;Lokhande, Makarand M.;Karasani, Raghavendra Reddy;Borghate, Vijay B.
    • Journal of Power Electronics
    • /
    • v.18 no.1
    • /
    • pp.56-69
    • /
    • 2018
  • In this paper, an improved Space Vector Modulation (SVM) based fault tolerant operation on a nine-level Cascaded H-Bridge (CHB) inverter with an additional backup circuit is proposed. Any type of fault in a power converter may result in a power interruption and productivity loss. Three different faults on H-bridge modules in all three phases based on the SVM approach are investigated with diagrams. Any fault in an inverter phase creates an unbalanced output voltage, which can lead to instability in the system. An additional auxiliary unit is connected in series to the three phase cascaded H-bridge circuit. With the help of this and the redundant switching states in SVM, the CHB inverter produces a balanced output with low harmonic distortion. This ensures high DC bus utilization under numerous fault conditions in three phases, which improves the system reliability. Simulation results are presented on three phase nine-level inverter with the automatic fault detection algorithm in the MATLAB/SIMULINK software tool, and experimental results are presented with DSP on five-level inverter to validate the practicality of the proposed SVM fault tolerance strategy on a CHB inverter with an auxiliary circuit.

Ad hoc Software Rejuvenation for Survivability

  • Khin Mi Mi Aung;Park, Jong-Sou
    • Proceedings of the Korea Institutes of Information Security and Cryptology Conference
    • /
    • 2003.12a
    • /
    • pp.141-145
    • /
    • 2003
  • We propose the model of Software Rejuvenation methodology, which is applicable for survivability. Software rejuvenation is a proactive fault management technique and being used in fault tolerant systems as a cost effective technique for dealing with software faults. Survivability focuses on delivery of essential services and preservation of essential assets, even systems are penetrated and compromised. Thus, our objective is to detect the intrusions in a real time and survive in face of such attacks. As we deterrent against an attack in a system level, the Intrusion tolerance could be maximized at the target environment. We address the optimal time to execute ad hoc software rejuvenation and we compute it by using the semi Markov process. This is one way that could be really frustrated and deterred the attacks, as the attacker can't make their progress. This Software Rejuvenation method can be very effective under the assumption of unknown attacks. In this paper, we compute the optimum time to perform an ad hoc Software Rejuvenation through intrusions.

  • PDF

Development of Leader Selection Algorithm to Support Fault Tolerance of Integrated Management Systems in the Naval Combat System (함정 전투체계에서 통합 통제 시스템의 고장 감내를 지원하기 위한 리더 선정 알고리즘 개발)

  • Seo, Yongjin;Jo, Jun Young;Kim, Hyeon Soo;Go, Youngkeun;Kim, Chum-Soo
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.22 no.3
    • /
    • pp.382-391
    • /
    • 2019
  • The naval combat system is a distributed system in which various subsystems are integrated and operated together. The integrated management system(IMS) is a software system for systematically and consistently managing the application software which control and operate various devices in such a combat system. Since the malfunction or failure of such an IMS can disable the entire combat system, the IMS is more important than other application software of the combat system. In this paper, we propose a method to guarantee the stable and correct operation of the combat system. To this end, we propose a redundancy scheme composed of one leader and several followers so as to tolerate the failure situation of the IMS. We also propose a leader selection algorithm to select a new leader when the leader fails and can no longer perform its role. To verify the validity of the study, we verify the fault tolerance behavior of the system and the accuracy of the leader selection algorithm.

Analytic Model for Optimal Checkpoints in Mobile Real-time Systems

  • Lim, Sung-Hwa;Lee, Byoung-Hoon;Kim, Jai-Hoon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.8
    • /
    • pp.3689-3700
    • /
    • 2016
  • It is not practically feasible to apply hardware-based fault-tolerant schemes, such as hardware replication, in mobile devices. Therefore, software-based fault-tolerance techniques, such as checkpoint and rollback schemes, are required. In checkpoint and rollback schemes, the optimal checkpoint interval should be applied to obtain the best performance. Most previous studies focused on minimizing the expected execution time or response time for completing a given task. Currently, most mobile applications run in real-time environments. Therefore, it is extremely essential for mobile devices to employ optimal checkpoint intervals as determined by the real-time constraints of tasks. In this study, we tackle the problem of determining the optimal inter-checkpoint interval of checkpoint and rollback schemes to maximize the deadline meet ratio in real-time systems and to build a probabilistic cost model. From this cost model, we can numerically find the optimal checkpoint interval using mathematical tools. The performance of the proposed solution is evaluated using analytical estimates.

A study on high availability of the linux clustering web server (리눅스 클러스터링 웹 서버의 고가용성에 대한 연구)

  • 박지현;이상문;홍태화;김학배
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2000.10a
    • /
    • pp.88-88
    • /
    • 2000
  • As more and more critical commercial applications move on the Internet, providing highly available servers becomes increasingly important. One of the advantages of a clustered system is that it has hardware and software redundancy. High availability can be provided by detecting node or daemon failure and reconfiguring the system appropriately so that the workload can be taken over bi the remaining nodes in the cluster. This paper presents how to provide the guaranteeing high availability of clustering web server. The load balancer becomes a single failure point of the whole system. In order to prevent the failure of the load balancer, we setup a backup server using heartbeat, fake, mon, and checkpointing fault-tolerance method. For high availability of file servers in the cluster, we setup coda file system. Coda is a advanced network fault-tolerance distributed file system.

  • PDF

Design and Implementation of Adaptive Fault-Tolerant Management System over Grid (그리드 환경의 적응형 오류 극복 관리 시스템 설계 및 구현)

  • Kim, Eun-Kyung;Kim, Jeu-Young;Kim, Yoon-Hee
    • The KIPS Transactions:PartA
    • /
    • v.15A no.3
    • /
    • pp.151-154
    • /
    • 2008
  • A middleware in grid computing environment is required to support seamless on-demand services over diverse resource situations in order to meet various user requirements [1]. Since grid computing applications need situation-aware middleware services in this environment. In this paper, we propose a semantic middleware architecture to support dynamic software component reconfiguration based fault and service ontology to provide fault-tolerance in a grid computing environment. Our middleware includes autonomic management to detect faults, analyze causes of them, and plan semantically meaningful strategies to recover from the failure using pre-defined fault and service ontology trees. We implemented a referenced prototype, Web-service based Application Execution Environment(Wapee), as a proof-of-concept, and showed the efficiency in runtime recovery.

Fault Tolerant Control Strategy for Four Wheel Steer-by-Wire Systems (4륜 조향을 이용한 Steer-by-Wire 시스템의 고장 허용 제어 전략)

  • Seonghun Noh;Baek-soon Kwon
    • Journal of Auto-vehicle Safety Association
    • /
    • v.15 no.2
    • /
    • pp.13-20
    • /
    • 2023
  • This paper presents a fault tolerant control strategy for Steer-by-Wire (SbW) systems. Among many problems to be solved before commercialization of SbW systems, maintaining reliability and fault tolerance in such systems are the most pressing issues. In most previous studies, dual steering motors are used to achieve actuation redundancy. However, relatively few studies have been conducted to introduce fault tolerant control strategies using rear wheel steering system. In this work, an actuator fault in front wheel steering is compensated by active rear wheel steering. The proposed fault tolerant control algorithm consists of disturbance observer and sliding mode control. The fault tolerant control performance of the proposed approach is validated via computer simulation studies with Carsim vehicle dynamics software and MATLAB/Simulink.