• 제목/요약/키워드: Software fault-tolerance

검색결과 89건 처리시간 0.027초

검수고에서 소프트웨어 결함허용기법을 고려한 가상궤도회로의 적용에 대한 연구 (A Study on the Application of Virtual Track Circuit by Considering Software Fault Tolerance Techniques in Depot)

  • 이명철;고영환;김민석;이종우
    • 한국철도학회논문집
    • /
    • 제15권2호
    • /
    • pp.122-128
    • /
    • 2012
  • 차량기지 검수고에 구조를 고려하였을 시, 철제빔으로 인해 열차를 검지하기 위한 궤도회로를 설치하기가 어렵고, 레일과 대지가 철제빔으로 연결되어 누설전류가 많아지므로 궤도회로 시스템을 적용할 수 없다. 그러므로 열차가 검수고에 들어오면 수작업으로 차량이 있다는 표시를 하여 사용하고 있으며, 잘못 취급할 시에는 열차충돌 혹은 탈선으로 이어지는 사고가 발생할 수 있다. 본 논문에서는 검수고에서 소프트웨어 프로그램을 이용하여 가상궤도회로를 적용하였다. 가상궤도회로의 알고리즘을 제시하였으며, 열차가 점유된 검수고 방향으로 신호를 취급하는 경우에 검수고 열차점유 표시 및 신호기 정지신호 표시로 인해 검수고에서 열차의 안전성을 확보하였다. 또한 프로그램의 신뢰도 및 가용도를 분석하여 소프트웨어에 적합한 결함허용 기법을 적용하였다.

안전성 확보를 위한 위험원 분석 기법간 상관관계에 대한 연구 (A study on the Correlation Hazard Analysis for Signaling System Safety)

  • 한찬희;이영수;안진;조우식
    • 한국철도학회:학술대회논문집
    • /
    • 한국철도학회 2007년도 추계학술대회 논문집
    • /
    • pp.638-645
    • /
    • 2007
  • Computers are increasingly being introduced into safety and reliability critical systems. The safe and reliable operation of these systems cannot be taken for granted. Malfunctions of these systems can have potentially catastrophic consequences and they have already been involved in serious accidents. Software fault prevention, fault tolerance, fault removal and fault forecasting are the techniques to be used, implemented and verified for embedded software in critical systems as the contributors to safety and reliability of the software. To use them when developing a software product, a relationship must be established between them and the development processes, the methods and techniques to be used to develop software, as well as with the different product architectures. Railroad signaling system software is a safety-critical embedded software with realtime and high reliability requirements. The primary purpose of the safety management is to prevent the loss of lives or physical damages arising from potential hazards in the railroad signaling system. This study provides a systematic approach to analysis of potential hazards for their management during the system life cycle to assure the identification and definition of the most appropriate hazards.

  • PDF

FAULT-TOLERANT DESIGN FOR ADVANCED DIVERSE PROTECTION SYSTEM

  • Oh, Yang Gyun;Jeong, Kin Kwon;Lee, Chang Jae;Lee, Yoon Hee;Baek, Seung Min;Lee, Sang Jeong
    • Nuclear Engineering and Technology
    • /
    • 제45권6호
    • /
    • pp.795-802
    • /
    • 2013
  • For the improvement of APR1400 Diverse Protection System (DPS) design, the Advanced DPS (ADPS) has recently been developed to enhance the fault tolerance capability of the system. Major fault masking features of the ADPS compared with the APR1400 DPS are the changes to the channel configuration and reactor trip actuation equipment. To minimize the fault occurrences within the ADPS, and to mitigate the consequences of common-cause failures (CCF) within the safety I&C systems, several fault avoidance design features have been applied in the ADPS. The fault avoidance design features include the changes to the system software classification, communication methods, equipment platform, MMI equipment, etc. In addition, the fault detection, location, containment, and recovery processes have been incorporated in the ADPS design. Therefore, it is expected that the ADPS can provide an enhanced fault tolerance capability against the possible faults within the system and its input/output equipment, and the CCF of safety systems.

Fault Tolerant Operation of CHB Multilevel Inverters Based on the SVM Technique Using an Auxiliary Unit

  • Kumar, B. Hemanth;Lokhande, Makarand M.;Karasani, Raghavendra Reddy;Borghate, Vijay B.
    • Journal of Power Electronics
    • /
    • 제18권1호
    • /
    • pp.56-69
    • /
    • 2018
  • In this paper, an improved Space Vector Modulation (SVM) based fault tolerant operation on a nine-level Cascaded H-Bridge (CHB) inverter with an additional backup circuit is proposed. Any type of fault in a power converter may result in a power interruption and productivity loss. Three different faults on H-bridge modules in all three phases based on the SVM approach are investigated with diagrams. Any fault in an inverter phase creates an unbalanced output voltage, which can lead to instability in the system. An additional auxiliary unit is connected in series to the three phase cascaded H-bridge circuit. With the help of this and the redundant switching states in SVM, the CHB inverter produces a balanced output with low harmonic distortion. This ensures high DC bus utilization under numerous fault conditions in three phases, which improves the system reliability. Simulation results are presented on three phase nine-level inverter with the automatic fault detection algorithm in the MATLAB/SIMULINK software tool, and experimental results are presented with DSP on five-level inverter to validate the practicality of the proposed SVM fault tolerance strategy on a CHB inverter with an auxiliary circuit.

Ad hoc Software Rejuvenation for Survivability

  • Khin Mi Mi Aung;Park, Jong-Sou
    • 한국정보보호학회:학술대회논문집
    • /
    • 한국정보보호학회 2003년도 동계학술대회
    • /
    • pp.141-145
    • /
    • 2003
  • We propose the model of Software Rejuvenation methodology, which is applicable for survivability. Software rejuvenation is a proactive fault management technique and being used in fault tolerant systems as a cost effective technique for dealing with software faults. Survivability focuses on delivery of essential services and preservation of essential assets, even systems are penetrated and compromised. Thus, our objective is to detect the intrusions in a real time and survive in face of such attacks. As we deterrent against an attack in a system level, the Intrusion tolerance could be maximized at the target environment. We address the optimal time to execute ad hoc software rejuvenation and we compute it by using the semi Markov process. This is one way that could be really frustrated and deterred the attacks, as the attacker can't make their progress. This Software Rejuvenation method can be very effective under the assumption of unknown attacks. In this paper, we compute the optimum time to perform an ad hoc Software Rejuvenation through intrusions.

  • PDF

함정 전투체계에서 통합 통제 시스템의 고장 감내를 지원하기 위한 리더 선정 알고리즘 개발 (Development of Leader Selection Algorithm to Support Fault Tolerance of Integrated Management Systems in the Naval Combat System)

  • 서용진;조준영;김현수;고영근;김점수
    • 한국군사과학기술학회지
    • /
    • 제22권3호
    • /
    • pp.382-391
    • /
    • 2019
  • The naval combat system is a distributed system in which various subsystems are integrated and operated together. The integrated management system(IMS) is a software system for systematically and consistently managing the application software which control and operate various devices in such a combat system. Since the malfunction or failure of such an IMS can disable the entire combat system, the IMS is more important than other application software of the combat system. In this paper, we propose a method to guarantee the stable and correct operation of the combat system. To this end, we propose a redundancy scheme composed of one leader and several followers so as to tolerate the failure situation of the IMS. We also propose a leader selection algorithm to select a new leader when the leader fails and can no longer perform its role. To verify the validity of the study, we verify the fault tolerance behavior of the system and the accuracy of the leader selection algorithm.

Analytic Model for Optimal Checkpoints in Mobile Real-time Systems

  • Lim, Sung-Hwa;Lee, Byoung-Hoon;Kim, Jai-Hoon
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권8호
    • /
    • pp.3689-3700
    • /
    • 2016
  • It is not practically feasible to apply hardware-based fault-tolerant schemes, such as hardware replication, in mobile devices. Therefore, software-based fault-tolerance techniques, such as checkpoint and rollback schemes, are required. In checkpoint and rollback schemes, the optimal checkpoint interval should be applied to obtain the best performance. Most previous studies focused on minimizing the expected execution time or response time for completing a given task. Currently, most mobile applications run in real-time environments. Therefore, it is extremely essential for mobile devices to employ optimal checkpoint intervals as determined by the real-time constraints of tasks. In this study, we tackle the problem of determining the optimal inter-checkpoint interval of checkpoint and rollback schemes to maximize the deadline meet ratio in real-time systems and to build a probabilistic cost model. From this cost model, we can numerically find the optimal checkpoint interval using mathematical tools. The performance of the proposed solution is evaluated using analytical estimates.

리눅스 클러스터링 웹 서버의 고가용성에 대한 연구 (A study on high availability of the linux clustering web server)

  • 박지현;이상문;홍태화;김학배
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2000년도 제15차 학술회의논문집
    • /
    • pp.88-88
    • /
    • 2000
  • As more and more critical commercial applications move on the Internet, providing highly available servers becomes increasingly important. One of the advantages of a clustered system is that it has hardware and software redundancy. High availability can be provided by detecting node or daemon failure and reconfiguring the system appropriately so that the workload can be taken over bi the remaining nodes in the cluster. This paper presents how to provide the guaranteeing high availability of clustering web server. The load balancer becomes a single failure point of the whole system. In order to prevent the failure of the load balancer, we setup a backup server using heartbeat, fake, mon, and checkpointing fault-tolerance method. For high availability of file servers in the cluster, we setup coda file system. Coda is a advanced network fault-tolerance distributed file system.

  • PDF

그리드 환경의 적응형 오류 극복 관리 시스템 설계 및 구현 (Design and Implementation of Adaptive Fault-Tolerant Management System over Grid)

  • 김은경;김지영;김윤희
    • 정보처리학회논문지A
    • /
    • 제15A권3호
    • /
    • pp.151-154
    • /
    • 2008
  • 서비스 이동과 자원 상태 변화 등 실행 환경 변화가 빈번히 발생하는 그리드 컴퓨팅 환경은 다양한 응용 프로그램 작업 환경을 지원하고 사용자에게 끊임없는 작업 환경을 보장하기 위하여 고가용성을 지원하는 미들웨어가 필수적으로 필요하다. 기존의 분산 환경 미들웨어 역시 고가용성 지원 서비스가 일부 연구자에 의해 진행되고 있으나 공개표준은 아니며 다양한 그리드 서비스에 대한 고려가 없다. 본 논문에서는 환경에 따라 적응하는 서비스 미들웨어 런타임 서비스 관리 시스템을 통해 자율적으로 작업 환경을 재구성하도록 하여 미들웨어의 가용성을 증대시키고 안정적으로 서비스의 계속성과 데이터 및 자료의 일관성을 보장하는 방법을 제시하고 프로토타입 Wapee(Web-Service based Application Execution Environment)를 통해 실제 환경에서 적용 가능성을 확인한다.

4륜 조향을 이용한 Steer-by-Wire 시스템의 고장 허용 제어 전략 (Fault Tolerant Control Strategy for Four Wheel Steer-by-Wire Systems)

  • 노성훈;권백순
    • 자동차안전학회지
    • /
    • 제15권2호
    • /
    • pp.13-20
    • /
    • 2023
  • This paper presents a fault tolerant control strategy for Steer-by-Wire (SbW) systems. Among many problems to be solved before commercialization of SbW systems, maintaining reliability and fault tolerance in such systems are the most pressing issues. In most previous studies, dual steering motors are used to achieve actuation redundancy. However, relatively few studies have been conducted to introduce fault tolerant control strategies using rear wheel steering system. In this work, an actuator fault in front wheel steering is compensated by active rear wheel steering. The proposed fault tolerant control algorithm consists of disturbance observer and sliding mode control. The fault tolerant control performance of the proposed approach is validated via computer simulation studies with Carsim vehicle dynamics software and MATLAB/Simulink.