• Title/Summary/Keyword: Software fault-tolerance

Search Result 89, Processing Time 0.023 seconds

A Study on Design and Reliability Assessment for Embedded Hot-Standby Sparing FT System Using Self-Checking Logic (자기검사회로를 이용한 대기이중계구조 결함허용제어기의 설계 및 신뢰도평가에 관한 연구)

  • Lee, Jae-Ho;Lee, Kang-Mi;Kim, Young-Kyu;Shin, Duc-Ko
    • Journal of the Korean Society for Railway
    • /
    • v.9 no.6 s.37
    • /
    • pp.725-731
    • /
    • 2006
  • Hot Standby sparing system detecting faults by using software, and being tolerant any faults by using Hardware Redundancy is difficult to perform quantitative reliability prediction and to detect real time faults. Therefore, this paper designs Hot Standby sparing system using hardware basis self checking logic in order to overcome this problem. It also performs failure mode analysis of Hot Standby sparing system with designed self checking logic by using FMEA (Failure Mode Effect Analysis), and identifies reliability assessment of the controller designed by quantifying the numbers of failure development by using FTA (Fault Tree Analysis)

A Study on the Software Reliability Improvement in Railway Control System (열차제어시스템 소프트웨어의 신뢰성 향상에 대한 고찰)

  • Kim, Jong-Ki;Choi, Kyoo-Hyung;Lee, Jong-Woo
    • Proceedings of the KIEE Conference
    • /
    • 2001.10a
    • /
    • pp.275-277
    • /
    • 2001
  • 열차제어시스템의 소프트웨어는 높은 신뢰성과 안전성이 요구된다. 이를 위해 철도에서는 고장회피(Fault Avoidance) 기법과 고장허용(Fault Tolerance)기법을 사용하고 있다. 그러나 최근 신호설비의 소프트웨어 신뢰성을 좀더 향상시키기 위해 일본, 프랑스, 스웨덴 등에서는 안전에 치명적인 시스템에 사용되고 있는 정형기법(Forma) Method)을 포함하여 많은 연구를 수행중이다. 본고에서는 국외 관련분야를 분석하고, 열차제어시스템 소프트웨어의 신뢰성 향상에 관하여 기술하고자 한다.

  • PDF

A Design and Implementation of Fault Tolerance Agent on Distributed Multimedia Environment (분산 멀티미디어 환경에서 결함 허용 에이전트의 설계 및 구현)

  • Go, Eung-Nam;Hwang, Dae-Jun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.10
    • /
    • pp.2618-2629
    • /
    • 1999
  • In this paper, we describe the design and implementation of the FDRA(Fault Detection Recovery based on Agent) running on distributed multimedia environment. DOORAE is a good example for distributed multimedia and multimedia distance education system among students and teachers during lecture. It has primitive service agents. Service functions are implemented with objected oriented concept. FDRA is a multi-agent system. It has been environment, intelligent agents interact with each other, either collaboratively or non-collaboratively, to achieve their goals. The main idea is to detect an error by using polling method. This system detects an error by polling periodically the process with relation to session. And, it is to classify the type of error s automatically by using learning rules. The merit of this system is to use the same method to recovery it as it creates a session. FDRA is a system that is able to detect an error, to classify an error type, and to recover automatically a software error based on distributed multimedia environment.

  • PDF

A Development of mobile broadcasting monitor for improving reliability on IP-TV Platform based on TIT (TIT 기반에 IP-TV 플랫폼의 신뢰성 향상을 위한 방송 모니터 개발)

  • Sso, Sang-Jin;Jin, Hyun-Joon;Park, Noh-Kyung
    • Journal of Internet Computing and Services
    • /
    • v.8 no.5
    • /
    • pp.59-66
    • /
    • 2007
  • In korea, TIT(Transport Information Technology) based IP-TV services have been provided in Saemaeul trains and some sections of subway trains, But the software systems for the service performed in alternated fashions and suffered from many problems such as suspension, memory leaking and overflow, These problems increased playback loss time and resulted in bad reliabilities, In this paper, a software TIT monitor is designed and implemented for Monitoring module and Reset module in physically poor environments, The designed system formalized monitoring time intervals for effective monitoring, Through the real experiments, playback time is improved in 7.2% comparing to existing system.

  • PDF

A Study on Validation Testing for Input Files of MS Word-Processor (MS 워드프로세서의 입력 파일에 대한 유효성 테스팅 방법에 관한 연구)

  • Yun, Young-Min;Choi, Jong-Cheon;Yoo, Hae-Young;Cho, Seong-Je
    • The KIPS Transactions:PartC
    • /
    • v.14C no.4
    • /
    • pp.313-320
    • /
    • 2007
  • In this paper, we propose a method to analyze security vulnerabilities of MS word-processor by checking the validation of its input files. That is, this study is to detect some vulnerabilities in the input file of the word processor by analyzing the header information of its input file. This validation test can not be conducted by the existing software fault injection tools including Holodeck and CANVAS. The proposed method can be also applied to identify the input file vulnerabilities of Hangul and Microsoft Excel which handle a data file with a header as an input. Moreover, our method can provide a means for assessing the fault tolerance and trustworthiness of the target software.

A Configurable Software-based Approach for Detecting CFEs Caused by Transient Faults

  • Liu, Wei;Ci, LinLin;Liu, LiPing
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1829-1846
    • /
    • 2021
  • Transient faults occur in computation units of a processor, which can cause control flow errors (CFEs) and compromise system reliability. The software-based methods perform illegal control flow detection by inserting redundant instructions and monitoring signature. However, the existing methods not only have drawbacks in terms of performance overhead, but also lack of configurability. We propose a configurable approach CCFCA for detecting CFEs. The configurability of CCFCA is implemented by analyzing the criticality of each region and tuning the detecting granularity. For critical regions, program blocks are divided according to space-time overhead and reliability constraints, so that protection intensity can be configured flexibly. For other regions, signature detection algorithms are only used in the first basic block and last basic block. This helps to improve the fault-tolerant efficiency of the CCFCA. At the same time, CCFCA also has the function of solving confusion and instruction self-detection. Our experimental results show that CCFCA incurs only 10.61% performance overhead on average for several C benchmark program and the average undetected error rate is only 9.29%. CCFCA has high error coverage and low overhead compared with similar algorithms. This helps to meet different cost requirements and reliability requirements.

Distributed Information Extraction in Wireless Sensor Networks using Multiple Software Agents with Dynamic Itineraries

  • Gupta, Govind P.;Misra, Manoj;Garg, Kumkum
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.1
    • /
    • pp.123-144
    • /
    • 2014
  • Wireless sensor networks are generally deployed for specific applications to accomplish certain objectives over a period of time. To fulfill these objectives, it is crucial that the sensor network continues to function for a long time, even if some of its nodes become faulty. Energy efficiency and fault tolerance are undoubtedly the most crucial requirements for the design of an information extraction protocol for any sensor network application. However, most existing software agent based information extraction protocols are incapable of satisfying these requirements because of static agent itineraries and large agent sizes. This paper proposes an Information Extraction protocol based on Multiple software Agents with Dynamic Itineraries (IEMADI), where multiple software agents are dispatched in parallel to perform tasks based on the query assigned to them. IEMADI decides the itinerary for an agent dynamically at each hop using local information. Through mathematical analysis and simulation, we compare the performance of IEMADI with a well known static itinerary based protocol with respect to energy consumption and response time. The results show that IEMADI provides better performance than the static itinerary based protocols.

An Object-Oriented Software Development Framework for Autonomous Decentralized Systems (자치적 분산처리 시스템을 위한 객체지향 소프트웨어 개발 프레임워크에 대한 연구)

  • 염근혁
    • The Journal of Information Technology and Database
    • /
    • v.4 no.1
    • /
    • pp.105-116
    • /
    • 1997
  • 분산처리 시스템을 위한 소프트웨어 개발 방법 및 이를 지원하는 프레임워크의 개발은 분산처리 시스템을 위한 좋은 소프트웨어의 개발을 위해 매우 필요한 일이라 할 수 있다. 즉, 분산처리 시스템을 위한 신뢰성 있고 적합성과 확장성을 가진 소프트웨어의 개발은 매우 중요한 일이다. 자치적 분산처리 시스템(Autonomous Decentralized Systems)은 온라인 확장성과 온라인 유지보수성 및 fault tolerance기능을 가진 분산처리 시스템이다. 이 논문에서는 자치적 분산처리 시스템을 위한 객체지향 소프트웨어 개발을 지원하는 프레임워크에 대하여 논한다. 이 프레임워크는 객체지향 요구 분석과 객체지향 설계, 구현, 프로세서에 타스크(task)의 할당, 검증 및 유지보수로 구성되어 있으며, 온라인 확장성과 온라인 변경성(modifiability)을 지원한다. 자치적 분산처리 애플리케이션 소프트웨어 개발을 지원하는 프레임워크는 객체지향 computation 모델을 근간으로 하고 있다. 이 논문에서는 프레임워크외에도 자치적 분산처리 소프트웨어 개발을 위한 CASE(Computer Aided Software Engineering) 환경에 대하여서도 논한다.

  • PDF

A Fault-Tolerance Agent for Multimedia Collaboration Works running on Vehicle Environment (차량 환경 상에서 멀티미디어 공동 작업을 위한 결함 허용 에이전트)

  • Ko, Eung-Nam
    • Journal of Advanced Navigation Technology
    • /
    • v.15 no.1
    • /
    • pp.157-161
    • /
    • 2011
  • This paper explains an error process for multimedia collaboration works with session management running on vehicle network environment. This system consists of an FDA and FRA. FDA is an agent that detects an error by hooking techniques for multimedia system based on vehicle network environment with session management. FRA is a system that is suitable for recovering software error for multimedia system with session management based on vehicle network environment. This paper describes only FRA. When multiple local sessions is opened, each local session manager sends information of participant to global session manager and take current information about session of processing in network.

Availability Improvement Model of (n,k) Cluster Systems using Software Rejuvenation (소프트웨어 재활기법을 적용한 (n,k) 클러스터 시스템의 가용도 향상 모델)

  • 이재성;박기진;강창훈;박범주;김성수
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.30 no.5_6
    • /
    • pp.290-298
    • /
    • 2003
  • Internet-based computer systems have to provide both high-availability and high-performance. Cluster technology has been used to obtain availability and performance simultaneously Generally, high-availability cluster systems tolerate a failure of a cluster node and cost-effectively solve it. In this paper, we study availability and downtime cost of (n,k) cluster systems. By considering performance, we model state transition of (n,k) cluster systems and apply software rejuvenation technique to improve availability of the system. We find that software rejuvenation can be used to improve availability of (n,k) cluster systems.