• Title/Summary/Keyword: Software fault-tolerance

Search Result 89, Processing Time 0.031 seconds

Error Recovery System for Digital Carousel System running on Home Network (홈 네트워크에서 디지털 캐로절 시스템을 위한 오류 복구 시스템)

  • Ko, Eung-Nam
    • Journal of Digital Contents Society
    • /
    • v.9 no.4
    • /
    • pp.785-790
    • /
    • 2008
  • Our Digital Carousel enables user to share media objects through media synchronization mechanism. We implemented the Digital Carousel so that the users participated in collaborative work may refer shared media or error objects as the same view to others. In this paper, we discuss a method for increasing reliability through fault tolerance. We describe the design and implementation of the ER running on distributed multimedia environment. ER is a system which is able to recover automatically a software error based on distributed multimedia. This paper explains a performance analysis of an error recovery system running on distributed multimedia environment using the rule-based DEVS modeling and simulation techniques. In DEVS, a system has a time base, inputs, states, outputs, and functions.

  • PDF

An Architecture to Monitor Real-Time Objects in FTB Stub Approach (결함허용 중개자 스터브 방식에서 실시간객체를 감시하는 구조)

  • Im, Hyeong-Taek;Yang, Seung-Min
    • Journal of KIISE:Software and Applications
    • /
    • v.28 no.1
    • /
    • pp.1-13
    • /
    • 2001
  • RMO(Region Monitor Object)는 결함전파나 객체군에 주어진 요구사항의 위반에 의해 발생하는 오류를 처리하는 실시간객체로써 여러 실시간객체의 상태를 감시 및 분석하여 오류를 감지하고, 증상을 진단한 후 알맞은 복구 및 재구성을 실행하다, 이를 위하여 RMO는 응용 실시간객체를 감시할 수 있는 권한을 갖는다. RMO의 권한을 지원해주는 구조는 결함허용 중개자를 이용한다. 결함허용 중개자(FTB 또는 Fault Tolerance Broker)는 RMO가 응용 실시간객체를 감시할 때에 응용의 설계와 응용의 위치에 투명하게 수행될 수 있게 중개자 역할을 한다. 제안하는 감시 구조에는 결함허용 중개자가 응용 실사간객체마다 스터브로 붙는 스터브 방식과 각 노드의 커널에 모듈로 존재하는 커널 모듈 방식이 있다. 본 논문은 스터브 방식에서 RMO가 응용 실시간객체를 감시하는 구조를 제시하고 구현한다. 결함허용 중개자 스터브는 응용 실시간객체와 같은 주소 공간에 존재하면서 응용 실시간객체에서 발생하는 메세지를 가로채고 소속자료에 접근한다. RMO는 결함허용 중개자 스터브가 제공하는 인터페이스를 통해서 응용 실시간객체에 대한 감시 정보를 얻는다. 제안한 감시 구조는 실시간객체 모델인 dRTO(dependable RTO) 모델에 기반하여 설계하였고 실시간 커널인 dKernel 상에서 구현 및 실험하였으나 다른 모델이나 커널에도 적용될 수 있다.

  • PDF

Approach towards qualification of TCP/IP network components of PFBR

  • Aditya Gour;Tom Mathews;R.P. Behera
    • Nuclear Engineering and Technology
    • /
    • v.54 no.11
    • /
    • pp.3975-3984
    • /
    • 2022
  • Distributed control system architecture is adopted for I&C systems of Prototype Fast Breeder Reactor, where the geographically distributed control systems are connected to centralized servers & display stations via switched Ethernet networks. TCP/IP communication plays a significant role in the successful operations of this architecture. The communication tasks at control nodes are taken care by TCP/IP offload modules; local area switched network is realized using layer-2/3 switches, which are finally connected to network interfaces of centralized servers & display stations. Safety, security, reliability, and fault tolerance of control systems used for safety-related applications of nuclear power plants is ensured by indigenous design and qualification as per guidelines laid down by regulatory authorities. In the case of commercially available components, appropriate suitability analysis is required for getting the operation clearances from regulatory authorities. This paper details the proposed approach for the suitability analysis of TCP/IP communication nodes, including control systems at the field, network switches, and servers/display stations. Development of test platform using commercially available tools and diagnostics software engineered for control nodes/display stations are described. Each TCP link behavior with impaired packets and multiple traffic loads is described, followed by benchmarking of the network switch's routing characteristics and security features.

The Study of quality measurement plan for software reliability (소프트웨어 신뢰도 품질측정 방안에 관한 연구)

  • Lim, Kwang-Hyeon;Ryu, Kwang-Sun;Shon, Ho-Sun;Ryu, Keun-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.12
    • /
    • pp.187-198
    • /
    • 2012
  • International standard for software quality evaluation, ISO/IEC 9126 defines the quality characteristics and sub-characteristics consisting of quality of the software. Also, in this standard, the quality assessment items are presented in metrics based on maturity, fault tolerance, recoverability and compliance which are sub-characteristics of reliability. However, this reliability assessment items which are presented are not applied now due to the mathematical difficulties and the problems caused by collection of data. In order to resolve these problems, this study introduces measures to develop a specific test technique by quality attributes and to measure the quality. In other words, the detailed guidelines are presented to be utilized as a standard by summarizing test activities and techniques by each development life the most commonly used. The SW quality results are derived by assigning weights to software in order of the most important quality characteristics of respective software using checklist that can be easily utilized and applying test technique that can be utilized for detailed evaluation items and checklist. This improves problems that include reliability assessment item of ISO/IEC 9126 as the measures to measure new reliability quality of software.

Methods for Enhancing Reliability of On-Ground IoT Applications (지상용 IoT 애플리케이션의 신뢰성 향상 기법)

  • Shin, Dong Ha;Han, Seung Ho;Kim, Soo Dong;Her, Jin Sun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.4
    • /
    • pp.151-160
    • /
    • 2015
  • Internet-of-Things(IoT) is the computing environment to provide valuable services by interacting with multiple devices, where diverse devices are connected within the existing Internet infrastructure and acquire context information by sensing. As the concern of IoT has been increased recently, most of the industries develop many IoT devices. And, many people are focused on the IoT application that is utilizing different technologies, which are sensor network, communication technologies, and software engineering. Developing on-ground IoT application is especially even more active in progress depending on increasing of on-ground IoT devices because it is possible for them to access dangerous and inaccessible situation. However, There are a few studies related IoT. Moreover, since on-ground IoT application, which is different from typical software application, has to consider device's characteristics, communication, and surround condition, it reveal challenges, decreasing reliability. Therefore, in this paper, we analyze reliability challenges related to maturity and fault tolerance, one of reliability attributes, occurring in developing on-ground IoT applications and suggest the effective solutions to resolve the challenges. To verify proposed the challenges and solutions, we show result that is applying the solutions to applications. By presenting the case study, we evaluate the effectiveness of applying the solutions to the application.

A Study on The Dangers and Their Countermeasures of Autonomous Vehicle (자율주행자동차 위험 및 대응방안에 대한 고찰)

  • Jung, Im Y.
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.6
    • /
    • pp.90-98
    • /
    • 2020
  • Modern vehicles are evolving from manual to automatic driving. As the ratio of electrical equipment and software increases inside the vehicle, vehicles that support autonomous driving are becoming another open computer system that can communicate with the outside. The safety of the vehicle means the safety of both the passenger and the non-passenger. It is not clear whether the safety problem of ultimate autonomous vehicles can be solved by the current solution of computer systems related to fault tolerance and security. Autonomous vehicles should not be dangerous to people after they are released to the market, so it is necessary to proactively diagnose all the risks that can be predicted with current technology. This paper examines the current developments of autonomous vehicles and analyzes their dangers that threaten driving safety, as well as their countermeasures.

Pub/Sub-based Sensor virtualization framework for Cloud environment

  • Ullah, Mohammad Hasmat;Park, Sung-Soon;Nob, Jaechun;Kim, Gyeong Hun
    • International journal of advanced smart convergence
    • /
    • v.4 no.2
    • /
    • pp.109-119
    • /
    • 2015
  • The interaction between wireless sensors such as Internet of Things (IoT) and Cloud is a new paradigm of communication virtualization to overcome resource and efficiency restriction. Cloud computing provides unlimited platform, resources, services and also covers almost every area of computing. On the other hand, Wireless Sensor Networks (WSN) has gained attention for their potential supports and attractive solutions such as IoT, environment monitoring, healthcare, military, critical infrastructure monitoring, home and industrial automation, transportation, business, etc. Besides, our virtual groups and social networks are in main role of information sharing. However, this sensor network lacks resource, storage capacity and computational power along with extensibility, fault-tolerance, reliability and openness. These data are not available to community groups or cloud environment for general purpose research or utilization yet. If we reduce the gap between real and virtual world by adding this WSN driven data to cloud environment and virtual communities, then it can gain a remarkable attention from all over, along with giving us the benefit in various sectors. We have proposed a Pub/Sub-based sensor virtualization framework Cloud environment. This integration provides resource, service, and storage with sensor driven data to the community. We have virtualized physical sensors as virtual sensors on cloud computing, while this middleware and virtual sensors are provisioned automatically to end users whenever they required. Our architecture provides service to end users without being concerned about its implementation details. Furthermore, we have proposed an efficient content-based event matching algorithm to analyze subscriptions and to publish proper contents in a cost-effective manner. We have evaluated our algorithm which shows better performance while comparing to that of previously proposed algorithms.

Study on the efficient consensus process of PBFT

  • Min, Youn-A
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.4
    • /
    • pp.47-53
    • /
    • 2020
  • Blockchain is a distributed shared ledger that transparently manages information through verification and agreement between nodes connected to a distributed network. Recently, cases of data management among authorized agencies based on private blockchain are increasing. In this paper, we investigated the application cases and technical processes of PBFT, the representative consensus algorithm of private blockchain, and proposed a modified PBFT algorithm that enables efficient consensus by simplifying duplicate verification and consensus processes that occur during PBFT processing. The algorithm proposed in this paper goes through the process of selecting a delegation node through an authoritative node and can increase the safety of the delegation node selection process by considering an efficient re-election algorithm for candidate nodes. By utilizing this research, it is possible to reduce the burden on the network communication cost of the consensus process and effectively process the final consensus process between nodes.

Synchronization Method Design of Redundant Flight Control Computer for UAV (무인기를 위한 이중화 비행제어컴퓨터의 동기화 설계)

  • Lee, Young Seo;Kang, Shin Woo;Lee, Hee Gon;Ahn, Tae-Sik
    • Journal of Advanced Navigation Technology
    • /
    • v.25 no.4
    • /
    • pp.273-279
    • /
    • 2021
  • A flight control computer(FLCC) applied to an unmanned aerial vehicle(UAV) is a safety-critical item, and which is designed in a multiple structure to increase the reliability of operation by securing fault tolerance. These FLCC of multiple structure should be designed so that each independent processing/control components can perform the same operation at the same time. And for this reason, a synchronization algorithm for synchronizing the operation between FLCCs should be included in an operational flight program. In this paper, we propose a software design method for synchronization between dual FLCCs applied to UAVs. The proposed synchronization method is designed to synchronize using only the minimum hardware resources to reduce a failure rate. In addition, the proposed synchronization method is designed to minimized synchronization errors due to a timer operation by designing in consideration of operation characteristics of the hardware timer used for the synchronization.