Title: Automated Analysis of Fault-Tolerance in Distributed Systems
Authors: Stoller, Scott D.; Schneider, Fred B.
Keywords: computer science; technical report
Issue Date: Nov-1996
Publisher: Cornell University
Abstract: This paper describes a method for automated analysis of fault-tolerance properties of distributed systems. The framework is based on ideas from stream-processing semantics for networks of processes and abstract interpretation of programs. The stream-processing model provides modularity and a clean algorithmic basis for the analysis. For efficiency, all aspects of a system's behavior can be approximated in the analysis, including: values (the data transmitted in messages), multiplicities (the number of times each value is sent), and orderings (the order in which values are sent). The approximation mechanisms are based on abstract interpretation, symbolic computation, and partial orders. Approximations are essential to support abstraction from aspects of a system's behavior that do not directly impact its fault-tolerance. Another feature of our approach is that perturbations due to failures can be represented explicitly. This allows fault-tolerance requirements to be expressed as bounds on the acceptable perturbations to a system's behavior as a consequence of certain failures. This facilitates separation of fault-tolerance from other correctness requirements and sometimes enables more efficient analysis. The analysis has been implemented in a prototype tool.
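
To make the three kinds of approximation named in the abstract concrete, here is a toy Python sketch. It is not the report's formalism or its prototype tool. The sign abstraction for values, the "1"/"many" abstraction for multiplicities, and every function name below are assumptions chosen only for illustration. Ordering is approximated in the coarsest possible way, by discarding it.

```python
from collections import Counter

def abstract_value(v):
    """Hypothetical value abstraction: keep only the sign of an integer payload."""
    if v > 0:
        return "pos"
    if v < 0:
        return "neg"
    return "zero"

def abstract_multiplicity(n):
    """Hypothetical multiplicity abstraction for n >= 1: exactly once, or more than once."""
    return "1" if n == 1 else "many"

def abstract_stream(stream):
    """Abstract a list of integer messages into {abstract value: abstract multiplicity}.

    Ordering information is dropped entirely (an assumption of this sketch).
    """
    counts = Counter(abstract_value(v) for v in stream)
    return {val: abstract_multiplicity(n) for val, n in counts.items()}

def perturbation(normal, faulty):
    """Return the abstract values whose multiplicity differs between two runs.

    "0" marks an abstract value that does not occur in a run.
    """
    keys = set(normal) | set(faulty)
    return {
        k: (normal.get(k, "0"), faulty.get(k, "0"))
        for k in keys
        if normal.get(k, "0") != faulty.get(k, "0")
    }

failure_free = abstract_stream([3, 7, -2])
with_failure = abstract_stream([3, -2])  # one positive message lost to a failure
print(perturbation(failure_free, with_failure))
```

In this sketch, a fault-tolerance requirement would be stated as a bound on what `perturbation` may return for a given class of failures, for example that it must be empty, or that only multiplicities may change.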
Appears in Collections: Computer Science Technical Reports

Files in This Item:

File          Size        Format
96-1614.pdf   307.77 kB   Adobe PDF
96-1614.ps    518.47 kB   PostScript


Items in eCommons are protected by copyright, with all rights reserved, unless otherwise indicated.

