FAULTS ANALYSIS IN DISTRIBUTED SYSTEMS - Quantitative Estimation of Reliability and Resource Requirements

Christian Dauer Thorenfeldt Sellberg, Michael R. Hansen, Paul Fischer

2008

Abstract

We live in a time in which we are becoming ever more dependent on distributed computing. Predictable quantitative properties of the reliability and resource requirements of these systems are of utmost importance. Today, however, such properties can only be established after a system has been implemented and released for test, at which point problems can be costly and time-consuming to solve. We present a new method, a process algebra and a simulation tool for estimating quantitative properties of the reliability and resource requirements of a distributed system with complex behaviour, including complex fault-tolerance behaviour. The simulation tool allows tailored fault injection, e.g. random failures and attacks. The method is based on the π-calculus (Milner, 1999), to which it adds a stochastic fail-able process group construct. Performance is quantitatively estimated using reaction rates (Priami, 1995). We show how to model and estimate quantitative properties of a CPU scavenging grid with fault tolerance. To emphasize the expressiveness of our language, called Gπ, we provide design patterns for encoding higher-order functions, object-oriented classes, process translocation, conditional loops and conditional control flow. The design patterns are used to implement linked lists, higher-order list functions and binary algebra. The focus of the paper is on practical application.

References

  1. Amadio, Roberto M., 1997. An asynchronous model of locality, failure, and process mobility. In D. Garlan and D. LeMetayer, editors, Proceedings of the 2nd International Conference on Coordination Languages and Models (COORDINATION'97), volume 1282, pages 374-391, Berlin, Germany. Springer-Verlag.
  2. Fowler, Martin, 2003. UML Distilled: A Brief Guide to the Standard Object Modeling Language. 3rd Edition. The Addison-Wesley Object Technology Series.
  3. Francalanza, Adrian and Hennessy, Matthew, 2006. A theory for observational fault-tolerance. Lecture Notes in Computer Science, volume 3921.
  4. Milner, Robin, 1999. Communicating and Mobile Systems: the Pi-Calculus, Cambridge Univ. Press.
  5. Plotkin, G., 1981. A structural approach to operational semantics. Tech. Rep. DAIMI FN-19, Computer Science Dept., Aarhus University, Aarhus, Denmark.
  6. Priami, Corrado, 1995. Stochastic pi-Calculus. Comput. J. 38(7): 578-589.
  7. Sellberg, Christian, 2008. Model and Tool for Fault Analysis in Distributed Systems. Master's Thesis. Informatics and Mathematical Modelling, Technical University of Denmark, DTU.
  8. Tanenbaum, Andrew S. and van Steen, Maarten, 2006. Distributed Systems: Principles and Paradigms. 2nd Edition. Prentice Hall.


Paper Citation


in Harvard Style

Dauer Thorenfeldt Sellberg C., Hansen M. R. and Fischer P. (2008). FAULTS ANALYSIS IN DISTRIBUTED SYSTEMS - Quantitative Estimation of Reliability and Resource Requirements. In Proceedings of the Third International Conference on Software and Data Technologies - Volume 2: ICSOFT, ISBN 978-989-8111-52-4, pages 45-52. DOI: 10.5220/0001881700450052


in Bibtex Style

@conference{icsoft08,
author={Christian Dauer Thorenfeldt Sellberg and Michael R. Hansen and Paul Fischer},
title={FAULTS ANALYSIS IN DISTRIBUTED SYSTEMS - Quantitative Estimation of Reliability and Resource Requirements},
booktitle={Proceedings of the Third International Conference on Software and Data Technologies - Volume 2: ICSOFT},
year={2008},
pages={45-52},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001881700450052},
isbn={978-989-8111-52-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Software and Data Technologies - Volume 2: ICSOFT
TI - FAULTS ANALYSIS IN DISTRIBUTED SYSTEMS - Quantitative Estimation of Reliability and Resource Requirements
SN - 978-989-8111-52-4
AU - Dauer Thorenfeldt Sellberg C.
AU - Hansen M. R.
AU - Fischer P.
PY - 2008
SP - 45
EP - 52
DO - 10.5220/0001881700450052