ANTECEDENCE GRAPH APPROACH TO CHECKPOINTING FOR FAULT TOLERANCE IN MULTI AGENT SYSTEM

Rajwinder Singh, Ramandeep Kaur, Rama Krishna Challa

2010

Abstract

Checkpointing has been widely used for providing fault tolerance in multi-agent systems. But the traditional message passing based checkpointing and rollback algorithms may suffer from problems of excess bandwidth consumption and large overheads. In order to maintain consistency of multi agent system, the checkpointing is forced on all participating agents that may result in blocking of agents’ operations to carry out checkpointing. These overheads could be considerably reduced if the checkpointing would be forced only on selective agents instead of all agents. This paper presents a low latency, non-blocking checkpointing scheme which marks out dependent agents using Antecedence graphs and then checkpoints are forced on only these agents. To recover from failures, the antecedence graphs and message logs are regenerated and normal operations continued. The proposed scheme reports less overheads and reduced recovery times as compared to existing schemes.

References

  1. Nwana, Hyacinth, S., 1996. Software Agents: An Overview. Knowledge Engineering Review. Vol. 11, Cambridge University Press. pp. 1 - 40.
  2. Lyu M. R., Chen, X., Wong. T. Y., 2004. Design and Evaluation of a Fault-Tolerant Mobile-Agent System. IEEE CS Press, pp. 32-38.
  3. Elnozahy, E, Alvisi, N., L., Wang, Y, M., Johnson, D, B., 1999. Survey of Rollback-Recovery Protocols in Message- Passing Systems, Technical Report CMUCS-99-148, School Computer Science, Carnegie Mellon University.
  4. Khokhar, M, M., Nadeem, A., Paracha, O,M.,2006. An Antecedence Graph Approach for Fault Tolerance in a Multi-Agent System. Proceedings of the IEEE 7th International Conference on Mobile Data Management.
  5. Elnozahy, E, N., 1993. Manetho: Fault Tolerance in Distributed Systems Using Rollback-Recovery and Process Replication, PhD Thesis, Rice University, Houston, Texas.
  6. Meth, K, Z., Tuel, W, G., 2000. Parallel checkpoint/restart without message logging. Proceeding of IEEE 28th Int. Conf. on Parallel Processing, pp. 253-258.
  7. Bhargava, B., Lian, S, R., 1998. Independent checkpointing and concurrent rollback for recovery in distributed systems - an optimistic approach, Proceeding of 7th IEEE Symp. Reliable Distributed Syst.,pp. 3-12.
  8. Manivannan, D., Singhal, M., 1999. Quasi-synchronous checkpointing: Models, characterization, and classification, IEEE Trans. Parallel and Distributed Syst., 10(7): pp.703-713.
  9. Lange, B, Banny., 1998. Java Aglets Application Programming Interface(JAAPI) White Paper-Draft 2 , IBM Tokyo Research Laboratory.
Download


Paper Citation


in Harvard Style

Singh R., Kaur R. and Krishna Challa R. (2010). ANTECEDENCE GRAPH APPROACH TO CHECKPOINTING FOR FAULT TOLERANCE IN MULTI AGENT SYSTEM . In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS, ISBN 978-989-8425-07-2, pages 139-142. DOI: 10.5220/0002898601390142


in Bibtex Style

@conference{iceis10,
author={Rajwinder Singh and Ramandeep Kaur and Rama Krishna Challa},
title={ANTECEDENCE GRAPH APPROACH TO CHECKPOINTING FOR FAULT TOLERANCE IN MULTI AGENT SYSTEM},
booktitle={Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS,},
year={2010},
pages={139-142},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002898601390142},
isbn={978-989-8425-07-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS,
TI - ANTECEDENCE GRAPH APPROACH TO CHECKPOINTING FOR FAULT TOLERANCE IN MULTI AGENT SYSTEM
SN - 978-989-8425-07-2
AU - Singh R.
AU - Kaur R.
AU - Krishna Challa R.
PY - 2010
SP - 139
EP - 142
DO - 10.5220/0002898601390142