Authors:
Óscar Mortágua Pereira
;
David Simões
and
Rui L. Aguiar
Affiliation:
University of Aveiro, Portugal
Keyword(s):
Fault Tolerance, Logging Mechanism, Software Architecture, Transactional System.
Related
Ontology
Subjects/Areas/Topics:
Data Engineering
;
Data Integrity
;
Databases and Data Security
;
Information and Systems Security
;
Nosql Databases
Abstract:
Fault tolerance allows a system to remain operational to some degree when some of its components fail. One of the most common fault tolerance mechanisms consists on logging the system state periodically, and recovering the system to a consistent state in the event of a failure. This paper describes a general fault tolerance logging-based mechanism, which can be layered over deterministic systems. Our proposal describes how a logging mechanism can recover the underlying system to a consistent state, even if an action or set of actions were interrupted mid-way, due to a server crash. We also propose different methods of storing the logging information, and describe how to deploy a fault tolerant master-slave cluster for information replication. We adapt our model to a previously proposed framework, which provided common relational features, like transactions with atomic, consistent, isolated and durable properties, to NoSQL database management systems.