Distributed Application Checkpointing for Replicated State Machines

Özdinç Çelikel; Tolga Ovatman

doi:10.12694/scpe.v22i1.1840

PDF

Published: Feb 9, 2021

DOI: https://doi.org/10.12694/scpe.v22i1.1840

Keywords:

Application Checkpointing Replicated State Machines Serverless Computing

Özdinç Çelikel

Department of Computer Engineering, Istanbul Technical University, Turkey

https://orcid.org/0000-0002-1151-7879

Tolga Ovatman

Department of Computer Engineering, Istanbul Technical University, Turkey

https://orcid.org/0000-0001-5918-3145

Abstract

Application checkpointing is a widely used recovery mechanism that consists of saving an application's state periodically to be used in case of a failure. In this study we investigate the utilisation of distributed checkpointing for replicated state machines. Conventionally, for replicated state machines, checkpointing information is stored in a replicated way in each of the replicas or separately in a single instance. Applying distributed checkpointing provides a means to adjust the level of fault tolerance of the checkpointing approach by giving away from recovery time. We use a local cluster and cloud environment to examine the effects of distributed checkpointing in a simple state machine example and compare the results with conventional approaches. As expected, distributed checkpointing gains from memory consumption and utilise different levels of fault tolerance while performing worse in terms of recovery time.

Issue

Vol. 22 No. 1 (2021)

Section

Research Papers

Author Biography

Tolga Ovatman, Department of Computer Engineering, Istanbul Technical University, Turkey

Tolga Ovatman received the B.Sc. degree in computer engineering from Hacettepe University, Turkey, in 1999, and the M.Sc. and Ph.D. degrees in computer engineering from Istanbul Technical University (ITU), Turkey, in 2005 and 2011, respectively. He is an Associate Professor with the Computer Engineering Department, ITU. His research interests include cloud computing, model checking, parallel programming, and object-oriented design.

Article Sidebar

Main Article Content

Abstract

Article Details

Tolga Ovatman, Department of Computer Engineering, Istanbul Technical University, Turkey