Open Access



Fault-Tolerant Communication Induced Checkpointing and Recovery Protocol Using IoT

Neha Malhotra1,2,*, Manju Bala3

1 I.K.Gujral Punjab Technical University, Kapurthala, 144603, India
2 Lovely Professional University, Phagwara, 144411, India
3 Khalsa College of Engineering and Technology, Amritsar, 143001, India

* Corresponding Author: Neha Malhotra. Email: email

(This article belongs to the Special Issue: Intelligence 4.0: Concepts and Advances in Computational Intelligence)

Intelligent Automation & Soft Computing 2021, 30(3), 945-960.


In mobile computing systems, nodes in the network take checkpoints to survive failures. Certain characteristics of these systems, such as mobility, low bandwidth, disconnection, power constraints, and limited memory, make them more prone to failures. This paper proposes a novel minimum-process communication-induced checkpointing algorithm that makes full use of the computation ability of a mobile computing system and implements effective stable storage. In phase 1, each node takes spontaneous checkpoints and saves them locally, using a logistic function to estimate the time interval between two checkpoints. In phase 2, each node takes checkpoints in a coordinated manner using a Takagi–Sugeno (T–S) fuzzy system, whose output is derived from 39 rules incorporated in the system to avoid unnecessary and irrelevant checkpoints. Finally, the permanent checkpoints are stored on IoT (Internet of Things) to reduce the storage load on the system. Quantitative analysis and experimental simulation show that the proposed scheme outperforms other communication-induced checkpointing schemes in terms of the minimum number of processes required to take checkpoints and in communication cost. The simulation results also show that checkpointing is faster than with existing techniques, owing to lower latency with respect to the number of nodes and to the storage of permanent checkpoints on IoT. Because the proposed system partitions the network, the average computation loss is also reduced: rollback recovery takes place only in the affected partition, not in the entire network. Overall, the approach makes a mobile distributed computing system fault tolerant without blocking processes during checkpointing.
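As a rough illustration of the phase 1 idea, the sketch below shows one way a logistic function could map a node's estimated failure risk to the time between two spontaneous checkpoints. The abstract does not give the function's inputs or parameters, so everything here is an assumption made for illustration: the `risk` score in [0, 1], the bounds `t_min` and `t_max`, the steepness `k`, the midpoint `x0`, and the function names themselves are hypothetical and are not taken from the paper.

```python
import math


def logistic(x, k=1.0, x0=0.0):
    """Standard logistic curve: rises from 0 to 1, centred at x0, steepness k."""
    return 1.0 / (1.0 + math.exp(-k * (x - x0)))


def checkpoint_interval(risk, t_min=5.0, t_max=60.0, k=8.0, x0=0.5):
    """Hypothetical mapping from a failure-risk score to a checkpoint interval.

    risk  : assumed score in [0, 1] (e.g. derived from mobility or battery state)
    t_min : assumed shortest allowed interval, in seconds
    t_max : assumed longest allowed interval, in seconds

    Higher risk pushes the interval towards t_min, so fragile nodes
    checkpoint more often; low-risk nodes stay close to t_max.
    """
    return t_max - (t_max - t_min) * logistic(risk, k, x0)


if __name__ == "__main__":
    for risk in (0.1, 0.5, 0.9):
        print(f"risk={risk:.1f} -> interval={checkpoint_interval(risk):.1f} s")
```

The logistic shape keeps the interval bounded between the two limits and changes it smoothly around the midpoint, which is one plausible reason for choosing it over a linear mapping. The paper's actual formulation may differ.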


Cite This Article

APA Style
Malhotra, N., Bala, M. (2021). Fault-tolerant communication induced checkpointing and recovery protocol using IoT. Intelligent Automation & Soft Computing, 30(3), 945-960.
Vancouver Style
Malhotra N, Bala M. Fault-tolerant communication induced checkpointing and recovery protocol using IoT. Intell Automat Soft Comput. 2021;30(3):945-960.
IEEE Style
N. Malhotra and M. Bala, "Fault-Tolerant Communication Induced Checkpointing and Recovery Protocol Using IoT," Intell. Automat. Soft Comput., vol. 30, no. 3, pp. 945-960, 2021.

This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.