Home / Journals / CMC / Online First / doi:10.32604/cmc.2026.080961
Special Issues
Table of Content

Open Access

REVIEW

Three-Level Taxonomy of RL Self-Healing for Energy, Latency, and Security Constrained Edge IoT Networks: A Review

Hitesh Mohapatra*
School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
* Corresponding Author: Hitesh Mohapatra. Email: email
(This article belongs to the Special Issue: AI-Driven Optimization for Secure and Sustainable Edge IoT Services)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.080961

Received 19 February 2026; Accepted 21 April 2026; Published online 05 May 2026

Abstract

This review systematically analyzes Reinforcement Learning approaches for self-healing in energy-constrained secure edge IoT networks across 82 studies from 2020 to 2026. Unlike existing surveys that focus on general RL applications, the proposed review focuses on a three-level taxonomy that uniquely addresses edge IoT deployment realities through formulation-scope-hardware mapping. The work develops a novel three-level taxonomy classifying recovery scope (node, link, service, network), RL formulations (tabular, deep, multi-agent, model-based), and constraint integration (energy, latency, security, hybrid), revealing service migration dominance at 30% coverage and node recovery achieving 38% maximum energy savings. Normalized performance baselines establish energy gains up to 44%, latency compliance of 84% under mobility traces, and 35% security exposure reduction during failover windows. 10 evidence-based gaps emerge, including a complete absence of model-based node recovery and multi-agent network security orchestration spanning only 2 papers. 15 prioritized future directions target 70% sample efficiency gains, 35% exposure reduction under compromised agents, and 22% Pareto improvements through joint constraint optimization, providing researchers and practitioners structured roadmap for sustainable edge IoT resilience. Performance metrics are normalized against static policy baselines using logarithmic scaling and success ratios to ensure cross-study comparability.

Keywords

Reinforcement learning; edge computing; self-healing networks; energy constraints; IoT security; latency optimization; failover recovery; sustainable edge services
  • 237

    View

  • 40

    Download

  • 0

    Like

Share Link