Open Access iconOpen Access

ARTICLE

Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents

Kanyang Jiang1, Yingkai Kang2, Ming Li2,*

1 School of Automation, Guangdong University of Technology and Key Laboratory of Intelligent Detection and the Internet of Things in Manufacturing, Ministry of Education, Guangzhou, China
2 School of Automation, Guangdong University of Technology, Guangzhou, China

* Corresponding Author: Ming Li. Email: email

(This article belongs to the Special Issue: AI-Driven Optimization for Secure and Sustainable Edge IoT Services)

Computers, Materials & Continua 2026, 88(2), 61 https://doi.org/10.32604/cmc.2026.083294

Abstract

The Internet of Vehicular Agents (IoVA) interconnects distributed AI agents across vehicular networks to deliver real-time intelligent services for vehicular users. Due to the limited computing capacity of vehicles, AI agents are deployed on nearby RoadSide Units (RSUs) to perform computation-intensive inference. As vehicles traverse RSU coverage boundaries, AI agents must migrate to target RSUs to maintain service continuity. However, the communication and computing resources at each RSU are shared among multiple co-served vehicles, creating coupled allocation decisions that jointly determine system latency and energy consumption. To address this challenge, we propose a low-latency and energy-aware AI agent migration framework that models the end-to-end system latency and vehicle energy consumption in the IoVA. Since the cumulative nature of energy consumption introduces long-term constraints that cannot be handled by instantaneous optimization, we formulate the resource allocation problem as a constrained Markov decision process and develop a Graph-based Constrained Proximal Policy Optimization (GCPPO) algorithm to solve it. GCPPO employs a bidirectional graph attention network to extract the relational features between heterogeneous vehicles and RSUs, thereby enabling topology-aware resource allocation, and adopts a Lagrangian dual mechanism to adaptively enforce the long-term energy constraints. Simulation results demonstrate the effectiveness and scalability of the proposed algorithm, which achieves a 31.3% reduction in average system latency over baselines while attaining a 96.4% constraint satisfaction rate.

Keywords

Internet of vehicular agents; AI agent migration; constrained deep reinforcement learning; graph attention network; resource allocation

Cite This Article

APA Style
Jiang, K., Kang, Y., Li, M. (2026). Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents. Computers, Materials & Continua, 88(2), 61. https://doi.org/10.32604/cmc.2026.083294
Vancouver Style
Jiang K, Kang Y, Li M. Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents. Comput Mater Contin. 2026;88(2):61. https://doi.org/10.32604/cmc.2026.083294
IEEE Style
K. Jiang, Y. Kang, and M. Li, “Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents,” Comput. Mater. Contin., vol. 88, no. 2, pp. 61, 2026. https://doi.org/10.32604/cmc.2026.083294



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 300

    View

  • 69

    Download

  • 0

    Like

Share Link