Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents

Kanyang Jiang¹, Yingkai Kang², Ming Li^2,*
1 School of Automation, Guangdong University of Technology and Key Laboratory of Intelligent Detection and the Internet of Things in Manufacturing, Ministry of Education, Guangzhou, China
2 School of Automation, Guangdong University of Technology, Guangzhou, China
* Corresponding Author: Ming Li. Email: email
(This article belongs to the Special Issue: AI-Driven Optimization for Secure and Sustainable Edge IoT Services)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.083294

Received 01 April 2026; Accepted 03 May 2026; Published online 20 May 2026

Download PDF

Abstract

The Internet of Vehicular Agents (IoVA) interconnects distributed AI agents across vehicular networks to deliver real-time intelligent services for vehicular users. Due to the limited computing capacity of vehicles, AI agents are deployed on nearby RoadSide Units (RSUs) to perform computation-intensive inference. As vehicles traverse RSU coverage boundaries, AI agents must migrate to target RSUs to maintain service continuity. However, the communication and computing resources at each RSU are shared among multiple co-served vehicles, creating coupled allocation decisions that jointly determine system latency and energy consumption. To address this challenge, we propose a low-latency and energy-aware AI agent migration framework that models the end-to-end system latency and vehicle energy consumption in the IoVA. Since the cumulative nature of energy consumption introduces long-term constraints that cannot be handled by instantaneous optimization, we formulate the resource allocation problem as a constrained Markov decision process and develop a Graph-based Constrained Proximal Policy Optimization (GCPPO) algorithm to solve it. GCPPO employs a bidirectional graph attention network to extract the relational features between heterogeneous vehicles and RSUs, thereby enabling topology-aware resource allocation, and adopts a Lagrangian dual mechanism to adaptively enforce the long-term energy constraints. Simulation results demonstrate the effectiveness and scalability of the proposed algorithm, which achieves a 31.3% reduction in average system latency over baselines while attaining a 96.4% constraint satisfaction rate.

Keywords

Internet of vehicular agents; AI agent migration; constrained deep reinforcement learning; graph attention network; resource allocation

Downloads
- Full-Text PDF
Citation Tools
- BibTex
- EndNote
- RIS

226

View
45

Download
0

Like

Optimal Resource Allocation for NOMA Wireless Networks
Fahad R. Albogamy, M. A. Aiyashi,...
Light-Weighted Decision Support Framework for Selecting Cloud Service Providers
Abdulmajeed Aljuhani, Abdulaziz...
Resource Management in UAV Enabled MEC Networks
Muhammad Abrar, Ziyad M. Almohaimeed,...
Weighted De-Synchronization Based Resource Allocation in Wireless Networks
Kimchheang Chhea, Dara Ron, Jung-Ryun...
An Optimal Algorithm for Resource Allocation in D2D Communication
Shahad Alyousif, Mohammed Dauwed,...

All issues

Online First

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

Graph-Based Constrained PPO for Low-Latency and Energy-Aware AI Agent Migration in Internet of Vehicular Agents

Abstract

Keywords

226

45

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link