Open Access iconOpen Access

ARTICLE

A PPO-Based DRL Approach for Scalable Communication in Civilian UAV Networks

Chu Thi Minh Hue1, Nguyen Minh Quy2,*

1 Faculty of Software Engineering, FPT University, Hanoi, Vietnam
2 Faculty of Information Technology, Hung Yen University of Technology and Education, Hungyen, Vietnam

* Corresponding Author: Nguyen Minh Quy. Email: email

(This article belongs to the Special Issue: AI-Driven Next-Generation Networks: Innovations, Challenges, and Applications)

Computers, Materials & Continua 2026, 87(2), 79 https://doi.org/10.32604/cmc.2026.074398

Abstract

Nowadays, Unmanned Aerial Vehicles (UAVs) are making increasingly important contributions to numerous applications that enhance human quality of life, such as sensing and data collection, computing, and communication. However, communication between UAVs still faces challenges due to high-dynamic topology, volatile wireless links, and strict energy budgets. In this work, we introduce an improved communication scheme, namely Proximal Policy Optimization (PPO). Our solution casts hop–by–hop relay selection as a Markov decision process and develops a decentralized Proximal Policy Optimization framework in an actor–critic form. A key novelty is the design of the reward function, which jointly considers the delivery ratio, end-to-end delay, and energy efficiency, enabling flexible prioritization in dynamic environments. The simulation results across swarms of 20–70 UAVs show that, the proposed framework enhances delivery ratio to 5% over a Deep Q-Network baseline (reaching 80% at 70 nodes), reduces latency by about 2–3 ms in medium-to-dense settings (from 43 to 35–36 ms), and attains comparable or slightly lower total energy consumption (typically 0.5%–2% lower). The results indicate that the proposed communication scheme, adaptive and scalable learning-based UAV scenarios, pave the way for re-world UAV deployments.

Keywords

Reinforcement learning; proximal policy optimization (PPO); UAV; 6G

Cite This Article

APA Style
Hue, C.T.M., Quy, N.M. (2026). A PPO-Based DRL Approach for Scalable Communication in Civilian UAV Networks. Computers, Materials & Continua, 87(2), 79. https://doi.org/10.32604/cmc.2026.074398
Vancouver Style
Hue CTM, Quy NM. A PPO-Based DRL Approach for Scalable Communication in Civilian UAV Networks. Comput Mater Contin. 2026;87(2):79. https://doi.org/10.32604/cmc.2026.074398
IEEE Style
C. T. M. Hue and N. M. Quy, “A PPO-Based DRL Approach for Scalable Communication in Civilian UAV Networks,” Comput. Mater. Contin., vol. 87, no. 2, pp. 79, 2026. https://doi.org/10.32604/cmc.2026.074398



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 734

    View

  • 420

    Download

  • 0

    Like

Share Link