Open Access iconOpen Access

ARTICLE

DRL-AMIR: Intelligent Flow Scheduling for Software-Defined Zero Trust Networks

Wenlong Ke1,2,*, Zilong Li1, Peiyu Chen1, Benfeng Chen1, Jinglin Lv1, Qiang Wang2, Ziyi Jia2, Shigen Shen1

1 School of Information Engineering, Huzhou University, Huzhou, 313000, China
2 National Key Laboratory of Advanced Communication Networks, Shijiazhuang, 050050, China

* Corresponding Author: Wenlong Ke. Email: email

Computers, Materials & Continua 2025, 84(2), 3305-3319. https://doi.org/10.32604/cmc.2025.065665

Abstract

Zero Trust Network (ZTN) enhances network security through strict authentication and access control. However, in the ZTN, optimizing flow control to improve the quality of service is still facing challenges. Software Defined Network (SDN) provides solutions through centralized control and dynamic resource allocation, but the existing scheduling methods based on Deep Reinforcement Learning (DRL) are insufficient in terms of convergence speed and dynamic optimization capability. To solve these problems, this paper proposes DRL-AMIR, which is an efficient flow scheduling method for software defined ZTN. This method constructs a flow scheduling optimization model that comprehensively considers service delay, bandwidth occupation, and path hops. Additionally, it balances the differentiated requirements of delay-critical K-flows, bandwidth-intensive D-flows, and background B-flows through adaptive weighting. The proposed framework employs a customized state space comprising node labels, link bandwidth, delay metrics, and path length. It incorporates an action space derived from node weights and a hybrid reward function that integrates both single-step and multi-step excitation mechanisms. Based on these components, a hierarchical architecture is designed, effectively integrating the data plane, control plane, and knowledge plane. In particular, the adaptive expert mechanism is introduced, which triggers the shortest path algorithm in the training process to accelerate convergence, reduce trial and error costs, and maintain stability. Experiments across diverse real-world network topologies demonstrate that DRL-AMIR achieves a 15–20% reduction in K-flow transmission delays, a 10–15% improvement in link bandwidth utilization compared to SPR, QoSR, and DRSIR, and a 30% faster convergence speed via adaptive expert mechanisms.

Keywords

Zero trust network; software-defined networking; deep reinforcement learning; flow scheduling

Cite This Article

APA Style
Ke, W., Li, Z., Chen, P., Chen, B., Lv, J. et al. (2025). DRL-AMIR: Intelligent Flow Scheduling for Software-Defined Zero Trust Networks. Computers, Materials & Continua, 84(2), 3305–3319. https://doi.org/10.32604/cmc.2025.065665
Vancouver Style
Ke W, Li Z, Chen P, Chen B, Lv J, Wang Q, et al. DRL-AMIR: Intelligent Flow Scheduling for Software-Defined Zero Trust Networks. Comput Mater Contin. 2025;84(2):3305–3319. https://doi.org/10.32604/cmc.2025.065665
IEEE Style
W. Ke et al., “DRL-AMIR: Intelligent Flow Scheduling for Software-Defined Zero Trust Networks,” Comput. Mater. Contin., vol. 84, no. 2, pp. 3305–3319, 2025. https://doi.org/10.32604/cmc.2025.065665



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 313

    View

  • 123

    Download

  • 0

    Like

Share Link