Open Access iconOpen Access

ARTICLE

crossmark

MADDPG-D2: An Intelligent Dynamic Task Allocation Algorithm Based on Multi-Agent Architecture Driven by Prior Knowledge

Tengda Li, Gang Wang, Qiang Fu*

College of Air and Missile Defense, Air Force Engineering University, Xi’an, 710051, China

* Corresponding Author: Qiang Fu. Email: email

Computer Modeling in Engineering & Sciences 2024, 140(3), 2559-2586. https://doi.org/10.32604/cmes.2024.052039

Abstract

Aiming at the problems of low solution accuracy and high decision pressure when facing large-scale dynamic task allocation (DTA) and high-dimensional decision space with single agent, this paper combines the deep reinforcement learning (DRL) theory and an improved Multi-Agent Deep Deterministic Policy Gradient (MADDPG-D2) algorithm with a dual experience replay pool and a dual noise based on multi-agent architecture is proposed to improve the efficiency of DTA. The algorithm is based on the traditional Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm, and considers the introduction of a double noise mechanism to increase the action exploration space in the early stage of the algorithm, and the introduction of a double experience pool to improve the data utilization rate; at the same time, in order to accelerate the training speed and efficiency of the agents, and to solve the cold-start problem of the training, the a priori knowledge technology is applied to the training of the algorithm. Finally, the MADDPG-D2 algorithm is compared and analyzed based on the digital battlefield of ground and air confrontation. The experimental results show that the agents trained by the MADDPG-D2 algorithm have higher win rates and average rewards, can utilize the resources more reasonably, and better solve the problem of the traditional single agent algorithms facing the difficulty of solving the problem in the high-dimensional decision space. The MADDPG-D2 algorithm based on multi-agent architecture proposed in this paper has certain superiority and rationality in DTA.

Keywords


Cite This Article

APA Style
Li, T., Wang, G., Fu, Q. (2024). MADDPG-D2: an intelligent dynamic task allocation algorithm based on multi-agent architecture driven by prior knowledge. Computer Modeling in Engineering & Sciences, 140(3), 2559-2586. https://doi.org/10.32604/cmes.2024.052039
Vancouver Style
Li T, Wang G, Fu Q. MADDPG-D2: an intelligent dynamic task allocation algorithm based on multi-agent architecture driven by prior knowledge. Comput Model Eng Sci. 2024;140(3):2559-2586 https://doi.org/10.32604/cmes.2024.052039
IEEE Style
T. Li, G. Wang, and Q. Fu, “MADDPG-D2: An Intelligent Dynamic Task Allocation Algorithm Based on Multi-Agent Architecture Driven by Prior Knowledge,” Comput. Model. Eng. Sci., vol. 140, no. 3, pp. 2559-2586, 2024. https://doi.org/10.32604/cmes.2024.052039



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 654

    View

  • 317

    Download

  • 0

    Like

Share Link