Search Results (18)
  • Open Access

    ARTICLE

    DRL-Based Task Scheduling and Trajectory Control for UAV-Assisted MEC Systems

    Sai Xu1,*, Jun Liu1,*, Shengyu Huang1, Zhi Li2

    CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.071865 - 12 January 2026

    Abstract In scenarios where ground-based cloud computing infrastructure is unavailable, unmanned aerial vehicles (UAVs) act as mobile edge computing (MEC) servers to provide on-demand computation services for ground terminals. To address the challenge of jointly optimizing task scheduling and UAV trajectory under limited resources and high mobility of UAVs, this paper presents PER-MATD3, a multi-agent deep reinforcement learning algorithm that integrates prioritized experience replay (PER) into the Centralized Training with Decentralized Execution (CTDE) framework. Specifically, PER-MATD3 enables each agent to learn a decentralized policy using only local observations during execution, while leveraging a shared replay buffer with…

  • Open Access

    ARTICLE

    AquaTree: Deep Reinforcement Learning-Driven Monte Carlo Tree Search for Underwater Image Enhancement

    Chao Li1,3,#, Jianing Wang1,3,#, Caichang Ding2,*, Zhiwei Ye1,3

    CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.071242 - 12 January 2026

    Abstract Underwater images frequently suffer from chromatic distortion, blurred details, and low contrast, posing significant challenges for enhancement. This paper introduces AquaTree, a novel underwater image enhancement (UIE) method that reformulates the task as a Markov Decision Process (MDP) through the integration of Monte Carlo Tree Search (MCTS) and deep reinforcement learning (DRL). The framework employs an action space of 25 enhancement operators, strategically grouped for basic attribute adjustment, color component balance, correction, and deblurring. Exploration within MCTS is guided by a dual-branch convolutional network, enabling intelligent sequential operator selection. Our core contributions include: (1) a…

  • Open Access

    ARTICLE

    Research on Vehicle Joint Radar Communication Resource Optimization Method Based on GNN-DRL

    Zeyu Chen1, Jian Sun2,*, Zhengda Huan1, Ziyi Zhang1

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-17, 2026, DOI:10.32604/cmc.2025.071182 - 09 December 2025

    Abstract To address the issues of poor adaptability in resource allocation and low multi-agent cooperation efficiency in Joint Radar and Communication (JRC) systems under dynamic environments, an intelligent optimization framework integrating Deep Reinforcement Learning (DRL) and Graph Neural Network (GNN) is proposed. This framework models resource allocation as a Partially Observable Markov Game (POMG), designs a weighted reward function to balance radar and communication efficiencies, adopts the Multi-Agent Proximal Policy Optimization (MAPPO) framework, and integrates Graph Convolutional Networks (GCN) and Graph Sample and Aggregate (GraphSAGE) to optimize information interaction. Simulations show that, compared with traditional methods…

  • Open Access

    ARTICLE

    DRL-Based Cross-Regional Computation Offloading Algorithm

    Lincong Zhang1, Yuqing Liu1, Kefeng Wei2, Weinan Zhao1, Bo Qian1,*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-18, 2026, DOI:10.32604/cmc.2025.069108 - 10 November 2025

    Abstract In the field of edge computing, achieving low-latency computational task offloading with limited resources is a critical research challenge, particularly in resource-constrained and latency-sensitive vehicular network environments where rapid response is mandatory for safety-critical applications. In scenarios where edge servers are sparsely deployed, the lack of coordination and information sharing often leads to load imbalance, thereby increasing system latency. Furthermore, in regions without edge server coverage, tasks must be processed locally, which further exacerbates latency issues. To address these challenges, we propose a novel and efficient Deep Reinforcement Learning (DRL)-based approach aimed at minimizing average…

  • Open Access

    ARTICLE

    Evaluating Domain Randomization Techniques in DRL Agents: A Comparative Study of Normal, Randomized, and Non-Randomized Resets

    Abubakar Elsafi*

    CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.2, pp. 1749-1766, 2025, DOI:10.32604/cmes.2025.066449 - 31 August 2025

    Abstract Domain randomization is a widely adopted technique in deep reinforcement learning (DRL) to improve agent generalization by exposing policies to diverse environmental conditions. This paper investigates the impact of different reset strategies (normal, non-randomized, and randomized) on agent performance using the Deep Deterministic Policy Gradient (DDPG) and Twin Delayed DDPG (TD3) algorithms within the CarRacing-v2 environment. Two experimental setups were conducted: an extended training regime with DDPG for 1000 steps per episode across 1000 episodes, and a fast execution setup comparing DDPG and TD3 for 30 episodes with 50 steps per episode under constrained computational…

  • Open Access

    ARTICLE

    An IoT-Enabled Hybrid DRL-XAI Framework for Transparent Urban Water Management

    Qamar H. Naith1,*, H. Mancy2,3

    CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.1, pp. 387-405, 2025, DOI:10.32604/cmes.2025.066917 - 31 July 2025

    Abstract Effective water distribution and transparency are threatened with being outrightly undermined unless the good name of urban infrastructure is maintained. With improved control systems in place to check leakage, variability of pressure, and conscientiousness of energy, issues that previously went unnoticed are now becoming recognized. This paper presents a grandiose hybrid framework that combines Multi-Agent Deep Reinforcement Learning (MADRL) with Shapley Additive Explanations (SHAP)-based Explainable AI (XAI) for adaptive and interpretable water resource management. In the methodology, the agents perform decentralized learning of the control policies for the pumps and valves based on the real-time…

  • Open Access

    ARTICLE

    Enhancing Bandwidth Allocation Efficiency in 5G Networks with Artificial Intelligence

    Sarmad K. Ibrahim1,*, Saif A. Abdulhussien2, Hazim M. ALkargole1, Hassan H. Qasim1

    CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 5223-5238, 2025, DOI:10.32604/cmc.2025.066548 - 30 July 2025

    Abstract The explosive growth of data traffic and the heterogeneous service requirements of 5G networks, which cover Enhanced Mobile Broadband (eMBB), Ultra-Reliable Low Latency Communication (URLLC), and Massive Machine Type Communication (mMTC), present tremendous challenges to conventional methods of bandwidth allocation. A new deep reinforcement learning-based (DRL-based) bandwidth allocation system for real-time, dynamic management of 5G radio access networks is proposed in this paper. Unlike rule-based and static strategies, the proposed system dynamically updates itself according to shifting network conditions such as traffic load and channel conditions to maximize the achievable throughput, fairness, and compliance with QoS requirements. By using…

  • Open Access

    ARTICLE

    An SAC-AMBER Algorithm for Flexible Job Shop Scheduling with Material Kit

    Bo Li, Xiaoying Yang*, Zhijie Pei, Xin Yang, Yaqi Wu

    CMC-Computers, Materials & Continua, Vol.84, No.2, pp. 3649-3672, 2025, DOI:10.32604/cmc.2025.066267 - 03 July 2025

    Abstract It is well known that the kit completeness of parts processed in the previous stage is crucial for the subsequent manufacturing stage. This paper studies the flexible job shop scheduling problem (FJSP) with the objective of material kitting, where a material kit is a collection of components that ensures that a batch of components can be ready at the same time during the product assembly process. In this study, we consider completion time variance and maximum completion time as scheduling objectives, continue the weighted summation process for multiple objectives, and design adaptive weighted summation parameters…

  • Open Access

    ARTICLE

    DRL-AMIR: Intelligent Flow Scheduling for Software-Defined Zero Trust Networks

    Wenlong Ke1,2,*, Zilong Li1, Peiyu Chen1, Benfeng Chen1, Jinglin Lv1, Qiang Wang2, Ziyi Jia2, Shigen Shen1

    CMC-Computers, Materials & Continua, Vol.84, No.2, pp. 3305-3319, 2025, DOI:10.32604/cmc.2025.065665 - 03 July 2025

    Abstract Zero Trust Network (ZTN) enhances network security through strict authentication and access control. However, optimizing flow control in the ZTN to improve quality of service remains challenging. Software Defined Network (SDN) provides solutions through centralized control and dynamic resource allocation, but the existing scheduling methods based on Deep Reinforcement Learning (DRL) are insufficient in terms of convergence speed and dynamic optimization capability. To solve these problems, this paper proposes DRL-AMIR, an efficient flow scheduling method for software-defined ZTN. This method constructs a flow scheduling optimization model that comprehensively considers…

  • Open Access

    REVIEW

    A Comprehensive Overview and Comparative Analysis on Deep Learning Models

    Farhad Mortezapour Shiri*, Thinagaran Perumal, Norwati Mustapha, Raihani Mohamed

    Journal on Artificial Intelligence, Vol.6, pp. 301-360, 2024, DOI:10.32604/jai.2024.054314 - 20 November 2024

    Abstract Deep learning (DL) has emerged as a powerful subset of machine learning (ML) and artificial intelligence (AI), outperforming traditional ML methods, especially in handling unstructured and large datasets. Its impact spans across various domains, including speech recognition, healthcare, autonomous vehicles, cybersecurity, predictive analytics, and more. However, the complexity and dynamic nature of real-world problems present challenges in designing effective deep learning models. Consequently, several deep learning models have been developed to address different problems and applications. In this article, we conduct a comprehensive survey of various deep learning models, including Convolutional Neural Network (CNN), Recurrent…

Displaying 1-10 on page 1 of 18.