Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs

Xiaoping Yang; Songjie Yang; Junqi Long; Quanzeng Wang; Bin Yang; Xiaofang Cao; Guochao Qi

doi:10.32604/cmc.2026.078105

icon Open Access

ARTICLE

Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs

Xiaoping Yang^1,*, Songjie Yang², Junqi Long¹, Quanzeng Wang³, Bin Yang⁴, Xiaofang Cao⁵, Guochao Qi⁶

1 College of Computer Science, Beijing University of Technology, Beijing, China
2 National Key Laboratory of Wireless Communications, University of Electronic Science and Technology of China, Chengdu, China
3 China Unicom Software Research Institute, Beijing, China
4 Center for Strategic Assessment and Consulting, Academy of Military Science, Beijing, China
5 School of Business, Beijing Wuzi University, Beijing, China
6 Beijing Institute of Computer Technology and Application, Beijing, China

* Corresponding Author: Xiaoping Yang. Email: email

(This article belongs to the Special Issue: Advances in Wireless Sensor Networks: Security, Efficiency, and Intelligence)

Computers, Materials & Continua 2026, 88(1), 21 https://doi.org/10.32604/cmc.2026.078105

Received 24 December 2025; Accepted 12 February 2026; Issue published 08 May 2026

Abstract

The rapid proliferation of latency-sensitive applications, coupled with the limitations of service range, has driven the integration of aerial simultaneously transmitting and reflecting reconfigurable intelligent surfaces (ASTAR-RIS) and task offloading to enhance both communication and computational efficiency in wireless sensor networks (WSNs). However, in WSNs, conventional ASTAR-RIS-assisted task offloading faces critical limitations, including restricted endurance, underutilized network caching and computing resources, and inefficient resource allocation within the optimization framework. To overcome these challenges, this paper integrates wireless power transfer (WPT) technology and proposes a novel energy-efficient ASTAR-RIS and WPT-assisted task offloading and content caching framework for WSNs. Furthermore, we construct a minimization problem that jointly optimizes content caching, energy harvesting time, task offloading, and STAR-RIS resource allocation decisions to minimize energy consumption. Due to its inherently non-convex structure, the problem is addressed by separating it into four subproblems involving content caching, energy harvesting time, task offloading, and STAR-RIS resource allocation decisions. To address the above subproblems, a joint deep reinforcement learning (DRL)–successive convex approximation (SCA) based scheme is designed, which iteratively achieves the solution and attains near-optimal performance with relatively low computational complexity. Simulation results show that the proposed framework achieves more efficient resource utilization in WSNs and markedly lowers the total energy consumption of the system.

Keywords

Aerial simultaneously transmitting and reflecting reconfigurable intelligent surfaces; task offloading; wireless sensor networks; wireless power transfer; content caching

1 Introduction

With the rapid advancement of wireless sensor networks (WSNs), the explosive increase in the number of sensor devices has dramatically boosted the requirements for high transmission rates and ultra-low latency services [1]. This growth is partly due to emerging service paradigms, e.g., autonomous driving, augmented and virtual reality, and online game applications [2]. These emerging applications impose a great challenge for traditional networks based on centralized cloud-computing frameworks [3]. In order to effectively tackle this challenge, mobile edge computing (MEC) has arisen as a favorable solution by extending the computational capacity from the central cloud to the network edge cloud, which allows mobile devices (MDs) to offload workloads to proximate edge servers, cutting local energy expenditure and relieving computational burden [4]. Nevertheless, in traditional MEC systems, edge servers and base stations (BSs) are installed at fixed locations on the ground, which have two main disadvantages [5]. First, the quality of service cannot be guaranteed for MDs in remote areas or those blocked by obstacles. Second, terrestrial MEC links often experience severe signal attenuation, so uplink transmission performance remains unsatisfactory.

Unmanned aerial vehicle (UAV) assisted task offloading in WSNs, leveraging the controllable flexibility of UAVs, offers a promising solution to address the aforementioned challenges [6]. To take advantage of the superiority of the UAV, the lightweight edge server can be installed on the UAV, bringing computation physically closer to the users and thus boosting overall system performance [7]. Although UAV-assisted task offloading can effectively enhance the computing capacity of WSNs, existing UAV-assisted offloading frameworks are designed to adapt to uncontrollable random wireless channels, which significantly limits the task offloading efficiency [8]. To break this performance bottleneck caused by random wireless channels, the technology of simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) provides a promising solution [9]. STAR-RIS is capable of splitting the incoming signal into two parts, with one portion being reflected back in the direction of the incident signal and the other segment being transmitted in the opposite direction, providing omnidirectional 360∘ coverage [10]. Therefore, integrating the STAR-RIS into the UAV-assisted offloading systems is viewed as a win-win strategy, simultaneously strengthening link reliability and boosting edge-computing performance. Unfortunately, both UAVs and STAR-RIS face their respective deficiencies, e.g., UAVs are constrained by finite endurance, and the application of STAR-RIS still faces limited coverage owing to its fixed deployment.

To address these challenges, mounting STAR-RIS on the UAV to constitute the aerial STAR-RIS (ASTAR-RIS) to support terrestrial communication is a promising approach [11]. Compared with the UAV-carried server scheme, the ASTAR-RIS architecture constitutes a seamless upgrade to the traditional land server-based task offloading system without the need for routing modifications and system reconstruction [12]. As the STAR-RIS is lighter than an onboard edge server, it imposes only a negligible energy burden on the UAV [13]. In parallel, to further boost the sustainability and computing capacity of the UAV-assisted task offloading system, the wireless power transfer (WPT) technology has been proposed as a promising solution [14]. In this framework, the UAV and user devices harvest energy from radio frequency (RF) signals transmitted by a dedicated energy source and then utilize the harvested energy to support the local processing or task offloading of tasks [15].

1.1 Related Works

1.1.1 UAV-Assisted Task Offloading for WSNs

In recent years, although MEC has provided an effective means to enhance WSN computing capabilities, traditional deployment strategies that place MEC servers near ground BSs or access points (APs) often suffer from limited service coverage [5]. To address this shortcoming, UAVs have emerged as promising offloading assistants, owing to their inherent advantages such as exceptional mobility and flexibility [16–18]. Xu et al. [16] proposed a new system with the cooperation of multiple ground servers and one UAV server, which makes full use of the mobility of UAVs to compensate for the weakness of signal fading between ground servers and users and enhances the computing power of the network. Although UAVs can effectively improve the computing capacity of WSNs, the existing UAV-assisted task offloading frameworks are designed by adapting to uncontrollable random wireless channels, which seriously limits the task offloading efficiency.

1.1.2 STAR-RIS-Assisted Task Offloading for WSNs

STAR-RIS technology has recently emerged as a promising solution to address these challenges [9,19]. STAR-RIS splits the incoming signals into two distinct parts, one reflected back toward the incident direction and another transmitted toward the opposite direction, providing omnidirectional 360∘ coverage. Some studies mount the STAR-RIS on building façades to assist either the uplink offloading of tasks from ground users to an edge node or the downstream forwarding from the edge node to a terrestrial BS [20–22]. Eghbali et al. [20] proposed an integrated sensing and communication framework, and STAR-RIS, fixed on the exterior walls of buildings, enhances the channel performance between UAVs and users, effectively assisting UAVs in providing high-quality services to mobile users and sensing targets. Yang et al. [22] investigated energy-aware task offloading for rotatable STAR-RIS-enhanced MEC, jointly optimizing the STAR-RIS rotation angle and computing resource allocation to reduce system energy consumption. Despite the notable performance improvements offered by these designs, two key drawbacks persist. First, conventional, fixed-location STAR-RIS deployment severely restricts coverage and flexibility. Second, UAVs that simultaneously perform edge computing and relay functions must carry complex and heavy hardware.

1.1.3 ASTAR-RIS-Assisted Task Offloading for WSNs

To fully exploit the high-mobility potential of UAVs, Li et al. [23] have proposed mounting RIS on UAV platforms, enabling flexible aerial deployment to significantly enhance link quality. However, conventional RIS architectures are constrained by a reflection-only mode and inherently provide coverage over only a half-space region. To overcome this limitation, an aerial STAR-RIS (ASTAR-RIS) architecture that integrates STAR-RIS with UAVs has been proposed. Compared with fixed STAR-RIS deployments mounted on building facades, the ASTAR-RIS architecture offers superior deployment flexibility and practical value, thereby effectively improving task offloading efficiency in WSNs [11]. Aung et al. [11] explored the architecture of mounted STAR-RIS on the UAV, which effectively expanded the service coverage of the system by fully leveraging the mobility of UAVs and the channel control capability of STAR-RIS and minimizing the overall energy consumption of IoT devices and ASTAR-RIS. Collectively, these works confirm that the ASTAR-RIS architecture can dynamically reposition to optimize channel conditions and extend network service coverage, minimizing the system’s energy consumption.

1.2 Motivation and Contributions

Motivated by these key technologies, the ASTAR-RIS architecture has emerged as a promising solution for enhancing communication performance in WSNs. Although ASTAR-RIS-assisted offloading systems can substantially extend service coverage and improve wireless transmission quality, several critical challenges remain. First, the limited energy storage capacity of UAV batteries continues to impose a strict bound on their flight endurance. Second, the intrinsic caching and computing capabilities are not fully exploited, resulting in inefficient utilization of network resources and a deterioration in overall system performance. Addressing these issues provides the core motivation for this work. The main contributions of this paper are summarized as follows:

• In this paper, we propose an energy-efficient ASTAR-RIS and WPT-assisted task offloading and content caching framework aimed at minimizing the system energy consumption for WSNs. The proposed framework delivers adaptive and continuous computing and caching services. Moreover, we introduce the ASTAR-RIS architecture, where the STAR-RIS is mounted on the UAV to enable dynamic positioning and to optimize communication channels. Lastly, utilize WPT technology, enabling UAVs and user devices to harvest energy from dedicated RF signals, significantly improving the system’s self-sustainability and computational capacity.

• Due to the minimization problem of energy consumption, the inherently non-convex structure, the problem is addressed by separating it into four subproblems involving content caching, energy harvesting time, task offloading, and STAR-RIS resource allocation decisions. To address the above subproblems, a joint deep reinforcement learning (DRL)–successive convex approximation (SCA) based scheme is designed, which iteratively achieves the solution and attains near-optimal performance with relatively low computational complexity.

• Simulation results show that the proposed energy-efficient ASTAR-RIS and WPT-assisted task offloading and content caching framework achieves more efficient resource utilization in WSNs and markedly lowers the total energy consumption of the system, outperforming benchmark solutions, particularly in scenarios with limited network resources or computationally intensive tasks.

The remainder of this paper is structured as follows. Section 2 begins by presenting the system model and formulating the core optimization problem. Building upon this foundation, Sections 3 and 4 detail the proposed optimization algorithm and its iterative solution procedure. Subsequently, the convergence analysis of the devised DRL-SCA algorithm is provided in Section 5. Section 6 then presents the simulation setup and discusses the corresponding numerical results. Finally, the paper concludes with a summary in Section 7.

2 System Model and Problem Formulation

This section presents the system models for the proposed ASTAR-RIS and WPT-assisted WSNs, including network, communication, energy harvesting, task offloading, and caching models, as well as an analysis of system energy consumption. Building on these models, we then formulate the corresponding optimization problem for minimizing the total system energy consumption.

2.1 Network Description

As illustrated in Fig. 1, we consider an ASTAR-RIS and WPT-assisted WSN comprising a single BS, a UAV integrated with a STAR-RIS, and a set of single-antenna user devices (UDs) [24]. The direct links between the UDs and the BS are assumed to be blocked. Consequently, a STAR-RIS is introduced to facilitate connectivity by jointly reflecting and transmitting signals. It is implemented as a uniform planar array (UPA) containing Mc × Mr passive units and operates in the energy-splitting (ES) mode [25] for flexible beam manipulation. Service provision for the UD set 𝒟={1,2,…,D} is supported by both the BS b and the UAV u. In particular, a computing server and an RF energy transmitter are integrated at the BS so that it can broadcast wireless energy to the UAV and UDs and execute computational tasks, while the UAV and UDs have rechargeable batteries, energy harvesting (EH) circuit components, and computing servers that can store the harvested energy to power their operation. Moreover, caching servers are integrated at the BS and the UAV so that they can cache computational results in advance and reduce computing resource consumption. Assuming that the transceivers of both UDs and the UAV operate in half-duplex mode, task offloading and local processing can only start after the EH phase is completed, as per the harvest-then-transmit protocol [26]. To ensure energy sustainability, the total energy consumed by the UAV and each UD for task processing and offloading is constrained to be no greater than the amount of energy harvested from the RF signals [27].

images

Figure 1: System model of the ASTAR-RIS and WPT-assisted WSNs.

In the task scenario description, when the task application programs and their related parameter data are cached by the UAV u or BS b, the system directly processes the caching task and returns the task results. By contrast, in scenarios where the relevant data are not stored in the system cache, we adopt a partial offloading strategy for computation tasks that are sensitive to latency. This task offloading framework enables parallel processing of tasks across three entities: the UDs, UAV u, and BS b.

In this paper, we consider a quasi-static service period during which the UAV hovers at a fixed location. This modeling choice is adopted to keep the problem tractable and to highlight the proposed WPT-assisted ASTAR-RIS framework. Note that the joint optimization of UAV hovering location with STAR-RIS passive beamforming, task offloading, and content caching has been thoroughly investigated in our prior work [28]. Therefore, we treat the hovering location as a given parameter and focus on the novel joint optimization of WPT-enabled energy harvesting time, STAR-RIS resource allocation, content caching, and task offloading decisions. It is worth noting that the proposed framework exhibits inherent adaptability to dynamic channel conditions through the real-time reconfiguration of STAR-RIS passive beamforming. The iterative DRL-SCA algorithm updates resource allocation decisions based on current channel state information, enabling the system to maintain energy efficiency under time-varying wireless environments.

2.2 Communication Model

We first derive the uplink data rates corresponding to task offloading from UDs to the UAV and BS, as well as the channel gains associated with the BS’s wireless energy broadcast to the UAV and UDs. It is presumed that UDs remain relatively stationary throughout the task offloading process. To characterize the positions of UD d, UAV u, STAR-RIS, and BS b, a three-dimensional (3D) cartesian coordinate system is constructed. Specifically, the locations of UD d and BS b are represented by the vectors rd=(xd,yd,0) and rd=(xd,yd,0), respectively. Additionally, we suppose that in each time slot, the UAV u hovers at a fixed position to deliver services to the UDs. Let the 3D coordinate of the UAV u and the (mc,mr)-th STAR-RIS element are denoted as ru=(xu,yu,zu) and r(mc,mr)=(x(mc,mr),y(mc,mr),z(mc,mr)), respectively. All considered links, between UD d and each STAR-RIS element, UD d and UAV u, and each STAR-RIS element and BS b, are all modeled as line-of-sight (LoS) channels and follow a free-space path-loss model [29]. Consequently, the complex channel gain between any two distinct nodes i and i′ (where i∈{d,(mc,mr),b}, i′∈{u,(mc,mr),b,d}) is expressed as hi,i′=q0Si,i′−2, where q0 is the reference received power at a reference distance of one meter under unit transmit power, and Si,i′ denotes the Euclidean distance between nodes i and i′, given by Si,i′=(xi−xi′)2+(yi−yi′)2+(hi−hi′)2.

In the system, an orthogonal frequency-division multiple access (OFDMA) scheme is considered to eliminate the intra-cell interference from nodes. Consequently, for the wireless link from node i to node i′, the signal-to-noise ratio (SNR) is given by γi,i′=Pi|hi,i′|2σi,i′2. The Pi denotes the node i’s transmit power, and σi,i′2 is the noise variance at the receiver, which is assumed to be constant variance for the considered links. Accordingly, this SNR then yields the achievable transmission rate Ri,i′=Bi,i′log2⁡(1+γi,i′), with Bi,i′ denoting the allocated bandwidth.

2.2.1 Channel in UAV Task Offloading and in UAV Energy Harvesting

Equipped with an MEC server, UAV u offers additional computing capacity to nearby UDs. The received signal is given by Vu=∑d∈𝒟hd,uPdsd+nu, where sd satisfies E{|sd|2}=1 and nu∼𝒩(0,σu2) is the AWGN. Consequently, the signal-to-noise ratio γd,u and transmission rate Rd,u of the UD-to-UAV link can be derived accordingly. Correspondingly, the signal-to-noise ratio γb,u and transmission rate Rb,u of the BS-to-UAV link can be derived accordingly.

2.2.2 Channel in BS Task Offloading and in UD Energy Harvesting

Due to its limited onboard energy and the task-accomplishing deadline constraints, UAV u is only capable of executing a portion of the residual tasks it receives, while the remaining part must be relayed to the BS b via the STAR-RIS for further processing. For the STAR-RIS-assisted link, we denote g(mc,mr),b as the channel gain from its (mc,mr)-th element to the BS. Hence, the composite channel involves two segments: the UD–STAR-RIS link hd,s and the STAR-RIS–BS link gs,bH. The STAR-RIS response to the signal from UD d is modeled by separate coefficient matrices Θr (reflection) and Θt (transmission), each parameterized as Θa=diag(β(1,1)aejθ(1,1)a,…,β(Mc,Mr)aejθ(Mc,Mr)a). The β(mc,mr)a and θ(mc,mr)a represent the amplitude and phase shift of element (mc,mr) for mode a∈{r,t}. The ES mode imposes the feasibility condition ∑a∈{r,t}β(mc,mr)a≤1.

Accordingly, the composite signal at BS b is obtained as Vb=∑d∈𝒟gs,bHΘahd,sPdsd+nb, with nb=n∼𝒩(0,σb2). Here, a=t if UD d is in the transmission region, and a=r otherwise. Consequently, the SNR γd,b and data rate Rd,b for the UD d-to-BS b link are functions of the effective channel gain gs,bHΘahd,s. Correspondingly, the downlink SNR γb,d and data rate Rb,d from BS b to UD d are determined by the effective channel gain gs,dHΘahb,s through the STAR-RIS.

Since the achievable rates Ri,i′ are strongly coupled with the instantaneous SNRs, poor coverage or low-SNR conditions can significantly tighten the energy and latency budgets. Following the insights in Ref. [30], we incorporate adaptive resource allocation via the joint optimization of Th, α, and Θ to mitigate performance degradation under unfavorable channel conditions.

2.3 Energy Harvesting Model

Let Tuh denote the energy harvesting duration of the UAV during the EH phase, Tdh denote the energy harvesting duration of the UDs during the EH phase, and Pb denote the transmit power of the BS, where 0≤Pb≤Pb,max and Pb,max is the maximum allowed transmission power at the BS. Then, the harvested energy of the UAV u during the EH phase can be computed as Eb,uh=TuhηPbhb,u. hb,u is the channel power gain between the BS b and the UAV u and η (0≤η≤1) denotes the energy conversion efficiency. Similarly, the harvested energy of the UD d during the EH phase can be computed as Eb,dh.

2.4 Task Offloading Model

In this paper, we assume that the computing task is divisible, which means that the task can be divided into more parts. Thus, some computing parts can be first handled locally at the UD, and then some parts can be processed at the UAV u, while the remaining parts can be processed at the BS b. Considering the offloading of task k from UD d, we introduce a continuous decision variable αd,j′k∈[0,1], where j′∈{d,u,b} denotes the candidate execution node (local device, UAV, or BS). This variable represents the fraction of the task k offloaded to node j′. Therefore, the overall offloading strategy is defined by the vector α={αd,dk,αd,uk,αd,bk}, subject to the constraint ∑j′∈{d,u,b}αd,j′k=1.

When task k is uncached, its service latency consists of two components: the uplink transmission time and the remote computation time. In contrast, the downlink delay is ignored because the UAV and BS typically transmit with relatively high power, and the computation results are small in size, making the download time negligible [31]. Specifically, the corresponding energy consumption for offloaded task k from UD d to node j over the uplink is then given by Ed,jtr,k=Pdαd,jkLkRd,j. Furthermore, the energy consumption for computing the offloaded fraction αd,j′k of task k at node j′ can be expressed as Ed,j′com,k=κj′αd,j′kωkfj′fj′3, with ωk, fj′, and κj′ representing the task’s CPU cycle demand, the node’s CPU frequency, and its architecture-dependent capacitance coefficient, respectively.

2.5 Caching Model

The content caching strategy is defined by a binary decision variable Xjk∈{0,1}, where Xjk=1 indicates task k is cached at node j, and 0 otherwise. The overall caching strategy is characterized by the vector X={Xj1,Xj2,…,Xjk}. Based on this caching decision, we further characterize the execution delay and energy consumption at the UAV u and the BS b under both cache-hit and cache-miss conditions.

When task k is cached at node j(Xjk=1), the corresponding execution delay is denoted by Td,jcache,k and is given by Td,jcache,k=ωk′fj. The ωk′ denotes the CPU cycle count required for the caching operation of task k. Since this computation is performed at the UAV u or the BS b, the major energy consumption is incurred at these nodes, and the energy consumption at the UDs is neglected. Moreover, because the energy required for caching is relatively small compared with that for communication and computation, the UAV is assumed to supply this energy from its onboard battery without relying on the harvested wireless power. Therefore, the energy consumption for caching task k, denoted Ed,jcache,k, is Ed,jcache,k=κjωk′fj2. The node j’s effective capacitance coefficient κj, which depends on the chip architecture of the processor, determines the caching energy consumption.

2.6 Problem Formulation

From the established models of communication, energy harvesting, computation, and caching, we derive the expressions for the total delay Tdk and energy consumption Edk incurred when UD d requests and obtains the result of task k:

Tdk=∑j∈{u,b}XjkTd,jcache,k+(1−∑j∈{u,b}Xjk)max{max{Tuh,Tdh}+Td,utr,k+Td,ucom,k,Tdh+Td,btr,k+Td,bcom,k,Tdh+Td,dcom,k},(1)

Edk=∑j∈{u,b}XjkEd,jcache,k+(1−∑j∈{u,b}Xjk)(Eb,uh+Eb,dh+Ed,bcom,k).(2)

The total energy consumption consists of three primary components: (i) When the network system caches the task (Xjk = 1), the energy consumption associated with caching the task in the network system is considered; (ii) The energy transmitted to the UAV and UDs through the RF signals sent by the BS; (iii) When the network system does not cache the task (Xjk=0), the energy consumption is attributed to computation energy consumption of the BS. This paper aims to minimize the total system energy consumption in ASTAR-RIS and WPT-assisted WSNs. This entails the joint optimization of four key variables: the content caching decision X, energy harvesting time Th, task offloading decision α, and the STAR-RIS resource allocation decision Θ. Accordingly, based on (2), the corresponding total energy minimization problem can be formulated as

𝒫1: minX,Th,α,Θ∑d∈𝒟∑k∈𝒦Edk(3)

s.t. ∑d∈𝒟Ed,ucom,k≤Eb,uh,(3a)

(Ed,utr,k+Ed,btr,k+Ed,dcom,k)≤Eb,dh,(3b)

αd,j′k∈[0,1],∀d,j′∈{d,u,b},(3c)

∑j′∈{d,u,b}αd,j′k=1,(3d)

θ(mc,mr)a∈[0,2π),∀a,∀mc,∀mr,(3e)

∑a∈{r,t}β(mc,mr)a≤1,β(mc,mr)a∈[0,1],∀a,∀mc,∀mr,(3f)

∑k∈𝒦XjkSk≤Oj,∀k,∀j,(3g)

∑k∈𝒦αd,j′kωk≤fj′,∀k,∀d,∀j′,(3h)

∑j∈{u,b}Xjk≤1,∀k,∀j,(3i)

Xjk∈{0,1},∀k,∀j,(3j)

Tdk≤Tdk,max,∀d,∀k.(3k)

Constraints (3a) and (3b) ensure that, at each node, the total energy spent on local computation and task offloading does not exceed the energy harvested during the EH phase. Constraint (3c) restricts the offloading ratio of each node to lie within ([0, 1]). Constraint (3d) enforces that, for each task k, the offloading ratios across all possible destinations sum to one. Constraints (3e) and (3f) specify the magnitude and phase requirements, respectively, for the (mc,mr)-th STAR-RIS element applied to the incident signal. The inequality constraint in (3f) follows from the law of energy conservation, which states that the total output power of passive STAR-RIS elements cannot exceed the input power [25]. This formulation is consistent with the general hardware model proposed in [32], where practical insertion losses and design flexibility are accounted for. Constraint (3g) states that each node’s total cached content size cannot be greater than its maximum caching capacity. The maximum computing capability of a node cannot be exceeded by the total computing resources used at the node to handle tasks, according to constraint (3h). Constraint (3i) ensures that the same content must not be stored redundantly at both the UAV and BS nodes to achieve cooperative caching. Constraint (3j) restricts the range of all boolean variables to either 0 or 1. Constraint (3k) means that all tasks should be accomplished in the maximum tolerable time.

It is important to note that the energy minimization objective is subject to quality-of-service constraints that ensure system reliability. Specifically, constraint (3k) guarantees task completion within the maximum tolerable delay, while constraints (3a) and (3b) ensure energy sustainability. The STAR-RIS beamforming optimization further enhances transmission reliability by improving effective channel gains. Therefore, the proposed framework achieves energy efficiency without compromising task completion time or data transmission reliability.

In UAV-mounted STAR-RIS systems, mobility may introduce propulsion-energy and endurance trade-offs [33]. In this work, we adopt a quasi-static hovering model during the service period and focus on the WPT-enabled energy-sustainability constraints and the joint optimization of (X,Th,α,Θ), where WPT further improves the UAV’s energy sustainability via RF energy harvesting.

Following the schematic in Fig. 2, problem (3) is addressed by an alternating optimization strategy that partitions it into three subproblems:

images

Figure 2: The joint optimization scheme for minimizing the total energy consumption.

• Caching decision subproblem: With Th, α, and Θ fixed at their initial values Th0, α0, and Θ0, problem (3) reduces to optimizing the caching decision vector X for total energy minimization, which is addressed using a DRL algorithm to obtain the optimal solution X∗.

• Energy harvesting time subproblem: With X, α, and Θ fixed at X∗, α0, and Θ0, problem (3) reduces to minimizing the total system energy consumption by optimizing the energy harvesting time vector Th, which is solved analytically via the karush-kuhn-tucker (KKT) conditions to obtain the optimal solution Th∗.

• Offloading decision subproblem: With X, Th, and Θ fixed at X∗, Th∗, and Θ0, problem (3) reduces to minimizing the total system energy consumption over the offloading decision vector α, which is solved via the KKT conditions to yield the optimal solution α∗.

• STAR-RIS resource allocation subproblem: With X, Th, and α fixed at their optimal values X∗, Th∗, and α∗, the problem reduces to minimizing the total system energy consumption over the STAR-RIS resource allocation decision Θ, which is solved via the SCA method to obtain the optimal solution Θ∗.

3 Content Caching Decision Optimization

Due to its dependence only on X and not on other variables in 𝒫1, the caching decision optimization can be addressed beforehand once the energy harvesting time, offloading decisions, and passive beamforming are fixed. This leads to the following subproblem formulation:

𝒫2: minX∑d∈𝒟∑k∈𝒦Edks.t. (3g)−(3k)(4)

Since X is a continuous probability vector, 𝒫2 belongs to a continuous nonlinear programming problem. Caching decision optimization is a dynamically adjusted optimization problem rather than a static convex optimization problem. Owing to the highly non-convex and coupled structure of the problem, conventional optimization methods are prone to getting trapped in local optima rather than locating the global optimum. To overcome this limitation, we employ the twin delayed deep deterministic policy gradient (TD3) algorithm, a representative state-of-the-art DRL approach, to tackle the caching decision subproblem. TD3 is capable of handling continuous probability vectors and is designed to learn optimal caching policies in complex environments. In this context, the entire caching system is modeled as an agent that interacts with the environment. This interaction enables the agent to approximate an optimal caching policy over time. To achieve better optimization performance, we design the state space, action space, and reward function in the proposed intelligent caching markov decision process (MDP) model as follows:

• State: At time slot (t), the agent state is defined as st={𝒫t,𝒬t,𝒞t,ℰt,𝒪t}. 𝒫t denotes the content popularity, modeled by a time-varying zipf distribution; 𝒬t=[𝒬1,t,𝒬2,t,…,𝒬K,t] captures the historical request frequency; 𝒞t=[𝒞u,t,𝒞c,t] describes the caching status at the UAV and the BS; ℰt=[ℰd1,t,ℰd2,t,…,ℰD,t,ℰu,t] characterizes the energy harvesting status of all UDs and the UAV; The term 𝒪t represents the relevant network topology information at slot t.

• Action: In the caching decision stage, the continuous action at encodes the caching probabilities at both the UAV and the BS, at=[π1,t+1u,…,πK,t+1u,π1,t+1b,…,πK,t+1b]. The πk,t+1u and πk,t+1b represent the probabilities of placing content k in the UAV cache and BS cache, respectively. Then select top-KUAV and top-KBS contents respectively based on probabilities.

• Reward: The reward function is a key component for steering the agent’s exploration in the caching update task and for ensuring stable convergence. Accordingly, at time slot t, we define rt=λ1∑j∈{u,c}𝒜tj−λ2Wt. The λ1 and λ2 are the weight parameters used to balance different optimization objectives; The variable 𝒜tj captures the cache-hit count at node j (UAV or BS) during slot t, serving as a direct metric for caching efficiency; and Wt denotes the number of cache switching operations, which incurs energy consumption of storage overhead and computational cost, thus requiring minimization.

The TD3-based caching policy dynamically adapts to varying task types and urgency through the comprehensive state representation. The content popularity 𝒫t and historical request frequency 𝒬t enable the agent to learn temporal patterns and prioritize time-sensitive or frequently requested content. The energy harvesting status ℰt further allows the policy to balance caching benefits against energy constraints. During each decision epoch, the agent evaluates the current state and outputs caching probabilities that reflect the expected value of caching each content type, thereby achieving dynamic adaptation to heterogeneous task characteristics.

4 Energy Harvesting Time, Offloading Decision, and STAR-RIS Resource Allocation

After the caching policy is obtained by the TD3-based module, the energy harvesting time Th, task offloading vector α, and STAR-RIS resource allocation decision Θ are further refined through an iterative procedure that combines the KKT conditions with SCA. Accordingly, problem (3) can be reformulated as

𝒫3: minTh,α,Θ∑d∈𝒟∑k∈𝒦(∑j∈{u,b}Xjk∗Ed,jcache,k+(1−∑j∈{u,b}Xjk∗)(Eb,uh+Eb,dh+Ed,bcom,k))s.t. (3a)−(3f),(3h),(3k),∑k∈𝒦Xjk∗Sk≤Oj,∀k,∀j,(5a)

Xjk∗∈{0,1},∀k,∀j,(5b)

∑j∈{u,b}Xjk∗≤1,∀k,∀j.(5c)

4.1 Energy Harvesting Time

Given that the content caching strategy X=X∗, offloading decision α=α0, and STAR-RIS resource allocation Θ=Θ0 are fixed, the energy harvesting time Th={Tuh,Tdh} optimization problem is formulated to minimize the total energy consumption. The problem (5) is rewritten as:

minTuh,Tdh∑d∈𝒟∑k∈𝒦(TuhηPbhb,u0+TdhηPbhb,d0)(6a)

s.t.∑d∈𝒟Ed,ucom,k≤TuhηPbhb,u0,∀d,∀k,(6b)

Ed,utr,k+Ed,btr,k+Ed,dcom,k≤TdhηPbhb,d0,∀d,∀k,(6c)

Tdk≤Tdk,max,∀d,∀k,(6d)

where hb,d0 is determined by the given STAR-RIS parameters Θ0. The corresponding lagrangian of problem (7), denoted by L(Tuh,Tdh,λ,ν,ξ), can be expressed as:

ℒ=∑d∈𝒟∑k∈𝒦(TuhηPbhb,u0+TdhηPbhb,d0+λd,k(∑d∈𝒟Ed,ucom,k−TuhηPbhb,u)+νd,k(Ed,utr,k+Ed,btr,k+Ed,dcom,k−TdhηPbhb,d0)+ξd,k(Tdk−Tdk,max)).(7)

The optimal energy harvesting time vector Th∗=(Tuh∗,Tdh∗) can be derived from the KKT conditions as follows:

Th∗=argmin{ T˜uh,T˜dh,λ˜,ν˜,ξ˜ }EEHtotal(X∗,Tuh,Tdh,α0,Θ0).(8)

The feasible solution set ℳEH can be derived by solving the KKT conditions (9). Any point {T~uh,T~dh,λ~,ν~,ξ~} satisfying these conditions belongs to ℳEH, and the optimal energy harvesting times correspond to the solution presented in (8).

∂L∂Tuh|Tuh=T~uh,λ=λ~,ν=ν~,ξ=ξ~=0,(9a)

∂L∂Tdh|Tdh=T~dh,λ=λ~,ν=ν~,ξ=ξ~=0,(9b)

∑d∈𝒟Ed,ucom,k≤TuhηPbhb,u0,∀d,∀k,(9c)

Ed,utr,k+Ed,btr,k+Ed,dcom,k≤TdhηPbhb,d0,∀d,∀k,(9d)

Tdk≤Tdk,max,∀d,∀k,(9e)

λ~d,k(∑d∈𝒟Ed,ucom,k−TuhηPbhb,u)=0,∀d,∀k,(9f)

ν~d,k(Ed,utr,k+Ed,btr,k+Ed,dcom,k−TdhηPbhb,d0)=0,∀d,∀k,(9g)

ξ~d,k(Tdk−Tdk,max)=0,∀d,∀k,(9h)

λ~d,k≥0,ν~d,k≥0,∀d,ξ~d,k≥0,∀k.(9i)

4.2 Offloading Decision

Given that the caching decision X=X∗, energy harvesting time Th=Th∗, and STAR-RIS resource allocation Θ=Θ0 are fixed, the offloading decision α optimization problem aims to minimize the total energy consumption. The problem (5) is reformulated as:

minα∑d∈𝒟∑k∈𝒦((1−∑j∈{u,b}Xjk∗)κbαd,bkωkfb2)s.t.(3a)−(3d),(3h),(3k).(10a)

The Lagrange function is constructed as:

ℒ=∑d∈𝒟∑k∈𝒦((1−∑j∈{u,b}Xjk∗)κbαd,bkωkfb2+ηd,k(∑d∈𝒟Ed,ucom,k−Eb,uh)+∑j′∈{d,u,b}εd,k1(αd,j′k−1)+μd,k(Ed,utr,k+Ed,btr,k+Ed,dcom,k−Eb,dh)+∑j′∈{d,u,b}εd,k2(−αd,j′k)+χd,k(∑j′∈{d,u,b}αd,j′k−1)+ζd,k(∑k∈𝒦αd,j′kωk−fj′)+ξd,k(Tdk−Tdk,max))(11)

The optimal offloading vector α∗, obtained via the KKT conditions, can be expressed as:

α∗=arg⁡min{α~,η~,μ~,ε1~,ε2~,χ~,ζ~,ξ~} EOFFtotal(X∗,Th∗,α,Θ0).(12)

Any point {α~,η~,μ~,ε1~,ε2~,χ~,ζ~,ξ~} that satisfies the KKT conditions (13) lies within the feasible set ℳOFF. Consequently, solving (13) allows ℳOFF to be derived, and the optimal offloading decision given in (12) to be obtained.

∂ℒ∂αd,j′k|αd,j′k=α~d,j′k,η=η~,μ=μ~,ε1=ε1~,ε2=ε2~,χ=χ~,ζ=ζ~,ξ=ξ~=0,(13a)

(3a)−(3d),(3h),(3k),η~d,k(∑d∈𝒟Ed,ucom,k−Eb,uh)=0,∀d,∀k,(13b)

μ~d,k(Ed,utr,k+Ed,btr,k+Ed,dcom,k−Eb,dh)=0,∀d,∀k,(13c)

ε~d,k1(αd,j′k−1)=0,∀d,∀j′,∀k,(13d)

ε~d,k2αd,j′k=0,∀d,∀j′,∀k,(13e)

χ~d,k(∑j′∈{d,u,b}αd,j′k−1)=0,∀d,∀j′,∀k,(13f)

ζ~d,k(∑k∈𝒦αd,j′kωk−fj′)=0,∀d,∀j′,∀k,(13g)

ξ~d,k(Tdk−Tdk,max)=0,∀d,∀j′,∀k,(13h)

η~d,k≥0,μ~d,k≥0,ε~d,k1≥0,ε~d,k2≥0,ζ~d,k≥0,ξ~d,k≥0,∀d,∀k,(13i)

where χ~d,k is a Lagrange multiplier associated with an equality constraint and is treated as an unrestricted multiplier consistent.

4.3 STAR-RIS Passive Beamforming

Given X=X∗, Th=Th∗, and α=α∗, problem (5) is reformulated as:

minΘ∑d∈𝒟∑k∈𝒦(TdhηPb|gs,dHΘahb,s|)s.t.(3b),(3e),(3f),(3k).(14a)

In order to deal with the non-convex problem, the SCA method is adopted. First, define the auxiliary variable ζd,k≥0, satisfying: ζd,k≥|gs,dHΘahb,s|. Second, we convert the quadratic term |gs,dHΘahb,s|2 as |νaHy|2, where y=diag(gs,dH)hb,s. Then we apply first-order taylor expansion to the quadratic term ζd,k2 and update the linearization point in each iteration. Specifically, the result ζd,k[t−1] from the previous iteration is employed as the current reference ζd,k[t]. (ζd,k[t])2+2ζd,k[t](ζd,k−ζd,k[t])≥νaHyyHνa. νa=vec(Θa), and ζd,k[t] denote the value at the tth iteration. Consequently, constraints (3f) and (3g) are reformulated as the following convex constraint:

∑a∈{r,t}νa2[(mc,mr)]≤1,(15)

where νa[(mc,mr)] is the (mc,mr)-th element of νaH. Since it is a quadratic function and the feasible set is convex, this constraint (15) is strictly convex, thereby ensuring the validity of the convex reformulation. Therefore, the convexified problem is written as:

minνa,ζ∑d∈𝒟∑k∈𝒦TdhηPbζd,k(16a)

s.t.νaHyyHνa≤(ζd,k[t])2+2ζd,k[t](ζd,k−ζd,k[t]), ∀d,k(15),(3b),(3k).(16b)

The lagrangian function L is defined as follows:

L(νa,ζ,γ,λ,μ,ξ)=∑d∈𝒟∑k∈𝒦(TdhηPbζd,k+λd,k(νaHyyHνa−(ζd,k[t])2−2ζd,k[t](ζd,k−ζd,k[t]))+γd,k(Ed,utr,k+Ed,btr,k+Ed,dcom,k−Eb,dh)+μd,k(∑a∈{r,t}νa2[(mc,mr)]−1)+ξd,k(Tdk−Tdk,max)),(17)

where γ, λ, μ, and ξ are the lagrangian multipliers associated with the four constraints of problem (16). The lagrangian dual function is written as D(γ,λ,μ,ξ)=minνa,ζL(νa,ζ,γ,λ,μ,ξ). The KKT conditions must be satisfied at optimality for both problem (16) and its dual (D(γ,λ,μ,ξ)), by virtue of the convexity of the primal problem. By setting ∂L∂νa=0 and ∂L∂ζ=0 and solving the resulting equations, the optimal solution νa∗ and ζ∗ are derived as νa∗=arg⁡minνaL(νa,ζ,γ,λ,μ,ξ) and ζ∗=arg⁡minζL(νa,ζ,γ,λ,μ,ξ). The lagrangian multipliers γ, λ, μ, ξ are updated according to:

γ[k+1]=[γ[k]+Δγ[k](Ed,utr,k+Ed,btr,k+Ed,dcom,k−Eb,dh)]+;(18)

λ[k+1]=[λ[k]+Δλ[k](νaHyyHνa−(ζd,k[t])2−2ζd,k[t](ζd,k−ζd,k[t]))]+;(19)

μ[k+1]=[μ[k]+Δμ[k](∑a∈{r,t}νa2[(mc,mr)]−1)]+;(20)

ξ[k+1]=[ξ[k]+Δξ[k](Tdk−Tdk,max)]+;(21)

The step size vector Δ2[k]=(Δγ[k],Δλ[k],Δμ[k],Δξ[k])T updates the lagrangian multipliers γ, λ, μ, and ξ in the k-th iteration, and is refreshed in each subsequent iteration. The non-negativity of multipliers is preserved by the projection operator [⋅]+=max(⋅,0).

Given the optimal variables νa∗ and ζ∗, the corresponding optimal STAR-RIS resource allocation decision is constructed as Θa∗=diag(νa∗)H. νa∗ is iteratively updated to ensure convergence to a stable point that satisfies all constraints.

5 Convergence and Complexity Analysis of DRL-SCA Algorithm

Algorithm 1 presents the workflow of the DRL–SCA framework, in which the caching decision is first obtained by the DRL module and then used as a fixed parameter to iteratively optimize the energy harvesting time, offloading decision, and STAR-RIS resource allocation decision. In what follows, we investigate the algorithm’s convergence and demonstrate that it can reach a locally suboptimal point within a finite number of processes. At last, computational complexity analysis.

images

5.1 Convergence Analysis

Lemma 1: Algorithm 1 generates a non-increasing objective sequence and converges to a stationary solution under the prescribed stopping criterion.

Proof: Let E(ν) denote the objective value at the ν-th outer iteration. The proposed algorithm follows a block coordinate descent (BCD) structure over X,Th,α,Θ.

Descent property of block updates: For fixed X, the EH-time and offloading subproblems are convex and are solved optimally, hence they do not increase the objective. For the STAR-RIS block, the SCA procedure solves a sequence of convexified subproblems that provide upper bounds on the original non-convex objective at the current iterate, which ensures a non-increasing objective across SCA iterations and thus across outer iterations. For the caching block, the TD3 agent is trained offline to yield a stable policy. In the online procedure, the caching decision Xν+1 is obtained by policy inference and then projected onto the feasible binary set under cache-size constraints. To ensure monotonicity in the outer loop, we apply an acceptance check: if the updated caching decision does not decrease the objective, we keep Xν+1=Xν. Therefore, the overall objective sequence satisfies E(ν+1)≤E(ν).

Lower boundedness: Since E(ν) represents total system energy consumption, it is non-negative, i.e., E(ν)≥0.

Convergence: Because E(ν) is non-increasing and lower bounded, it converges. Upon termination (when the relative improvement falls below a threshold), the algorithm reaches a stationary point of the BCD procedure. ◻

The above result demonstrates that the original problem 𝒫1 is decomposed into four subproblems and solved iteratively via block coordinate descent (BCD). At every iteration, the obtained solution serves as the starting point for the next one. Each block-wise update, encompassing caching decision optimization via DRL, energy harvesting time optimization, offloading decision optimization, and STAR-RIS resource allocation decision optimization, yields a non-increasing overall objective. Consequently, convergence to a stationary point (local suboptimum) is guaranteed, provided that the algorithm is terminated once the objective improvement becomes negligible or a maximum iteration limit is reached.

5.2 Computational Complexity Analysis

The complexity is analyzed by separating the offline TD3 training stage and the online alternating optimization stage.

Offline TD3 training: The TD3 caching agent is trained offline. The total training complexity is 𝒪(Nepi⋅Nstep⋅(Cactor+Ccritic)), where Nepi and Nstep denote the number of training episodes and steps per episode, and Cactor, Ccritic denote the computational cost of one forward/backward update of the actor/critic networks.

Online alternating optimization: At runtime, TD3 only performs policy inference (a forward pass) with complexity 𝒪(Cactor), followed by a feasible projection to obtain binary caching decisions. The EH-time and offloading subproblems are convex and can be solved by interior-point methods with complexity on the order of 𝒪(n3log⁡(1/ε)), where n denotes the number of decision variables and ε is the solver accuracy. The STAR-RIS block uses SCA with NSCA inner iterations; each iteration solves a convexified problem whose dimension scales with the number of STAR-RIS elements M=McMr, yielding complexity 𝒪(NSCA⋅nΘ3log⁡(1/ε)) with nΘ=𝒪(M).

The nEH=𝒪(D), nα=𝒪(DK), and nΘ=𝒪(McMr) represent the number of decision variables in the EH-time, offloading, and STAR-RIS subproblems, respectively, where D is the number of UDs and K is the number of tasks. Let Niter be the number of outer iterations. So, the overall online complexity is 𝒪(Niter(Cactor+nEH3log⁡1ε+nα3log⁡1ε+NSCA⋅nΘ3log⁡1ε)).

6 Performance Evaluation and Discussion

To assess the effectiveness of the proposed energy-efficient ASTAR-RIS and WPT-assisted task offloading and content caching framework for WSNs, we consider the following simulation setup. The UAV u, the ground BS b, and the STAR-RIS are placed at (0,0,20) m, (0,40,0) m, and (0,0,20) m, respectively. Four ground UDs are located at (−40,−40,0) m, (−40,40,0) m, (40,−40,0) m, and (40,40,0) m, forming a representative distributed sensor network layout [34].

The UAV is positioned at (0,0,20) m, which is the geometric center of the considered symmetric UD topology. This placement yields balanced large-scale path loss to the UDs and avoids favoring any particular user, thereby serving as a controlled baseline to evaluate the energy-efficiency gains brought by the proposed WPT-assisted ASTAR-RIS joint optimization. We emphasize that the purpose of fixing the hovering location is to isolate the benefits of WPT integration and joint resource optimization; varying the hovering point would affect the absolute energy values but not the qualitative insights or the role of each optimized component.

Furthermore, to highlight the benefits of the proposed framework, we benchmark it against several representative baseline schemes, namely no STAR-RIS, no energy harvesting (no EH), no caching, full offloading [35,36]. These baselines enable us to quantitatively assess the contribution of joint optimization of caching, energy harvesting, offloading, and STAR-RIS configuration to energy savings and system performance enhancement. The configurations of some simulation schemes are specified as follows.

• Proposed: In the ASTAR-RIS and WPT-assisted WSNs, the total energy consumption is reduced by performing a joint optimization over content caching decisions, energy harvesting time, task offloading decisions, and STAR-RIS resource allocation decisions.

• No EH: The no EH policy eliminates the optimization of energy harvesting time and instead relies on fixed power supplies at both local UDs and the UAV for task processing.

Fig. 3 illustrates the convergence of total system energy consumption vs. iteration count. The proposed scheme demonstrates rapid convergence, reaching the optimal value within merely four iterations, which confirms the consistent availability of the joint optimal solution (caching, energy harvesting, offloading, and STAR-RIS resource allocation). Moreover, it exhibits a slightly faster convergence speed and ultimately a lower total energy consumption compared to all benchmark schemes, verifying the energy efficiency superiority of our ASTAR-RIS and WPT-assisted framework for WSNs.

images

Figure 3: Total energy consumption vs the number of iterations.

Fig. 4 illustrates that the total energy consumption across all considered schemes increases significantly with the BS’s enhanced computational capacity. While a more powerful BS accelerates task processing, the unchanged offloading ratio leads to a substantial rise in energy consumption for remote centralized computing, thereby driving the overall increase. As a result, when the offloading proportion is fixed, the overall energy consumption grows with the increase in BS computation capacity. At lower BS capacities, our proposed joint optimization strategy, which integrates caching, energy harvesting, partial offloading, and STAR-RIS beamforming, shows a minor performance gap relative to benchmarks while consistently maintaining optimal system performance. As BS capacity grows, remote computing energy becomes the dominant contributor to total consumption. Under these circumstances, our proposed solution’s synergy of caching optimization, energy harvesting, and partial offloading effectively curbs the steep rise in remote centralized computing energy consumption, causing the performance gap with the “No Caching”, “No EH”, and “Full Offloading” schemes to widen significantly. By contrast, its advantages over the “no STAR-RIS” scheme remain almost unchanged, because the proposed STAR-RIS passive beamforming strategy mainly affects the transmission energy from UDs to the distant BS but does not affect the energy consumption of remote computing. Consequently, as the BS computation power increases, the gap between our solution and the “No STAR-RIS” benchmark remains nearly constant.

images

Figure 4: Total energy consumption vs the computation capacity of the BS.

Fig. 5 illustrates that, as the number of CPU cycles required to process one bit of task data increases, more computing resources are needed to handle tasks of a fixed size, which in turn leads to a marked rise in computation-related energy consumption. Under limited computing capability at the UAV and UDs, a larger portion of tasks is offloaded to the remote BS, thereby further increasing both long-distance transmission energy and centralized computing energy. As the offloaded tasks grow, network resources become progressively more constrained. Under such resource-constrained conditions, our proposed joint strategy proves effective in significantly curbing the associated energy costs of long-distance transmission and remote centralized computing compared to baseline schemes. Consequently, the energy efficiency gap between our solution and the baseline schemes widens progressively.

images

Figure 5: Total energy consumption vs a function of CPU cycles per Bit.

Fig. 6 depicts the convergence behavior of the DRL-caching agent’s average weighted reward under different learning rates. It can be observed that the agent attains fast convergence in all cases, with the best overall performance achieved when the learning rate is set to 0.0003. When the learning rate is too large, temporal-difference errors have an excessive influence on critic updates, which may undermine the stability of the actor’s policy improvement. In contrast, an overly small learning rate slows down the propagation of value estimates through the neural network, resulting in sluggish learning dynamics. These results highlight that a proper choice of learning rate is essential to strike a balance between convergence speed and training stability, and thus to obtain satisfactory performance.

images

Figure 6: Average cumulative weighted reward vs different learning rates of the caching DRL agent.

7 Conclusions

In this paper, we propose an energy-efficient ASTAR-RIS and WPT-assisted task offloading and content caching framework to address the problems that restrict UAV endurance, underutilized network caching and computing resources, and inefficient resource allocation in WSNs. In this framework, by integrating WPT for continuous energy harvesting and mounting STAR-RIS on the UAV, energy efficiency is optimized, extending UAV endurance and improving task processing efficiency. Furthermore, we construct a minimization problem that jointly optimizes content caching, energy harvesting time, task offloading, and STAR-RIS resource allocation decisions to minimize energy consumption. To address the non-convex problem of system energy consumption minimization, a joint DRL–SCA–based algorithm is designed, which iteratively achieves the solution and attains near-optimal performance with relatively low computational complexity. Simulation results show that the proposed framework substantially lowers the total energy consumption in WSNs while achieving a rapid convergence behavior.

As an important direction for future work, we will consider extending the proposed framework to jointly optimize the UAV hovering location together with WPT-enabled energy harvesting, caching, offloading, and STAR-RIS configuration to further improve system energy efficiency. Furthermore, regarding UD scalability, the proposed framework can accommodate a practical number of UDs. However, as the number of UDs increases, resource contention becomes more severe, and the energy and latency constraints may tighten, which calls for careful performance–complexity trade-offs. For large-scale deployments, hierarchical optimization and multi-UAV cooperative architectures are promising directions for future work. From a practical implementation perspective, the proposed framework is compatible with current UAV, WPT, and STAR-RIS hardware technologies. As another important direction for future work, we will conduct prototype-based evaluations and field trials to assess the performance under practical hardware limitations. The last promising direction is the integration of additional renewable energy sources, such as solar power, to create hybrid energy harvesting systems. Solar-powered UAVs can extend operational endurance during daytime missions, while RF-WPT provides consistent energy availability regardless of lighting conditions. The joint optimization of multi-source energy harvesting, along with caching, offloading, and STAR-RIS configuration, presents an exciting avenue for future research toward fully sustainable WSN deployments.

Acknowledgement: Not applicable.

Funding Statement: This research was funded by the National Social Science Foundation of China (22CGL017).

Author Contributions: The authors confirm contribution to the paper as follows: study conception and design: Xiaoping Yang; data collection: Junqi Long; analysis and interpretation of results: Songjie Yang and Xiaoping Yang; draft manuscript preparation: Xiaoping Yang, Quanzeng Wang and Bin Yang; visualization: Guochao Qi; project administration and funding acquisition: Xiaofang Cao. All authors reviewed and approved the final version of the manuscript.

Availability of Data and Materials: The data used to support the findings of this study are available from the corresponding author upon request.

Ethics Approval: Not applicable.

Conflicts of Interest: The authors declare no conflicts of interest.

References

1. Shen J, Wang A, Wang C, Hung PCK, Lai CF. An efficient centroid-based routing protocol for energy management in WSN-assisted IoT. IEEE Access. 2017;5:18469–79. doi:10.1109/access.2017.2749606. [Google Scholar] [CrossRef]

2. Chettri L, Bera R. A comprehensive survey on Internet of Things (IoT) toward 5G wireless systems. IEEE Internet Things J. 2020;7(1):16–32. doi:10.1109/jiot.2019.2948888. [Google Scholar] [CrossRef]

3. Ji J, Zhu K, Yi C, Niyato D. Energy consumption minimization in UAV-assisted mobile-edge computing systems: joint resource allocation and trajectory design. IEEE Internet Things J. 2021;8(10):8570–84. [Google Scholar]

4. Liao Z, Yin G, Tang X, Liu P. A cooperative community-based framework for service caching and task offloading in multi-access edge computing. IEEE Trans Netw Serv Manage. 2024;21(3):3224–35. doi:10.1109/tnsm.2024.3372295. [Google Scholar] [CrossRef]

5. Zhang K, Gui X, Ren D, Li D. Energy latency tradeoff for computation offloading in UAV-assisted multiaccess edge computing system. IEEE Internet Things J. 2021;8(8):6709–19. doi:10.1109/jiot.2020.2999063. [Google Scholar] [CrossRef]

6. Gao X, Zhu X, Zhai L. AoI-sensitive data collection in multi-UAV-assisted wireless sensor networks. IEEE Trans Wireless Commun. 2023;22(8):5185–97. doi:10.1109/twc.2022.3232366. [Google Scholar] [CrossRef]

7. Duo B, He M, Wu Q, Zhang Z. Joint dual-UAV trajectory and RIS design for ARIS-assisted aerial computing in IoT. IEEE Internet Things J. 2023;11(10):17249–63. doi:10.1109/jiot.2023.3288213. [Google Scholar] [CrossRef]

8. Xiao H, Hu X, Wang W, Su Z, Wong KK, Yang K. STAR-RIS and UAV combination in MEC networks: simultaneous task offloading and communications. IEEE Trans Commun. 2025;73(8):6169–84. [Google Scholar]

9. Wu C, You C, Liu Y, Gu X, Cai Y. Channel estimation for STAR-RIS-Aided wireless communication. IEEE Commun Lett. 2022;26(3):652–6. doi:10.1109/lcomm.2021.3139198. [Google Scholar] [CrossRef]

10. Xiao H, Hu X, Mu P, Wang W, Zheng TX, Wong KK, et al. Simultaneously transmitting and reflecting RIS (STAR-RIS) assisted multi-antenna covert communication: analysis and optimization. IEEE Trans Wireless Commun. 2024;23(6):6438–52. doi:10.1109/twc.2023.3331706. [Google Scholar] [CrossRef]

11. Aung PS, Nguyen LX, Tun YK, Han Z, Hong CS. Aerial STAR-RIS empowered MEC: a DRL approach for energy minimization. IEEE Wireless Commun Lett. 2024;13(5):1409–13. [Google Scholar]

12. Singh CK, Kumar D, ki Lehtom J, Khan Z, Latva-Aho M, Upadhyay PK. Robust UAV-integrated active STAR-RIS RSMA networks: analysis with deep learning techniques. IEEE Trans Veh Technol. 2025;74(5):8297–302. [Google Scholar]

13. Zhai Z, Dai X, Duo B, Wang X, Yuan X. Energy-efficient UAV-mounted RIS assisted mobile edge computing. IEEE Wireless Commun Lett. 2022;11(12):2507–11. doi:10.1109/lwc.2022.3206587. [Google Scholar] [CrossRef]

14. Ye Y, Shi L, Chu X, Hu RQ, Lu G. Resource allocation in backscatter-assisted wireless powered MEC networks with limited MEC computation capacity. IEEE Trans Wireless Commun. 2022;21(12):10678–94. doi:10.1109/twc.2022.3185825. [Google Scholar] [CrossRef]

15. Li J, Dai M, Su Z. Energy-aware task offloading in the Internet of Things. IEEE Wireless Commun. 2020;27(5):112–7. doi:10.1109/mwc.001.1900495. [Google Scholar] [CrossRef]

16. Xu Y, Zhang T, Liu Y, Yang D, Xiao L, Tao M. UAV-assisted MEC networks with aerial and ground cooperation. IEEE Trans Wireless Commun. 2021;20(12):7712–27. doi:10.1109/twc.2021.3086521. [Google Scholar] [CrossRef]

17. Zhou R, Wu X, Tan H, Zhang R. Two time-scale joint service caching and task offloading for UAV-assisted mobile edge computing. In: Proceedings of the IEEE INFOCOM 2022—IEEE Conference on Computer Communications; 2022 May 2–5; London, UK. [Google Scholar]

18. Yang Z, Chen M, Liu X, Liu Y, Chen Y, Cui S, et al. AI-driven UAV-NOMA-MEC in next generation wireless networks. IEEE Wireless Commun. 2021;28(5):66–73. doi:10.1109/mwc.121.2100058. [Google Scholar] [CrossRef]

19. Liu Z, Li Z, Wen M, Gong Y, Wu YC. STAR-RIS-aided mobile edge computing: computation rate maximization with binary amplitude coefficients. IEEE Trans Commun. 2023;71(7):4313–27. doi:10.1109/tcomm.2023.3274137. [Google Scholar] [CrossRef]

20. Eghbali Y, Mohammadisarab A, Zarini H, Mili MR, Basar E, Renzo MD, et al. Integrated sensing and communication for STAR-RIS-Aided UAV networks. IEEE Trans Veh Technol. 2025;74(7):11638–43. doi:10.1109/tvt.2025.3546544. [Google Scholar] [CrossRef]

21. Chaudhary S, Nehra A, Budhiraja I, Chaudhary R, Bansal A. STAR-RIS based resource scheduling and mode selection for drone assisted 5G communications. In: Proceedings of the IEEE INFOCOM 2024 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS); 2024 May 20–24; Vancouver, BC, Canada. [Google Scholar]

22. Yang D, Li B, Niyato D. Energy-aware task offloading for rotatable STAR-RIS-enhanced mobile edge computing systems. IEEE Internet Things J. 2025;12(12):20239–50. doi:10.1109/jiot.2025.3542463. [Google Scholar] [CrossRef]

23. Li B, Yang D, Liu L, Niyato D. Aerial RIS-enhanced communications: joint UAV trajectory, altitude control, and phase shift design. IEEE Trans Wireless Commun. 2025;25:5830–45. [Google Scholar]

24. Budhiraja I, Vishnoi V, Kumar N, Garg D, Tyagi S. Energy-efficient optimization scheme for ris-assisted communication underlaying UAV with NOMA. In: Proceedings of the ICC 2022 IEEE International Conference on Communications; 2022 May 16–20; Seoul, Republic of Korea. [Google Scholar]

25. Mu X, Liu Y, Guo L, Lin J, Schober R. Simultaneously transmitting and reflecting (STAR) RIS aided wireless communications. IEEE Trans Wireless Commun. 2022;21(5):3083–98. doi:10.1109/twc.2021.3118225. [Google Scholar] [CrossRef]

26. Zhou F, Hu RQ. Computation efficiency maximization in wireless-powered mobile edge computing networks. IEEE Trans Wireless Commun. 2020;19(5):3170–84. [Google Scholar]

27. Huang L, Bi S, Zhang YJA. Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks. IEEE Trans Mob Comput. 2020;19(11):2581–93. doi:10.1109/tmc.2019.2928811. [Google Scholar] [CrossRef]

28. Yang X, Wang Q, Yang B, Cao X. Energy-efficient aerial STAR-RIS-aided computing offloading and content caching for wireless sensor networks. Sensors. 2025;25(2):393. doi:10.3390/s25020393. [Google Scholar] [PubMed] [CrossRef]

29. Luo Y, Ding W, Zhang B. Optimization of task scheduling and dynamic service strategy for multi-UAV-enabled mobile-edge computing system. IEEE Trans Cogn Commun Netw. 2021;7(3):970–84. doi:10.1109/tccn.2021.3051947. [Google Scholar] [CrossRef]

30. Alotaibi J, Oubbati OS, Atiquzzaman M, Alromithy F, Altimania MR. Optimizing disaster response with UAV-mounted RIS and HAP-enabled edge computing in 6G networks. J Netw Comput Appl. 2025;241(4):104213. doi:10.1016/j.jnca.2025.104213. [Google Scholar] [CrossRef]

31. Chen J, Xing H, Xiao Z, Xu L, Tao T. A DRL agent for jointly optimizing computation offloading and resource allocation in MEC. IEEE Internet Things J. 2021;8(24):17508–24. doi:10.1109/jiot.2021.3081694. [Google Scholar] [CrossRef]

32. Xu J, Liu Y, Mu X, Dobre OA. STAR-RISs: simultaneous transmitting and reflecting reconfigurable intelligent surfaces. IEEE Commun Lett Sep. 2021;25(9):3134–8. doi:10.1109/lcomm.2021.3082214. [Google Scholar] [CrossRef]

33. Ameur AI, Oubbati OS, Rachedi A, Arishi A, Atiquzzaman M. Intelligent UAV caching and energy management in 6G networks. IEEE Trans Netw Sci Eng. 2026;13:3175–92. doi:10.1109/tnse.2025.3628171. [Google Scholar] [CrossRef]

34. Su Y, Pang X, Lu W, Zhao N, Wang X, Nallanathan A. Joint location and beamforming optimization for STAR-RIS aided NOMA-UAV networks. IEEE Trans Veh Technol. 2023;72(8):11023–8. doi:10.1109/tvt.2023.3261324. [Google Scholar] [CrossRef]

35. Lin N, Bai L, Hawbani A, Guan Y, Mao C, Liu Z, et al. Deep-reinforcement-learning-based computation offloading for servicing dynamic demand in multi-UAV-assisted IoT network. IEEE Internet Things J. 2024;11(10):17249–63. doi:10.1109/jiot.2024.3356725. [Google Scholar] [CrossRef]

36. Zhang Q, Zhao Y, Li H, Hou S, Song Z. Joint optimization of STAR-RIS assisted UAV communication systems. IEEE Wireless Commun Lett. 2022;11(11):2390–4. doi:10.1109/lwc.2022.3204353. [Google Scholar] [CrossRef]

Cite This Article

APA Style

Yang, X., Yang, S., Long, J., Wang, Q., Yang, B. et al. (2026). Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs. Computers, Materials & Continua, 88(1), 21. https://doi.org/10.32604/cmc.2026.078105

Vancouver Style

Yang X, Yang S, Long J, Wang Q, Yang B, Cao X, et al. Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs. Comput Mater Contin. 2026;88(1):21. https://doi.org/10.32604/cmc.2026.078105

IEEE Style

X. Yang et al., “Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs,” Comput. Mater. Contin., vol. 88, no. 1, pp. 21, 2026. https://doi.org/10.32604/cmc.2026.078105

BibTex EndNote RIS

Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Energy-Efficient ASTAR-RIS and WPT-Assisted Task Offloading and Content Caching for WSNs

Abstract

Keywords

References

Cite This Article

597

184

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link