Open Access
ARTICLE
Heterogeneous Community Surveillance–Driven Physics-Informed Reformulation of Fine-Scale Convection–Diffusion Air Pollution Distribution
1 College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
2 Department of Mathematics, Government College University, Faisalabad, Pakistan
* Corresponding Author: Saima Rashid. Email:
Computer Modeling in Engineering & Sciences 2026, 147(1), 27 https://doi.org/10.32604/cmes.2026.076957
Received 16 October 2025; Accepted 20 March 2026; Issue published 27 April 2026
Abstract
Air pollution poses a serious public health threat in developing countries such as Pakistan, where rapid urbanization and industrialization have intensified atmospheric contamination. Although mobile sensing deployed on public transportation expands spatial coverage beyond fixed monitoring stations, accurate high-resolution pollution mapping remains constrained by sparse observations, computational burden, neglected pollutant interactions, and limited interpretability. To address these challenges, this study proposes a unified physics-informed deep learning framework for fine-grained air pollution map reconstruction and joint multi-pollutant estimation. The framework integrates mobile and stationary monitoring data with atmospheric dispersion principles to enhance physical consistency under limited observational coverage. A physics-guided air pollution map reconstruction model (Phy-APMR) is developed to jointly estimate ozone (O3), particulate matter (PMKeywords
Air quality has become a critical factor in efforts to enhance living conditions in urban environments. Consequently, many municipalities have introduced policies aimed at reducing pollution originating from industrial activities and road traffic. Numerous studies have demonstrated that air quality has a direct impact on human health, being strongly associated with serious respiratory and cardiovascular diseases such as asthma, lung cancer, stroke, and lower respiratory infections [1]. The World Health Organization (WHO) estimates that around seven million premature deaths occur annually due to air pollution, with 91% of the global population living in areas that do not meet air quality guidelines [2]. Given these severe health impacts, fine-grained air quality monitoring has become a major global priority. Access to block-level pollution data enables urban residents to better plan outdoor activities and reduce exposure risks, while for municipal authorities, accurate detection of airborne pollutants supports effective air quality control and more efficient city management [3]. Air quality predictions assisted by mobility data have been shown to reduce health impacts from pollution [4].
Urban air pollution is currently monitored using both stationary sensors and mobile devices. Although official monitoring stations provide highly accurate data and are widely used for routine assessments in many countries, they are costly, bulky, and resource-intensive. Consequently, their deployment density is insufficient to capture fine-grained pollution patterns. Over the past decade, mobile sensing platforms have emerged as a complementary approach [5–7]. These systems employ low-cost sensors carried by humans or vehicles, enabling broader coverage and finer spatial resolution. While mobility-based monitoring improves the ability to capture localized variations, the collected samples cannot fully represent the entire spatiotemporal domain. Thus, inference algorithms remain essential for reconstructing fine-grained pollution fields and providing a comprehensive view of the urban environment.
Deterministic frameworks representing physical and chemical processes have long underpinned conventional air quality modeling [8,9]. In parallel, the widespread adoption of Smart City initiatives has driven a rapid expansion in the deployment of sensors and monitoring networks, with air quality monitoring emerging as one of the most prominent applications in these systems [10]. Recent studies increasingly exploit data collected in smart cities for air quality monitoring, particularly through the use of machine learning techniques to predict the spatiotemporal evolution of air pollution.
Traditional regression models, such as Linear Regression and Support Vector Machines, as well as classification approaches including Decision Trees and Random Forests, are progressively being superseded by deep learning methods based on neural networks (NNs). Among these, physics-informed NNs (PINNs) have gained considerable attention due to their ability to integrate observational data with governing physical laws [11,12]. Unlike convolutional NNs (CNNs), which are designed for regularly structured data, PINNs incorporate physics-based constraints directly into the learning process, enabling physically consistent modeling of complex spatial and temporal dynamics. This property is particularly advantageous for urban air quality monitoring, where pollution processes are governed by atmospheric transport mechanisms and heterogeneous emission sources. By embedding physical laws into the loss function, PINN-based models have demonstrated improved predictive accuracy and enhanced physical consistency in air pollution forecasting [13,14].
However, physics-based simulations and PINN frameworks can be computationally expensive and often require substantial prior knowledge, such as accurate initial conditions and emission inventories. To alleviate these limitations, low-cost air quality sensors mounted on mobile carriers, including vehicles and pedestrians, are increasingly deployed to extend monitoring coverage and improve spatial resolution at a manageable cost [15,16]. Concurrently, data-driven deep learning (DL) approaches have gained popularity due to the growing availability of empirical air quality data [17,18]. Despite their strong predictive capability, DL models are frequently criticized as “black boxes” because of their limited interpretability and weak linkage to physical mechanisms. Moreover, the continuous movement of mobile sensors results in uneven spatiotemporal data distributions, making it difficult to capture complete neighborhood observations. As a result, accurately modeling spatiotemporal dependencies and generating reliable pollution visualizations remains a significant challenge [19].
In the present problem, two major challenges are evident:
(i) Despite regulatory efforts, PM2.5 and PM10 levels in Pakistan still frequently exceed WHO health standards, especially in urban and industrial regions.
(ii) Evidence suggests rising O3 concentrations in major cities due to vehicular emissions and industrial activities.
(iii) Accurate spatiotemporal mapping of O3, PM2.5 and PM10 is essential for targeted pollution control policies.
(iv) Modeling challenges include:
• Irregular data from mobile sensing: Unevenly distributed air quality measurements complicate capturing spatiotemporal dependencies.
• Non-interpretability of DL techniques: Traditional DL models without physical grounding often fail to extrapolate reliably.
(v) Phy-APMR innovations are as follows:
• Categorizes inputs based on atmospheric knowledge.
• Uses tanh-based self-attention to model differential input impact.
• Deep interaction module captures complex physicochemical pollutant interactions.
• Hierarchical physics-constrained loss improves model robustness.
(vi) Adaptive short-time update sampling (ASUS) allows analysis of predictor contributions and identification of key pollution drivers.
(vii) Spatiotemporal distributions of O3, PM2.5 and PM10 can be effectively mapped across Pakistan, supporting reliable policy-making and environmental management.
Fine-grained Phy-APMR therefore remains essential, as many regions are still unmonitored despite the broader coverage enabled by mobile sensors. As illustrated in Fig. 1, wireless sensors mounted on automobiles collect atmospheric contaminant measurements, which are then used to develop a reconstruction framework that estimates pollutant concentrations in unobserved areas. In recent years, the design of highly efficient reconstruction algorithms has become a central focus of research.

Figure 1: A demonstration of the fine-grained Phy-APMR procedure. The reconstruction model predicts pollution levels and generates the corresponding visualization, while mobile sensors enhance sensing coverage.
Reconstruction algorithms are generally developed using two principal methodologies: data-driven strategies and physics-inspired approaches [20]. Physics-based methods typically rely on theoretical diffusion models [21] to estimate air quality; however, they often perform poorly because they cannot fully capture the complex dynamics of real-world pollution forecasting. In contrast, evidence-based techniques, such as probabilistic regression with Gaussian processes [22] and NNs [23,24], aim to learn diffusion patterns directly from observed data. When sufficient observations are available, data-driven methods particularly those based on DL have demonstrated clear advantages over physics-informed approaches. Nonetheless, data-driven methods still face two major limitations:
(

Figure 2: A representation of the limited availability of mobile sensing data across spatiotemporal domains. The green backdrop indicates that, during daytime, urban sensing vehicles tend to concentrate in busy areas of the city (such as central and commercial zones). The light-pink backdrop highlights this tendency, which leads to reduced sensing coverage in other areas and times (such as residential neighborhoods and nighttime).
(
To address the challenges of accurately estimating air pollutant concentrations in Pakistan during 2023–2024, this study proposes a hybrid framework that integrates DL with physics-informed modeling. The framework consists of the Phy-APMR reconstruction model, supported by the ASUS module for efficient training and updates. In Phy-APMR, model inputs are categorized based on prior atmospheric knowledge. A tanh-based self-attention mechanism dynamically adjusts the contribution of each input to different pollutants, while a deep interaction module captures complex physicochemical interactions. A physics-constrained loss function ensures predictions are consistent with physical laws.
Phy-APMR is designed for reliable reconstruction of air quality in regions with sparse or missing data. In areas with sensors, the model aligns predictions with observed data, while in unmonitored regions, it enforces physical principles governing pollutant dispersion. Its architecture integrates a NN for inferential computation and a PINN to encode domain knowledge, typically expressed through partial differential equations (PDEs), ensuring robust reconstruction even in data-scarce locations.
To improve training efficiency, the ASUS module selects representative collocation points through efficiency-driven sampling, coverage-oriented sampling, and periodic temporal updates. This strategy accelerates convergence, reduces computational costs, and maintains high inference accuracy. The proposed framework was validated using two datasets. First, a large-scale deployment dataset collected between 2023 and 2024 from 78 fixed detectors across Lahore, Faisalabad, and Karachi, comprising over 15 million observations, was used for model development and large-scale performance analysis. Second, a controlled 10-day three-city evaluation campaign involving approximately 70 active detectors and a substantially smaller sample subset was conducted to assess short-term generalization and convergence behavior. Across both settings, Phy-APMR outperforms state-of-the-art benchmarks by 16% in predictive accuracy. In addition, ASUS reduces training completion time by 84% relative to baseline sampling strategies, without compromising reliability.
The core contributions of this framework are summarized below:
(i) Phy-APMR as the first hybrid PINN-based technique for reconstructing ambient air quality across spatially and temporally sparse regions in Pakistan.
(ii) An enhanced Phy-APMR version incorporating wind speed and additional ecological variables, improving reconstruction accuracy.
(iii) The ASUS methodology, which accelerates training while preserving robustness.
(iv) Comprehensive evaluation across three major metropolitan areas in Pakistan, demonstrating significant improvements in both accuracy and efficiency.
Together, the Phy-APMR + ASUS framework provides a robust, interpretable, and efficient solution for simultaneous estimation and reconstruction of multiple air pollutants in Pakistan, offering actionable insights for coordinated air pollution control and environmental management during 2023–2024.
The datasets employed in this study cover in-situ measurements, outputs from remote sensing devices, and reanalysis data across Pakistan. To harmonize these datasets, a standard grid covering the study region was established with a spatial resolution of
Real-time ground-based measurements of O3, PM2.5, and PM10 from 2023 to 2024 were collected from 1194 monitoring sites across Pakistan (Fig. 3). These sites are operated by the Pakistan Environmental Protection Agency (Pak-EPA) and affiliated provincial monitoring networks. The mean daily maximum 8-h average O3 concentrations and daily mean PM2.5 and PM10 concentrations were computed after removing invalid or anomalous readings caused by instrument errors. For grid cells containing multiple monitoring sites, the average of all sites within the cell was used to represent the cell-level value.

Figure 3: Distribution of in-situ air quality monitoring sites across Pakistan.
2.2 Geospatial Remote Sensing Data
Satellite-derived data from the Tropospheric Monitoring Instrument (TROPOMI) and the Moderate Resolution Imaging Spectroradiometer (MODIS) spanning 2023–2024 were used to provide essential information for joint air-quality estimation across Pakistan. TROPOMI offers advanced atmospheric detection capabilities with near-global coverage at a spatial resolution of 3.5 km2
Aerosol Optical Depth (AOD) is a key proxy for PM2.5 and PM10 concentrations. MODIS-derived daily 1 km AOD products, generated using the Multi-Angle Implementation of Atmospheric Correction (MAIAC) algorithm, were obtained from NASA (https://ladsweb.modaps.eosdis.nasa.gov/). MAIAC AOD values greater than 3 were excluded based on the reported valid range (Khan et al. [25]). Missing AOD values due to cloud contamination were reconstructed following the approach detailed in our previous work (Bilal et al. [26]), with full methodology provided in the Supporting Information. These AOD datasets were similarly resampled to 0.05° resolution to maintain consistency with the TROPOMI products and the standard grid.
2.3 Reconstructed Climate/Air-Quality Data
For 2023–2024, meteorological and radiation data relevant to O3 and particulate matter, with a spatial resolution of 0.25°, were obtained from the fifth-generation European Centre for Medium-Range Weather Forecasts (ECMWF) reanalysis (ERA5, https://cds.climate.copernicus.eu/) to complement measurements collected by wireless sensor vehicles across Pakistan. The variables include hourly boundary layer height (BLH), relative humidity (RH), surface pressure (SP), surface net solar radiation (SSR), surface net thermal radiation (STR), downward UV radiation at the surface (UVB), 2-m temperature (T2M), 10-m U wind (U10), 10-m V wind (V10), and surface O3 mass mixing ratio (GO3).
In addition, atmospheric composition data from the Copernicus Atmosphere Monitoring Service (CAMS, https://ads.atmosphere.copernicus.eu/), with a spatial resolution of 0.75° and a temporal resolution of 3 h, were collected. These datasets include surface concentrations of CO (CAMS_CO), ethane (CAMS_C2H6), formaldehyde (CAMS_HCHO), hydrogen peroxide (CAMS_H2O2), hydroxyl radical (CAMS_OH), isoprene (CAMS_C5H8), nitrogen dioxide (CAMS_NO2), nitrogen monoxide (CAMS_NO), peroxyacetyl nitrate (CAMS_PAN), propane (CAMS_C3H8), sulfur dioxide (CAMS_SO2), aerosol optical depths for black carbon (CAMS_BCAOD), dust (CAMS_DUAOD), sulfate (CAMS_SUAOD), and sea salt (CAMS_SSAOD), and surface PM1, PM2.5, and PM10 concentrations.
All reanalysis and CAMS datasets were averaged to daily means and resampled to a 0.05° spatial resolution using bilinear interpolation to align with the standard grid used for the wireless sensor vehicles. This harmonization ensures consistency across datasets, enabling robust joint modeling and comprehensive spatial analysis of air quality.
Based on known relationships among O3, PM2.5, and PM10, the collected datasets for 2023–2024 in Pakistan were classified into four factor categories: O3-independent factors, PM-independent factors, shared factors, and interacting factors. Radiation strongly influences O3 formation by driving photochemical reactions, while it has minimal direct impact on PM generation (see Liu et al. [27]). Accordingly, O3-independent factors included radiation variables (SSR, STR, UVB), reanalyzed CAMS_GO3, and satellite-derived S5P_O3.
PM-independent factors consisted of AOD-related data, which represent the atmospheric extinction from surface to space, including MODIS AOD, S5P_AAI, and reanalyzed aerosol components (BCAOD, DUAOD, SUAOD, SSAOD), as well as CAMS surface PM1, PM2.5, and PM10. Shared factors comprised meteorological variables (U10, V10, T2M, RH, BLH, SP) and spatiotemporal indicators (longitude, latitude, and day of the year), which are commonly used in models for both O3 and PM and significantly influence pollutant transport and formation (see Souza et al. [28]).
Interacting factors included various precursor gases (S5P_HCHO, S5P_NO2, S5P_CO, CAMS_CO, CAMS_C2H6, CAMS_HCHO, CAMS_C5H8, CAMS_NO2, CAMS_NO, CAMS_PAN, CAMS_C3H8, CAMS_SO2) and oxidants (CAMS_H2O2, CAMS_OH), reflecting the chemical interactions that affect the formation and transformation of O3 and PM in the atmosphere (see Ma et al. [29]).
3.2 Air Pollution Mapping Model
The fine-grained Phy-APMR approach for Pakistan was developed to jointly estimate O3, PM2.5, and PM10 concentrations (Fig. 4). The categorized factors were initially transformed into high-dimensional abstract representations through a feature encoder. To address the data sparsity issue, an ASUS strategy was integrated with a cascade architecture consisting of an Attention module and an Interaction module, enabling the model to capture complex interactions among meteorological conditions, pollutant precursors, and air quality levels. Leveraging prior physical knowledge, a hierarchical physics-constrained loss function was applied to guide the estimations. Finally, the model outputs were interpreted to provide insights into the contributions of each predictor, supporting transparent and explainable air quality mapping across Pakistan.

Figure 4: Schematic diagram of the proposed fine-grained Phy-APMR model.
The deep NN-based FE was developed to transform the categorized factors into high-dimensional abstract representations. A typical deep NN comprises an input layer, multiple hidden layers, and an output layer, with neurons transmitting information through dense connections and performing nonlinear transformations via activation functions. Due to their strong representation learning capabilities and adaptability, deep NNs have achieved considerable success across numerous domains, including wireless sensor vehicles and related mobile sensing applications [30].
Since the primary goal of the FE was to map inputs into high-dimensional feature spaces, the standard including wireless sensor vehicles and related mobile sensing applications output layer was removed. In the fine-grained Phy-APMR framework, we implemented four feature encoders with identical architectures: FE(PM), FE(O3), FE(Share), and FE(Int). The computations in each layer can be expressed as:
where X denotes the input,
Each FE used three consecutive hidden layers with 512 neurons per layer, a configuration selected via grid search (see Supporting Information (
O3, PM2.5 and PM10 concentrations in Pakistan are strongly influenced by meteorological factors that vary across both space and time. Consequently, meteorological variables such as U10, V10, T2M, RH, planetary BLH, and SP along with spatiotemporal indicators like longitude, latitude, and day of year are typically incorporated into conventional separate estimation models.
In the attention module of our model, the shared features
Low wind speeds hinder the dispersion and dilution of both ozone and particulate matter, though their effects on pollutant accumulation can vary with local conditions [32]. Therefore, it is essential for the model to account for the differing magnitudes and directions of influence that shared features
The proposed self-attention (SA) mechanism, which has been successfully applied in various remote sensing and mobile sensing tasks, exhibits strong potential for adaptively identifying the relative importance of features under different conditions [33]. To account for both positive and negative contributions of the factors, the tanh activation function was employed in our model instead of the conventional softmax function. The mechanism can be expressed as:
where
Using this SA mechanism, the shared features
In addition to meteorological factors that influence O3, PM2.5 and PM10 concentrations across space and time in Pakistan, various atmospheric components play critical roles in the generation and transformation of these pollutants. During complex atmospheric chemical and physical processes, these components interact and react with one another, leading to temporal and spatial variations in
For instance, chemical precursors such as nitrogen oxides (NOx) and volatile organic compounds (VOCs) undergo photochemical reactions that deplete their concentrations while driving ozone formation [26]. Simultaneously, PM is generated through the oxidation of NOx, VOCs, and sulfur dioxide (SO2) [27]. These particles can scatter or absorb solar radiation, modifying the amount of sunlight reaching the surface and the lower atmosphere, which in turn affects the photochemical reactions responsible for
Carbon monoxide (CO), another ozone precursor, also contributes to PM formation and can alter the balance of other precursors and atmospheric oxidants, thereby influencing both
Building on the first-level estimations, the interaction module was designed to integrate prior physical knowledge, thereby reducing estimation biases and enhancing model performance in Pakistan. The outputs of the final hidden layers of Net(O3) and Net(PM) contain rich information about O3, PM2.5 and PM10. These outputs were first concatenated with the abstract feature
The loss function quantifies the discrepancy between predicted and observed pollutant concentrations. By minimizing this loss, the model’s weights and parameters are optimized through backpropagation. The base loss function for joint estimation is the weighted sum of the mean squared errors (MSE) across all tasks:
where
where
According to the physical definitions, PM2.5 refers to particles with aerodynamic diameters below 2.5
Additionally, relying solely on final-level outputs for the loss can make the model susceptible to noise and local minima, potentially degrading performance. To provide richer supervision, the first-level outputs from the attention module were also included as an early-stage evaluation. The final hierarchical physics-constrained loss function is expressed as:
where
An Adam optimizer with an initial learning rate of 0.01 was used for backpropagation. Training proceeded for 1000 epochs, with the learning rate halved if the validation loss did not improve for 50 consecutive epochs.
For Pakistan during 2023–2024, the analysis-based framework enables accurate and interpretable modeling of metropolitan O3, PM2.5, and PM10 concentrations through the integration of data-driven learning with physics-informed constraints.
4.1 Research Challenges and Objectives
In terms of spatiotemporal standpoint, the density of a particular contamination in metropolitan areas is represented by
where
Optimizing the accuracy of the model at the present moment
where the squared error between the real and expected levels of pollutants is represented by the loss function. Meanwhile, the training data
4.2 Framework of the Phy-APMR Approach
The Phy-APMR approach is presented in this part of the article. A physical framework, which was developed using PDE to represent the processes of contaminants in the air transmission in a metropolitan area, is presented in Section 4.2.1. The complete design of Phy-APMR is then described in Section 4.2.2.
4.2.1 Physical Dynamics of Contaminant Dispersion
A PDE that describes the criteria of environmental pollution dispersal must be implemented in order to build a PINN-supported algorithm framework for air contamination visualization regeneration. Its diffusion formula, which characterizes the gigantic mechanics brought about by the Brownian motion of small fragments, is used in this investigation. Numerous fields, including structural study, computer science, and biology, depend significantly on the dispersion concept. It is represented in its generic state as
where
Three fundamental presumptions are introduced in order to modify the generic diffusing formulation for the environmental contamination image restoration issue in metropolitan environments:
Horizontal dispersion assumption: To address the challenge of reconstructing high-precision ambient pollutant concentration fields, this study formulates the problem within a two-dimensional horizontal framework. The modeling approach emphasizes horizontal dispersion driven by wind-induced advection and turbulent diffusion within the atmospheric boundary layer, while vertical transport processes are not explicitly resolved.
For PM2.5 and PM10, this assumption is physically justified by the dominance of near-surface sources and the comparatively limited vertical displacement of particles due to gravitational settling and surface deposition. Under typical urban boundary-layer conditions, PM concentrations exhibit strong horizontal variability and relatively weak vertical gradients within the lowest tens of meters, particularly at spatial resolutions exceeding several hundred meters.
In contrast, O3 is a secondary pollutant with stronger vertical mixing and photochemical production throughout the boundary layer. However, at the adopted horizontal mesh resolution of 500 m which substantially exceeds characteristic near-surface vertical fluctuation scales (
Accordingly, the spatial domain is defined as
where
To preserve computational efficiency and model interpretability, detailed vertical processes such as boundary-layer height variability, chemical transformation rates, and surface–atmosphere exchange fluxes are not explicitly modeled, although their inclusion could further enhance reconstruction accuracy [20].
Transparent boundary and initial conditions: To make the PDE implementation fully auditable, the boundary and initial conditions are explicitly specified as follows:
(i) Type: Dirichlet boundary conditions are applied for pollutant concentrations at the urban–rural interface, while initial conditions are fixed using the first available hourly measurements at all reference sites within the domain.
(ii) Mathematical form: For each pollutant
where
where
(iii) Enforcement in the loss function: These conditions are incorporated via boundary collocation sampling and a soft penalty term in the loss:
where
This approach is fully auditable and implementation-ready. The Dirichlet boundaries ensure mass conservation while permitting realistic advection and diffusion across the urban–rural interface. The perturbation term
Homogeneous diffusivity factor: Atmospheric pollutant diffusion is influenced by multiple external variables, including humidity, thermal stratification, wind shear, and surface roughness, all of which can modulate the effective eddy diffusivity
Explicitly modeling spatially and temporally varying diffusivity fields would significantly increase the complexity of the PINN optimization and may lead to identifiability and stability issues, particularly under sparse observational coverage. Therefore, for computational feasibility and model robustness, this study assumes a homogeneous effective diffusivity for ambient pollutant transport, expressed as
where
From a physical perspective, this assumption reflects the scale separation between the resolved horizontal grid size (500 m) and smaller-scale turbulent fluctuations, under which subgrid-scale mixing processes are reasonably approximated by a uniform diffusion coefficient. For PM2.5 and PM10, this effective diffusivity encapsulates the combined influence of turbulent dispersion and near-surface mixing, while for O3, it represents the net impact of boundary-layer turbulence on horizontally averaged concentrations near the surface.
Under these assumptions, the diffusion formulation in (4) reduces to
Here, the superscripts ′ and ″ denote first- and second-order partial derivatives with respect to the corresponding variables.
4.2.2 Architectural Process of of Phy-APMR
A NN and a physics-informed network constitute the two primary components of the PINN architecture, as illustrated in Fig. 5. The physics-informed network explicitly enforces the governing PDEs described in Section 4.2.1, which model the physical processes of horizontal advection, turbulent diffusion, and source-sink dynamics of pollutants such as O3 and PM2.5/PM10 within the urban atmospheric boundary layer (see [35]).

Figure 5: A representation showing the layout of Phy-APMR: (i) The NN module, composed of two outcome neurons,
The NN module approximates the pollutant concentration field as a continuous function over space and time, while the physics-informed network constrains this approximation to satisfy the underlying physics, ensuring that the reconstructed fields adhere to mass conservation, diffusion limits, and boundary conditions consistent with urban–rural interfaces.
The details of the NN architecture, the physics-informed network formulation, and the training procedure including loss evaluation, automatic differentiation of the PDE residuals, and hyperparameter tuning are presented in the following sections of this paper.
The initial component of the architecture is a fully connected NN with three input neurons representing the spatiotemporal coordinates
To capture the complex, non-linear relationships inherent in urban-scale pollutant transport including advection, horizontal diffusion, and source-sink interactions, we integrate an “attention-based module” that dynamically weights contributions from different spatial regions, allowing the network to prioritize areas of high variability or strong pollutant fluxes. Complementarily, an “interaction-based module” is employed to explicitly model cross-couplings between pollutant species and between urban and surrounding rural zones, reflecting the physical interplay of emissions, atmospheric mixing, and boundary-layer processes.
The sine (
4.2.4 Physics-Informed Network
The physics-informed network (PINN) is a specialized neural network whose structure and evaluation are explicitly governed by the underlying PDEs representing pollutant transport. Unlike a fully connected NN module, the physics-informed network does not introduce additional trainable parameters. Its primary purpose is to compute the physical residual, which quantifies the discrepancy between the network predictions
To ensure that the auxiliary source term
where
•
•
•
•
This regularization applies specifically to the auxiliary source term
The PDE residual, boundary/initial condition penalties, and auxiliary source regularization are combined to form the total training loss for the PINN:
During training, the physical residual acts as an additional constraint, effectively enforcing adherence of the network to the governing PDEs at each spatiotemporal location. In this way, the physics-informed network serves as a regularizing mechanism, ensuring that predicted pollutant fields remain physically consistent even in regions with sparse or missing observational data.
Specifically, the physics-informed residual is defined as:
Here,
Physically, this formulation incorporates multiple constraints simultaneously: horizontal pollutant diffusion is captured via the second-order spatial derivatives, local emissions or auxiliary ecological effects are represented by
Optimization Process
In Phy-APMR, training is guided by two complementary types of loss: a statistically determined loss and a physics-driven loss. The statistically determined loss ensures that the network’s predictions closely match observed sensor data, maintaining fidelity to real-world measurements. The physics-driven loss, on the other hand, enforces the fundamental physical laws governing pollutant behavior, including advection by wind, diffusion and turbulent mixing, chemical transformations, emission sources and sinks, boundary and initial conditions, and meteorological influences. By balancing these two components, the network is directed to produce predictions that are not only accurate but also physically consistent and environmentally realistic, enabling reliable mapping even in areas with sparse observational data.
• Supervised loss: The initial training dataset is defined as
• Physics-informed loss: The subsequent loss component, referred to as the physics-based loss, guarantees that the NN predictions remain consistent by means of the governing mechanical principles. This is realized by selecting
Descriptions of pollution concentrations were not associated with these collocation locations. Instead, the physical loss is formulated through the physical residuals,
• Integrated loss function: By integrating the statistical loss with the physics-driven loss, the combined loss function ensures that the model’s predictions remain accurate while consistently adhering to the governing physical laws of pollutant transport in the atmosphere. This formulation achieves a balance between empirical precision and physical coherence, accounting for key factors such as advection by wind, turbulent diffusion, chemical transformations, emissions, deposition, and meteorological influences. The combined loss is expressed as:
where
where
The main goal of the training process is to minimize the total loss:
Throughout this research, feature scaling is applied to adjust the NN inputs to the interval
4.3 Methodology for Phy-APMR-WS
More comprehensive ecological data, including wind speed (WS), which is critical for the advection and dispersion of air pollutants, may be incorporated into
4.3.1 Pollutant Dispersion via Convection–Diffusion
The rapid advancement of database management technologies and urban monitoring infrastructure has enabled the collection of high-resolution ecological data, including wind velocity fields, turbulent fluctuations, and spatially resolved pollutant emission sources [20]. The convection–diffusion equation, which generalizes the classical dispersion model by explicitly incorporating advection, diffusion, and reactive processes [21], provides a rigorous framework to integrate these ecological factors into the physical dynamics of atmospheric pollutant transport. By accounting for the combined effects of bulk wind advection, turbulent mixing, chemical transformations, deposition, and localized emission patterns, this approach allows for a mechanistically grounded representation of pollutant dispersion across complex urban environments.
The wind velocity field, denoted by
The corresponding expression for the energy function is given as:
The full expression of the propagation equation is formulated as:
where the expression
Assuming that pollution sources remain relatively stable over short time intervals, their impact on atmospheric pollutant concentrations is often modeled primarily as a function of spatial location [19]. This assumption generally holds for continuous emitters such as industrial facilities and power plants. However, in complex urban environments, additional time-varying contributors including traffic peaks, restaurant cooking emissions, and hotel exhaust play a significant role in shaping local air quality. When leveraging mobile sensors, which provide measurements at dynamically changing locations, it becomes essential to account for both spatial and temporal variability in emissions. To capture these effects, the emission source term
4.3.2 Phy-APMR-WS Architecture
This scenario is addressed by the Phy-APMR-WS system, an extension of the original Phy-APMR framework that explicitly integrates physical laws governing atmospheric pollutant transport:
(i) High-resolution data on wind velocity and pollutant concentrations are available, represented by the real dataset
(ii) Although direct measurements of emission sources are not available, the Phy-APMR-WS framework leverages a PINN architecture to implicitly model pollutant generation, advection, diffusion, chemical transformation, and deposition during training. By enforcing these physical constraints, the system effectively reconstructs the influence of unobserved sources on observed concentrations, thereby enhancing the physical consistency and interpretability of the atmospheric indicators learned by the network.
The NN’s output layer consists of four neurons, which distinguishes the Phy-APMR-WS model from the original Phy-APMR. Specifically, the external variable
However, the NN is trained on the database
In the Phy-APMR-WS model, as in the original Phy-APMR, the physics of pollutant transport is explicitly incorporated by enforcing the convection–diffusion PDE residual in the total loss. This residual captures the advection of pollutants by the predicted wind field, turbulent diffusion, and, if applicable, reaction, deposition, and source emission processes, ensuring that the network predictions obey the fundamental physical laws of atmospheric transport. Formulation (21) specifies the estimated PDE residual at a given spatiotemporal location
The
where
4.4 ASUS: Approach and Implementation
Providing rapid and reliable measurements of atmospheric pollutants is essential for the accurate reconstruction of high-resolution pollution maps. These tasks require continuous inference that not only captures observed concentrations but also respects the underlying physical laws of pollutant transport. A key practical challenge arises: How can the training and inference process of Phy-APMR, which enforces both data fidelity and physics constraints, be significantly accelerated without compromising physical consistency? To address this, we propose the ASUS method, which strategically selects spatiotemporal data points to efficiently update the network while maintaining adherence to the governing convection–diffusion dynamics of atmospheric pollutants.
As mentioned earlier, the size of the empirical observation dataset
The dimensions of the actual dataset and the combined training set are denoted by
To dynamically reduce
The ASUS algorithm incorporates three key strategies: “Sampling Reduction”, “Representative Sampling”, and “Short-Time Update”. The principles behind these strategies are illustrated in Fig. 6. For clarity, the 3D spatiotemporal domain has been projected onto a 1D representation in the figure.

Figure 6: The figure illustrates the three main strategies employed by ASUS. The 3D dynamical domain has been projected onto 1D for clarity. The first strategy, “Efficiency-driven sampling,” aims to decrease the total number of collocation points, thereby reducing the training time. When combined with “coverage oriented sampling,” this approach preferentially selects collocation points in regions with higher PDE loss, which are often more informative about the underlying function structure. The third strategy, “temporal update,” enhances distribution across the attributed space at the beginning of experiment by resampling the set of collocation points every
By drastically lowering
However, for atmospheric pollutants such as
An indiscriminate reduction of collocation points may therefore limit the model’s ability to resolve sharp spatiotemporal gradients, episodic pollution events, and chemical regime transitions (e.g., VOC-limited vs.
The residual-based adaptive refinement (RAR) remodelling technique [37] serves as the foundation for the subsequent mechanical procedure. Its core principle is to enhance gradient-based optimization by preferentially selecting collocation points associated with higher PDE residuals, denoted as
Within the proposed adaptive screening step, newly sampled collocation locations exhibiting updated PDE residuals,
To formalize this criterion, a threshold parameter
where
By adaptively emphasizing collocation points with elevated PDE residuals, the method systematically captures dominant physical and chemical processes, including pollutant transport, turbulent mixing, photochemical ozone formation, secondary particulate matter generation, and deposition mechanisms. Consequently, the proposed Phy-APMR framework improves training efficiency, reduces the total number of required iterations, and enhances the physical consistency and generalization capability of the learned solution for multi-contaminant air quality modelling.
The collection of PDE collocation points is generated arbitrarily from the characteristic (or specification) region at the beginning of training. In conventional approaches, these collocation sites remain fixed throughout the training process. To ensure sufficient domain coverage, a significant quantity of stationary collocation locations must be employed, which substantially increases the computational cost. Moreover, relying solely on the emphasized optimization technique may lead to insufficient representation of the domain.
We propose a short-time implementation strategy to address this issue, in which collocation points are resampled and updated every
Algorithm 1 presents the complete ASUS methodology. The ASUS framework accelerates the training phase of the Phy-APMR technique without compromising accuracy by integrating monitoring optimization, descriptive evaluation, and immediate amendment. A detailed evaluation is provided in Section 5.10.

In this section, the experimental settings are introduced in Section 5.1. Section 5.2 presents a performance evaluation of Phy-APMR against baseline methods. In Section 5.7, the robustness of Phy-APMR under varying data sparsity levels is examined. Section 5.8 provides a comparison of performance across different input dataset durations and model recalibration intervals. Finally, Section 5.10 evaluates the effectiveness of the ASUS algorithm.
5.1.1 Stationary and Moving Sensors
Understanding that fixed detectors have a restricted geographic area, we implanted wireless sensors on public transportation and automobiles in a systematic way to extend geographical coverage. We purposefully installed autonomous sensors on transportation vehicles to extend geographic coverage after recognizing that fixed detectors had a limited viewing range. This flexibility enables the collection of data across diverse metropolitan scenarios and allows observation of a broader area. To ensure adequate data acquisition, our test setup incorporated monitoring equipment on both fixed and handheld platforms. Fig. 7a illustrates the appearance of the device, while Fig. 7b–d shows its deployment. Each monitoring unit consists of four primary components: the sensor, connectivity module, power supply, and control circuitry. (i) The detection component integrates a GPS module along with multiple pollution-monitoring sensors, including Plantower’s PMS5003 for measuring fine particulate matter PM2.5. The system is extended to account for PM10 and

Figure 7: Images of the sensor apparatus used in our study. Panel (a) represents the layout of the air pollution sensor. Panels (b–d) depict its installation on a stationary mount, a data-collecting vehicle, and a sensor-equipped bus, respectively. Panel (e) shows the climate-monitoring instrument used for measuring airflow velocity in Lahore.
(ii) The device is powered by an external generator source, ensuring stable operation of gas- and particle-sensing elements under varying environmental conditions and enabling continuous monitoring during extended deployment periods.
(iii) The transmission component enables real-time data transfer via Wi-Fi and 3G/4G networks to an encrypted cloud-based server, facilitating near–real-time assessment of PM10 and
(iv) A microcontroller embedded within the monitoring unit manages sensor synchronization, data acquisition, preprocessing, and communication tasks, ensuring reliable integration of particulate and ozone measurements with geospatial information.
As shown in Fig. 7e, miniature meteorological instruments were deployed in the Lahore study to record atmospheric parameters and to reduce the influence of urban vehicular circulation on the accuracy of PM2.5 measurements. In addition to PM2.5, these atmospheric conditions strongly influence the formation, dispersion, and accumulation of
To monitor near-surface, real-time atmospheric conditions, such sensors are commonly installed on unsecured rooftops of low- to mid-rise buildings. The devices are designed for continuous long-term operation using household electrical power. The meteorological data collected are essential for interpreting variations in PM2.5, PM10, and
Overall, this integrated monitoring configuration ensures that the study accurately captures circulation dynamics and environmental conditions relevant to particulate matter and ozone exposure, thereby improving the robustness and precision of the collected air-quality data.
To ensure data accuracy, all collected measurements underwent an extensive preprocessing and cleaning phase. Mean values were computed within each predefined sensorimotor region, after which the Three-Sigma Threshold was applied to identify and remove anomalous observations. This process improves data reliability by mitigating the influence of outliers, thereby strengthening the validity of comparative analyses across multiple sensing devices.
To further enhance study quality, peripheral regions with insufficient data coverage were excluded from the analysis. As shown in Table 1, this filtering step resulted in effective study areas of approximately 125 km2 in Lahore, 62 km2 in Faisalabad, and 38 km2 in Karachi, ensuring consistency and comparability across regions.

The remaining measurements were discretized using a spatial resolution of 0.5 km
To evaluate sensor reliability, measurements from the sensing devices (
Standard performance metrics, including the coefficient of determination (
Finally, the degree of correlation
where
In addition to correlation, the accuracy of each monitoring device was evaluated using the
Here,
For example, Table 2 presents the Pearson correlation,

5.1.3 Experimental Data Splitting and CV Protocol
To ensure reproducibility of the experimental split protocol, we explicitly describe the data partitioning strategies used for training and validation of the proposed model. Two cross-validation (CV) strategies were employed: sample-based CV and grid-based CV, designed to assess both overall predictive accuracy and spatial generalization across urban monitoring networks.
Sample-based CV:For sample-based CV, the dataset was randomly partitioned at the time-piece level into five subsets of approximately equal size (20% each). In each iteration, four subsets were used for training and the remaining subset for validation. This process was repeated five times, ensuring that each sample was used once as validation. Samples were drawn uniformly across all spatial locations and time periods. This strategy evaluates the model’s ability to predict pollutant concentrations when both spatial and temporal distributions are represented in training and validation sets.
Grid-based CV: For grid-based CV, the spatial domain was partitioned into contiguous blocks corresponding to 0.5 km
The sample-based and grid-based CV results for

Figure 8: Density scatterplots of the proposed model performance for
For
For PM2.5, sample-based (grid-based) CV
Spatial robustness was evaluated across individual monitoring sites in Pakistan. Overall,

Figure 9: Evaluation results of the proposed model for O3, PM2.5, and PM10 in 2024 at reference sites.

Figure 10: Monthly model estimation errors for O3, PM2.5, and PM10 in Pakistan during 2024. (a) Spatial distribution of
Overall, these results indicate that the proposed model effectively captures daily and seasonal variations of
As shown in Table 3, the model achieved an

The most recent data obtained from the sensor network are used to train the Phy-APMR model at each successive time slice. Specifically, at each period, 25% of the data are selected to form the test set, while the remaining 75% constitute the training set, representing the information available in the real-world context. Each successive segmentation during evaluation involves updating the system. To develop a predictive algorithm, the data available at the current time
An additional portion derived from the experimental data is reserved as a assessment dataset to enable incipient stopping and hyperparameter calibration, thereby preventing overfitting. Table 1 summarizes the data collection details for each of these locations.
The prediction error in our study is evaluated utilizing the Mean Absolute Percentage Error (MAPE), stated as:
where
Seven reference methods are used for comparison with Phy-APMR:
• Central value: The simplest approach uses the mean of all training data for the given time segment to generate the predicted outcome.
• Parametric spline approximation (PSA): A spline-based, component-wise interpolating polynomial method is employed. The PSA technique is implemented using Python’s scipy module.
• Physics-informed difference technique (PIDT): A conventional method for analyzing PDEs is employed. PIDT first uses PSA to fully reconstruct the initial conditions over a two-dimensional grid. At each time step, PIDT (i) applies the finite difference method to solve the equations and determine ambient pollution levels at each site, and (ii) incorporates newly collected data to correct any inaccurate predictions. For sites where additional measurements such as wind data are available (e.g., Lahore), these inputs are used to enhance predictions; for other locations, only standard corrections based on local sensor data are applied. The final results are obtained by iteratively repeating this procedure. The entire process is implemented in Python using the pde module.
• Feedforward NN (FFNN): The NN component is retained in this baseline model, while the physics-informed network element is removed. Physical-fine-grained APMR and FFNN can then be compared to evaluate the contribution of the physics-informed network component. In particular, this is accomplished by
• Stochastic process regression (SPR): Following its initial introduction by Cheng et al. [38] within the AirCloud framework, SPR has become widely adopted as an analytical method for reconstructing air quality estimates. SPR employs probabilistic inference to estimate Gaussian probability distributions for unmeasured regions and assumes that the spatial correlations of pollutant levels across different hypothetical regions follow a Gaussian process. SPR performs particularly well when substantial data are available. In this study, SPR is implemented using Python’s GPflow library.
• AQI-M3: This approach, introduced by Liu et al. [39], addresses the challenge of improving the fidelity of atmospheric pollutant estimates. AQI-M3 decomposes the reconstruction task into two complementary perspectives: a temporal (itinerary) view and a spatial (vicinity) view. Using encoder–decoder architectures, the model integrates data-driven sensor fusion with vehicle tracking information.
• Hybrid model-enabled sensing system (HMSS): The HMSS, first introduced by Chen et al. [40], integrates data-driven validation with mathematical PDE-based prediction. Specifically, it employs computational filtering to incorporate SPR-based adjustments within a PDE framework, producing hybrid predictions for the reanalysis of air quality landscapes.
All experiments were implemented using the TensorFlow 2.4.0 framework with Python 3.8 and trained on an NVIDIA GTX Titan X GPU with 12 GB memory. The number of training iterations was fixed at
5.2 Analysis of Phy-APMR Outcomes
The classic FFNN performs the worst among all methodologies due to inadequate learning caused by the small size of the training set (see Table 4). The efficiency achieved by the PIDT approach is limited because the PDE used does not fully capture the shifting patterns of atmospheric pollution transmission. The basic central value approach works well in Faisalabad and Karachi, where statistical volatility is minimal. PSA, on the other hand, produces slightly higher-quality outcomes for Lahore, where variations in pollution levels are more pronounced. AQI-M3 performs similarly to the Central value and PSA approaches, showing no discernible improvements in this context, as it was primarily designed to enhance clarity of contamination estimates.

SPR, a widely used reconstruction technique, outperforms purely data-driven methods by incorporating more accurate spatiotemporal correlations in pollution data. Hybrid methods such as HMSS and Phy-APMR perform better than other existing approaches by integrating both physical and data-driven information, with Phy-APMR achieving the highest overall accuracy. As discussed in Section 5.7, this improved performance is particularly evident during periods of data scarcity. Within the evaluation set, Phy-APMR outperforms the best benchmark method, HMSS, by 16.4%, 12.7%, and 15.9% in Lahore, Faisalabad, and Karachi, respectively. Its robustness is further supported by comparable standard deviation values of 0.42, 0.56, and 0.37 across the three cities. For clarity, wind-enhanced evaluations are applied only in Lahore, where wind measurements are available. In Faisalabad and Karachi, model performance reflects standard corrections based solely on local sensor data, without wind-based adjustments.
Datasets collected by the Pakistan Ecological Protection Agency are used to comprehensively evaluate the simulations. Six, three, and three reference monitoring stations are located in Lahore, Faisalabad, and Karachi, respectively, within the tested regions over various periods. These stations use high-accuracy instruments and provide hourly data, in contrast to low-cost portable detectors. The laboratory results, summarized in Table 5, demonstrate the reliability of Phy-APMR, which consistently outperforms all other baselines in terms of MAPE. In Fig. 11a, a national air quality monitoring station in Lahore (highlighted with a red box) is shown together with nearby low-cost monitoring devices providing training data within the simulated region. Fig. 11b,c illustrates the spatial configuration of reference monitoring stations and surrounding device installations for Faisalabad and Karachi, respectively. In all three cities, the highlighted reference sites are embedded within clusters of monitoring devices, enabling the proposed model to learn local spatial variability and support reliable reconstruction of pollution fields. Fig. 12a–i shows the predicted concentrations of O3, PM2.5, and PM10 for Lahore, Faisalabad, and Karachi. When compared to measurements from the reference monitoring stations, it is evident that Phy-APMR provides reliable forecasts throughout the full one-day period, as indicated by the consistently low MAPE values.


Figure 11: Lahore, Faisalabad and Karachi air quality chart at 11:00 AM on 1 December 2024. Shaded dots represent the data acquisition devices, such as both stationary and vehicle-mounted monitoring spots. The red squares indicate the locations of the Pakistan Ecological Protection Agency.

Figure 12: The one-day performance of Phy-APMR indicates precise forecasting, having minimum MAPE scores relative to the actual measurements from Pakistan Ecological Protection Agency in Lahore, Faisalabad and Karachi.
Traditional DL models are often regarded as “black boxes” due to their limited interpretability. To address this, we applied ASUS technique to our proposed model, highlighting the predictors that contributed most to the joint estimation of O3 and PM concentrations (see Fig. 13).

Figure 13: Interpretation of predictor contributions to air quality in Pakistan for 2024.
For 2024, the top five predictors were S5P_HCHO, S5P_CO, CAMS_OH, CAMS_HCHO, and T2M, with median relevance scores of 0.084, 0.062, 0.058, 0.050, and 0.046, respectively. In 2023, CAMS_NO2 and T2M ranked fifth and sixth. S5P_HCHO, representing total column formaldehyde, achieved a maximum relevance score of 0.602, while surface-level formaldehyde (CAMS_HCHO) reached 0.462. Formaldehyde is a key precursor for O3 and PM formation through photochemical and in-cloud processes (Dovrou et al. [41]), making it a critical predictor.
CO, mainly emitted from biomass burning and fossil fuel combustion, is another important precursor for O3 and PM, showing strong correlations with both pollutants (Jiang et al. [42]). OH, which reflects the atmospheric oxidation capacity, regulates O3 and PM concentrations and interacts with CO to influence the balance of other precursors and oxidants (Su et al. [43]). Temperature (T2M) influences photochemical reaction rates, heterogeneous chemical processes, and pollutant dispersion, with a relevance score of 0.407. NO2 is another major precursor, contributing to both O3 and secondary organic aerosol formation, highlighting the significance of CAMS_NO2 in joint predictions (Liu et al. [27]).
Independent predictors, such as CAMS_SSAOD, CAMS_DUAOD, S5P_O3, CAMS_GO3, and STR, had median relevance scores below 0.003 and generally contributed less than shared or interacting factors. For example, AOD reflects total aerosol loading and is more informative for PM than for O3, underscoring the value of considering interactions among pollutants and meteorological conditions for joint estimation [26].
Spatial heterogeneity of predictor importance was evident across Pakistan (Fig. 14). In Southern Pakistan, the top five predictors largely matched the overall results, though S5P_CO ranked fifth and CAMS_HCHO second. HCHO has been shown to contribute significantly to O3 and secondary aerosol formation in these regions (SPR). In Central and Eastern Asia, CAMS_NO2 replaced T2M and S5P_CO, reflecting the influence of local anthropogenic emissions. CAMS_SO2 gained importance in Northern Pakistan due to its role in sulfate formation, secondary PM production, and indirect effects on photochemical O3 generation (Mohyuddin et al. [32]). In Northwestern Pakistan, natural environment predictors, such as latitude and elevation, were more dominant due to the sparse anthropogenic influence. These regional variations emphasize the need to consider local characteristics for accurate air quality modeling and coordinated pollution control strategies.

Figure 14: Interpretation of predictor contributions to air quality in Pakistan for 2024.
Seasonal effects were also notable (Fig. 15). During summer (June–August), T2M was the second most important predictor, reflecting the dominant influence of high temperatures on photochemical reactions and emissions from both anthropogenic and biogenic sources. In winter (January, February, December), precursors such as CO, HCHO, NO2, C2H6, SO2, C3H8, and PAN were relatively more important due to lower temperatures and increased emissions from heating activities (Bilal et al. [26]). Spring and autumn exhibited patterns similar to the overall results, with both environmental and precursor indicators among the top 10 predictors. Importantly, we incorporated fine-grained Phy-APMR for Pakistan and mobile sensing device data, enabling a high-resolution assessment of spatial and temporal variations in air quality. These additional datasets enhanced the interpretability of the model and provided critical insights for targeted pollution control and management strategies across different regions and seasons.

Figure 15: Seasonal interpretation of the top ten most influential predictors in 2024: (a) Spring, (b) Summer, (c) Autumn, and (d) Winter.
The daily O3, PM2.5, and PM10 concentrations across Pakistan were effectively estimated using the fine-grained Phy-APMR. Their annual spatial distribution maps are presented in Fig. 16. The annual mean O3 concentrations in Pakistan were 100.48

Figure 16: Maps showing (a) O3, (c) PM2.5, and (e) PM10 concentrations across Pakistan in 2023, and (b) O3, (d) PM2.5, and (f) PM10 concentrations across Pakistan in 2024.
For particulate matter, the annual mean PM2.5 concentrations were 27.05
Moreover, regions with severe PM pollution extended to Northwest Pakistan, particularly around the Thar Desert and surrounding arid areas. Frequent dust storms and windblown dust events contributed to extremely high PM2.5 and PM10 concentrations in this region. Due to the large particle size of dust, PM10 levels were significantly higher than PM2.5. In contrast, the northern mountainous regions and parts of Balochistan exhibited relatively low PM levels, which were accompanied by comparatively lower O3 concentrations. These areas benefit from enhanced natural dispersion due to topographical features, such as mountainous terrain and higher vegetation cover, which help reduce both particulate matter and O3 pollution. The observed differences between O3 and PM pollution highlight the necessity of implementing effective, region-specific strategies for air quality management in Pakistan. Seasonal patterns were broadly consistent between 2023 and 2024; therefore, only the 2024 results are presented here.
Regarding seasonal variations in O3, different regions exhibited generally similar trends but with notable local distinctions. Most areas experienced the lowest O3 concentrations in winter, while spring and summer months saw elevated O3 levels. However, in southern Pakistan, O3 concentrations during summer were slightly lower than in winter, likely due to monsoon-driven rainfall and high precipitation events that promote pollutant washout and dilution. Autumn O3 levels were generally similar to those in winter, except in urbanized regions of Punjab and Sindh, where O3 pollution remained elevated, comparable to spring and summer, reflecting persistent anthropogenic emissions.
In northern Pakistan, in-situ observations sometimes indicate higher O3 concentrations in summer than in spring; our model showed slightly higher averages in spring (see Fig. 17). This difference can be attributed to the fact that the model produces regional-scale estimates, whereas observational measurements are often localized.

Figure 17: Model-estimated and in-situ observed seasonal concentrations of O3, PM2.5, and PM10 across different regions of Pakistan in 2024. Error bars indicate the standard deviation.
Conversely, PM2.5 and PM10 exhibited seasonal patterns opposite to O3, with the lowest concentrations during summer. Winter months were generally associated with the most severe PM pollution, due to stagnant meteorological conditions and increased anthropogenic emissions such as residential heating and biomass burning. PM concentrations in spring and autumn were generally comparable, except in Northwest Pakistan, where frequent dust storms in spring caused extremely high PM10 levels.
Overall, the annual and seasonal distributions of O3, PM2.5, and PM10 predicted by the model are consistent with available ground-based measurements and previous studies in Pakistan. High-concentration peak events were selected to further evaluate model performance at a fine spatial resolution (0.05°) (see Fig. 18). For O3, 20 August 2024 during summer represented a severe pollution event in the urban and industrial centers of Punjab, where O3 concentrations exceeded 170

Figure 18: High-concentration peak events for O3 and PM. (a) O3 on 20th August 2024, (b) PM2.5, and (c) PM10 on 28th February 2024. C, S, E, and N denote central, south, east, and north Pakistan, respectively. Blank areas represent missing data.
For PM, 28 February 2024 during winter represented a severe PM pollution episode across Central, Northern, and Eastern Pakistan, with peak PM2.5 concentrations exceeding 120
5.5 Impact of Each Module and Loss Function
The primary contribution of the proposed model lies in the adoption of a cascade architecture that integrates an attention module and an interaction module to address the complex coupling relationships among categorized factors collected from mobile crowd-sensing devices deployed in Pakistan. This design enables the joint estimation of O3, PM2.5, and PM10 by effectively capturing both shared environmental influences and inter-pollutant dependencies.
To examine the contribution of each component, an ablation study was conducted, as summarized in Table 6. The baseline model (Model_base) directly utilized all sensor-derived factors for the joint estimation of O3, PM2.5, and PM10 without distinguishing shared variables or explicitly modeling pollutant interactions and did not incorporate WS effects. Consequently, the model was unable to fully exploit the intrinsic relationships among the input features. The model without the attention module (Model_wa) failed to differentiate the varying impacts of shared meteorological and environmental factors on O3 and particulate matter, focusing only on the interdependencies among atmospheric pollutants measured by the sensor network. In contrast, the model without the interaction module (Model_wi) ignored the mutual physical and chemical interactions among pollutants, thereby limiting its capability for residual correction.

The experimental results clearly demonstrate the contribution of each module. The baseline model exhibited the poorest performance, achieving
Model_wi, which retained the attention mechanism while excluding the interaction module, achieved higher

The baseline loss function (
Compared with models trained using
Interestingly, the loss function combining both intermediate outputs and physics constraints (
5.6 Comparison with Estimating Pollutants Separately
To evaluate the effectiveness of the Phy-APMR model, separate estimation models for O3, PM2.5, and PM10 were also constructed. These separate models retained structures similar to the proposed model, including the FE, attention module, and interaction module, to ensure a fair comparison. The estimation results and model efficiencies are summarized in Table 8.

For O3 estimation, the separate model achieved
However, Phy-APMR demonstrated superior estimation accuracy for PM2.5 and PM10, with
In addition, the joint estimation approach significantly improved computational efficiency. By sharing hidden layers, the total number of model parameters for Phy-APMR was only 3.92 M, nearly half of that required by the combined separate models. Similarly, the training time per epoch on a single NVIDIA GeForce RTX 4090 GPU was reduced to 2.04 s for Phy-APMR, whereas training three separate models required 5.21 s per epoch. Moreover, separate estimation models necessitate redundant operations, such as multiple data reading and preprocessing steps, which increase practical workload [20]. In summary, through careful model design, Phy-APMR not only achieves higher estimation accuracy but also avoids duplication of effort and substantially enhances practical efficiency.
5.7 Robustness of Phy-APMR with Sparse Observations
Evidence obtained from both stationary and portable sensor systems is often sparse across certain spatiotemporal regions, as discussed in previous sections. In scenarios with limited data availability, traditional data-driven methods typically perform poorly, with accuracy declining sharply as the amount of training data decreases. In contrast, Phy-APMR leverages embedded scientific principles, effectively incorporating prior physical knowledge to overcome data limitations and enhance generalization.
To investigate this property, we reduced the proportion of the experimental dataset relative to the full information set, defining the degree of information sparsity as “1 minus the learning batch fraction.” The reference FFNN is excluded from these evaluations due to its poor performance under limited data. The performance of various methods across different training set sizes is presented in Fig. 19. The effectiveness of Phy-APMR declines only slightly with fewer observations; however, when the dataset variability increases from 0.5 to 0.9, the observed bias rises by 33%, 27%, and 30% in Lahore, Faisalabad, and Karachi, respectively (see Fig. 20). In comparison, under the same conditions, the second-best method, HMSS, exhibits error increases of 36%, 37%, and 40%, highlighting the superior robustness of Phy-APMR against limited data.

Figure 19: After individually reducing the proportion of training data for Lahore, Faisalabad, and Karachi, the performance of Phy-APMR and other benchmark techniques was evaluated. For Phy-APMR, the error increased by only 34%, 26%, and 31% across the three target cities, respectively, as in conditions of limited data availability (the fraction of data not included in the experimental test) increased from 0.5 to 0.9.

Figure 20: Taylor plots showing the efficacy of Phy-APMR and other benchmark techniques after individually reducing the proportion of training data for Lahore, Faisalabad, and Karachi. For Phy-APMR, the error increased by only 34%, 26%, and 31% in these three cities, respectively.
Hourly performance was also evaluated by dividing the data into 24 h intervals per day, to assess resilience under time-limited conditions. Karachi exhibited the largest day-to-night variation due to strict management practices. Fig. 21 presents the hourly performance, showing only the two best baselines (SPR and HMSS) for clarity. The histogram indicates training data availability each hour. The results show that hybrid approaches (HMSS and Phy-APMR) outperform the purely data-driven SPR, especially during hours with sparse data. Overall, Phy-APMR consistently exceeds all baseline methods, and during periods with higher data availability, it performs on par with or significantly better than SPR and HMSS.

Figure 21: Hourly performance of Phy-APMR, SPR, and HMSS in Karachi over a full day. Gray bars indicate the amount of learning data. Phy-APMR’s advantage over SPR and HMSS is particularly evident during periods with light green highlights when data is limited. In the afternoon, indicated by the coral background when more data is available, Phy-APMR continues to outperform or perform comparably to SPR and HMSS.
To further evaluate the effectiveness of Phy-APMR, separate estimation models for O3, PM2.5, and PM10 were constructed. These separate models retained the same FE, attention module, and interaction module as Phy-APMR to ensure a fair comparison. The estimation results and model efficiencies are summarized in Table 8. For O3, the separate model achieved
However, Phy-APMR achieved higher estimation accuracy for PM2.5 and PM10, with
Moreover, the efficiency of the estimation process is substantially improved. By sharing hidden layers, Phy-APMR requires only 3.92 M parameters, nearly half the total parameters of the three separate models combined. The training time per epoch on a single NVIDIA GeForce RTX 4090 GPU is reduced to 2.04 s, compared with 5.21 s for the separate models. Separate estimation also incurs redundant operations, such as repeated data reading and preprocessing [20]. In summary, Phy-APMR not only achieves higher estimation accuracy but also avoids duplication of work and substantially improves practical efficiency.
5.8 Phy-APMR Performance with Various Training Duration and Adaptation Intervals
Two primary approaches are employed to train Phy-APMR systems for reanalyzing air pollution patterns:
• Low-frequency update strategy: Monitoring information from a lengthy timeline is used for training the algorithm initially, after which it is maintained for an extensive amount of time. Illustrations of contaminants in the air are often reconstructed using such a developed technique.
• High-frequency update strategy: Preferably, present information collected over a condensed time period is used for training the algorithm. The framework is refined and reconfigured whenever new information appears readily accessible, leading to shortened lifespans and a higher degree of adaptability.
To evaluate the effectiveness of each approach, we build a demonstration that focuses on two hyperparameters, specifically (i) the initial training database’s temporal range (i.e.,
Examining the data shown in Fig. 22, the following inferences may be drawn:
1. Effect of update frequency: Rapid model updates are valuable because they significantly enhance interpretability during shorter time intervals (i.e., higher update frequency).
2. Effect of training time span: An optimal training duration exists for a given iteration interval. Performance can degrade when the interval is either too short or too long. Time correlations weaken with larger gaps because the reconstruction of pollution patterns is inherently an interval-based interpretation problem. Including large amounts of information exhibiting low correlation (i.e., a substantial

Figure 22: The following describes Phy-APMR’s performance across three cities under different training set durations and model update cycles: (i) Phy-APMR achieves significantly better results when the update cycle is shorter, meaning more frequent model retraining. (ii) Using training set durations that are either too short or too long can negatively affect prediction accuracy during ongoing model updates.
A trade-off is therefore necessary. According to the study results (see Fig. 22), the optimal approach for daily air pollution pattern reconstruction involves a single algorithm iteration paired with a training set spanning approximately two to three hours. However, most deep NN algorithms, including Phy-APMR, require several days to reach equilibrium, highlighting the need for the proposed ASUS architecture to enable fast and efficient learning.
5.9 Phy-APMR Performance in WS Analysis
The Phy-APMR wind velocity approach leverages WS data to enhance forecast accuracy, as discussed in Section 4.3. We conduct experiments on the Lahore pollution pattern reconstruction task to evaluate the performance improvement provided by incorporating meteorological information. In this scenario, the standard Phy-APMR model does not utilize wind direction data, whereas the Phy-APMR variant with atmospheric frequency does.
However, the advantage of incorporating meteorological data into the modeling process is demonstrated in Table 9, which shows that Phy-APMR with wind information achieves an improvement of nearly 13% compared to the standard Phy-APMR.

5.10 Performance Analysis of ASUS
The performance of the ASUS procedure, described in Section 4.4, is evaluated in this section. Its effectiveness is assessed by comparing ASUS with four representative benchmark methods:
• Uniform sampling: The most fundamental method, in which collocation spots were selected arbitrarily at the beginning of training and remain fixed throughout. Such considerations are repeatedly used during training. For the baseline numerical simulation, we employed 1000 collocation points, randomly distributed to ensure adequate coverage of the entire computational domain.
• RAR: Lu et al. [37] developed RAR, which begins with a fixed number of randomly generated localization points (here, 500). Additional collocation points are typically added automatically in regions exhibiting large error residuals during training. This method represents one of the specifications we replicate.
• Short-time update sampling (SUS-1000/SUS-10): The adopted approach is not contingent on fixed localization sites, as dynamic methods require. Instead, additional collocation points are continuously added during training. The reference method employs only the “short-time update” approach, using a fully random substitution process. The resulting statistics serve to verify the efficacy of the “point elimination” technique by reporting the total number of localization points replicated at each iteration.
The corresponding reduction in prediction error on the experimental set using various selection procedures is shown in Fig. 23. Specifically, the error variation with respect to the training period is shown in Fig. 23b, while the error variation with respect to the number of training iterations is shown in Fig. 23a.

Figure 23: Analysis of various collocation point selection methods in terms of the reduction of test set error during PINN training. The SUS approach significantly accelerates learning, with SUS-10 achieving much faster convergence than SUS-1000. The most efficient method is ASUS, which enables Phy-APMR to complete training in approximately 22 s without compromising inference accuracy.
These findings clearly demonstrate the advantages of ASUS. The effectiveness of RAR gradually increases as additional dynamic localization sites are progressively added, and (i) ASUS achieves significant efficiency improvements compared to dynamic collection. This improvement is largely attributed to the “efficiency-driven allocation” strategy employed in RAR.
(ii) The “short-time update” technique significantly enhances training efficiency, as evidenced by the high speed achieved using the SUS method.
(iii) Limiting the number of collocation points leads to a higher total number of training iterations when comparing SUS-10 and SUS-1000. However, since each iteration with fewer localization points is computationally cheaper, the overall training time is ultimately reduced. Consequently, when combined with the “short-time update” technique, the point reducing strategy effectively accelerates the training process.
(iv) The optimal results are achieved by ASUS, which integrates “efficiency-driven allocation,” “sparse point selection,” and “time-localized modification.” Using ASUS, Phy-APMR can support real-time execution requirements by generating one output per segment, or one hour of data, in approximately 20 s.
(v) The final prediction error achieved using ASUS is comparable to that of various benchmarks in terms of accuracy. This indicates that ASUS accelerates training without compromising prediction quality.
Using the computing configuration described in Section 5.1.7, Table 10 presents the training durations for our proposed method, Phy-APMR wind strength optimized with ASUS, compared to the benchmark approaches outlined in Section 5.1.6. For 80,000 training iterations, Phy-APMR wind velocity (ASUS) requires 181.4 s, highlighting the significant computational overhead. In contrast, with ASUS, Phy-APMR wind frequency completes in just 27.4 s using only 15,000 iterations. Compared with similarly advanced NN-based benchmarks such as AQI-M and HMSS, these results demonstrate the substantial efficiency and productivity gains, indicating that our approach achieves fast training while maintaining high predictive efficiencies.

In what follows, we underscore the need for more robust strategies to tackle Phy-APMR’s inherent uncertainties and underlying environmental factors. Moreover, efficient automated hyperparameter tuning could reduce computational costs while improving model accuracy.
6.1 Limitations and Future Work
Despite yielding promising results, the Phy-APMR architecture has several limitations that warrant consideration. Internal and hidden factors, such as pollutant source concentrations and complex wind patterns, increase the complexity of the PDEs encoded within the PINN structure. While these factors are crucial for a better understanding of atmospheric pollutant mechanisms, their intrinsic ambiguity complicates accurate estimation and decoding.
In addition, model performance is highly sensitive to the total number of training sessions and the loss-scaling coefficient (
Although Phy-APMR achieves satisfactory performance, several limitations remain for future research. First, an evaluation of different combinations of loss functions suggested that the impact of the loss design on the results was minimal. This may be attributed to assigning identical weights for hierarchical levels and different tasks in the loss function, and empirically determining the weights for physics constraints. Recent advances in DL have shown that self-adaptive weighting of losses can enhance model performance [33]. Future studies could explore optimal, dynamically learned weights to further improve estimation accuracy.
Second, remote sensing products exhibit nonrandom data gaps due to cloud contamination or restricted observation conditions. In this study, no additional preprocessing was applied to address missing data in the TROPOMI L3 products, as the primary focus was on developing and interpreting the joint estimation algorithms. Previous research has indicated that high rates of missing data can introduce biases in spatiotemporal pollutant distributions [26–28]. To address this, future work could develop specific gap-filling methods for TROPOMI products, which would allow for more accurate reconstruction of spatiotemporal patterns for O3, PM2.5, and PM10.
Finally, the ASUS method was utilized to interpret the proposed model and obtain rational insights into predictor contributions. While ASUS provides valuable guidance on feature importance, it is unable to distinguish between positive and negative contributions [30]. Future research could explore the use of SHapley Additive exPlanations (SHAP) to enhance interpretability, enabling broader applications of air quality index in environmental policy-making and decision support [44]. In conclusion, while Phy-APMR demonstrates strong performance in joint pollutant estimation, addressing hyperparameter sensitivity, data gaps, and interpretability limitations will be essential for future improvements and practical deployment.
6.2 Relevance to Practice and Policy
The Phy-APMR architecture enables scientifically informed assessment and development, advancing both research knowledge and practical implementation. Accurate characterization of contamination patterns is essential for identifying pollutant sources and designing sustainable regulatory strategies. Three distinct types of contamination were identified through experimental case studies:
• Endogenous O3 formation: The upward trajectory is primarily driven by local photochemical reactions involving nitrogen oxides (NOx) and volatile organic compounds (VOCs), showing a gradual increase during daylight hours with periodic fluctuations, as illustrated in Fig. 24a.
• Exogenous O3 transport: Advection of ozone-rich air from neighboring regions influences this trend, exerting a broad geographical impact. As shown in Fig. 24b, O3 concentrations tend to accumulate in downwind areas while gradually dispersing across the study domain.
• Dynamic O3 peaks: The phenomenon, which typically emerges in the late morning or early afternoon, results from interactions between solar radiation, temperature, and precursor emissions, producing additional O3. As depicted in Fig. 24c, concentrations rise consistently across the entire region during these periods.
• Endogenous PM2.5 pollution: The upward trajectory is primarily driven by contributions from regional buildings and routine behaviors, showing a gradual increase with occasional breakdowns, as illustrated in Fig. 25a.
• Exogenous PM2.5 pollution: The movement of contaminated air volumes across the research region drives this trend, exerting an overarching geographical influence. As shown in Fig. 25b, PM2.5 levels accumulate in the outermost areas while gradually diffusing inward.
• Dynamic PM2.5 toxic concentrations: The phenomenon, which specifically emerges at the onset of daytime, results from complex environmental biogeochemical interactions that adapt and produce additional PM2.5. As depicted in Fig. 25c, the values escalate consistently over the entire zone.
• Endogenous PM10 emissions: The upward trajectory is primarily driven by local activities such as construction, traffic, and industrial operations, showing a gradual increase with occasional spikes, as illustrated in Fig. 26a.
• Exogenous PM10 transport: Movement of dust and particulate-laden air from surrounding regions influences this trend, exerting a broad geographical impact. As shown in Fig. 26b, PM10 concentrations tend to accumulate in downwind areas while gradually dispersing inward across the study domain.
• Dynamic PM10 peaks: The phenomenon, often occurring during periods of high wind or human activity, results from complex interactions between meteorological conditions and local emissions, generating additional PM10. As depicted in Fig. 26c, concentrations rise consistently across the entire region during these events.

Figure 24: Pollution patterns for O3 at different time zones.

Figure 25: Pollution patterns for PM2.5 at different time zones.

Figure 26: Pollution patterns for PM10 at different time zones.
Taken together, these results highlight Phy-APMR’s potential to provide valuable insights into contamination behaviors, thereby supporting metropolitan sustainability regulation and informing legislative decisions.
In upcoming work, we aim to present portable crowdsensing as a viable and scalable methodology for metropolitan air quality surveillance. Unlike traditional fixed monitoring stations, these systems employ multiple sensing modules to enable comprehensive meteorological data collection at substantially lower costs.
7.1 IoT-Driven Air Quality Crowdsensing
Several dynamic crowdsensing systems have been developed [45,46] as cost-effective solutions for collecting multimodal atmospheric data across urban areas [47]. Existing methodologies can generally be classified into three categories: automobile-deployed, aerial platform-driven, and multi-agent heterogeneous systems.
Commercial automobiles [48] and public transportation systems traversing metropolitan road networks are employed in vehicle-based approaches. Owing to their comprehensive coverage and predictable movement patterns, these methods provide a all-encompassing understanding of pollution heterogeneity among various urban areas.
Drone-driven techniques enable large-scale atmospheric sensing, communication, and task scheduling with higher efficiency, while also accessing areas inaccessible to ground-based equipment [49]. Their ability to continuously adjust flight paths and altitudes allows for targeted measurements at designated points and elevations.
Multi-agent heterogeneous strategies integrate the merits of vehicles and drones [50], or deploy swarms with complementary sensing capabilities. By combining the broad coverage of ground vehicles with the aerial agility of unmanned aircraft, these approaches establish robust and accurate environmental monitoring networks.
7.2 Atmospheric Contaminant Mapping
Techniques for reconstructing air pollution visualizations can be broadly classified as data-driven or physics-based. Physics-driven approaches rely on well-established air dispersion principles to address atmospheric contamination estimation and source identification, typically on large spatio-temporal scales. These problems are often regarded as subfields of environmental research [14].
Access to extensive datasets has driven the emergence of data-driven methodologies. Machine learning and DL techniques are now widely applied to predict air pollution propagation. For instance, Xu et al. [51] proposed a co-training framework for fine-grained PM2.5 concentration based on multisource data, while Bakirci [52] proposed enhancing air pollution mapping using autonomous unmanned aerial vehicle networks for improved coverage and consistency. Rahman et al. [53] presented th predictive machine learning model for air quality forecasting using a web interface. Rao et al. [54] presented the air quality assessment in Beijing based on cloud model. SPR-based techniques are now regarded as frontier approaches in this research area. Amid the surge in DL, models such as ConvLSTM [55], graph neural network and Transformer-based model [56], and attention-enhanced LSTM [57] have been widely adopted. Further studies have explored improving reconstruction performance by selecting representative mobile sensors. The design of the Phy-APMR framework was largely inspired by Chen et al. [40]’s HMSS, a synergistic model combining physics-driven and empirical modeling strategies.
PINNs, first introduced by Raissi et al. [11], provided an effective approach for handling PDEs and other dynamical problems. They are particularly valuable in modern machine learning due to their ability to simultaneously address both forward and inverse problems [58]. Since their introduction, PINNs have proven effective in a range of disciplines, including viscosity, thermodynamic modeling, and intelligent transportation. Zou et al. [59] demonstrated that model misspecification significantly affects the accuracy of PINNs. Cuomo et al. [60] investigated scientific machine learning through PINN. Subsequent studies have extended PINNs to stochastic differential equations [61] and time-dependent stochastic PDEs [62]. To facilitate their adoption, several toolkits have been developed, including DeepXDE [37] and NVIDIA’s SimNet [63].
7.4 Synergies of Phy-APMR with the Built Environment and Ecology
The Phy-APMR approach has the potential to significantly advance current developments in environmental and structural research in two key areas. First, much of the existing work in this field focuses on data-driven predictive algorithms such as LSTMs [55–57], transformers [64], and deep NNs [65] for forecasting air pollutants. By embedding air dispersion physics into DL models, the proposed system mitigates data scarcity, a common limitation of data-intensive forecasting methods. Second, IoT-enabled sensing technologies are increasingly employed for monitoring both indoor and outdoor environments [66,67]. A persistent challenge in this context is distinguishing valuable observations from redundant data, particularly given the periodicity and unpredictability of sensor mobility. The ASUS sampling strategy proposed in this study offers an effective means of identifying informative samples. Guided by the spatial distribution of empirical data, it also enables optimized sensor deployment to enhance forecasting performance.
In this study, we proposed the Phy-APMR framework to advance urban environmental monitoring and support the development of healthy buildings. By embedding PDE-based physical knowledge of air pollutant propagation, we designed a tailored PINN structure capable of fine-grained APMR. To accelerate model convergence and enhance training efficiency, the ASUS algorithm was integrated into the framework. Experiments conducted across three Pakistani cities demonstrated that, with ASUS optimization, Phy-APMR achieves training speeds sufficient for real-time, fine-grained APMR.
To evaluate the broader applicability of the framework, we also implemented a novel interpretable PINN model for joint estimation of daily O3, PM2.5, and PM10 concentrations across Pakistan. The experimental split protocol is now explicitly described in the Section 5.1.3, specifying the split strategy and hold-out design to ensure no data leakage, and the reported results fully reflect this protocol. Comprehensive evaluation showed strong spatiotemporal robustness, with sample-based CV
Looking forward, future research can enhance Phy-APMR’s reliability and adaptability through dynamic diffusion control. Two potential strategies include:
1. Adaptively updating the diffusion coefficient
2. Treating
In addition, exploring the highest-quality precision settings for different urban contexts could lead to further technical refinements and broader applicability of the framework.
Overall, the proposed Phy-APMR framework demonstrates that embedding physical knowledge, enhancing inter-pollutant interactions, and applying interpretable DL techniques can significantly improve the accuracy, efficiency, and interpretability of urban air pollutant monitoring. These advancements provide a robust foundation for real-time monitoring and informed decision-making to mitigate urban air pollution effectively.
Acknowledgement: Not applicable.
Funding Statement: This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2604).
Author Contributions: Taher Alzahrani: Conceptualization, methodology, validation, writing—review and editing, funding acquisition, Saima Rashid: Formal analysis, investigation, data curation, project administration, supervision, writing—original draft preparation. All authors reviewed and approved the final version of the manuscript.
Availability of Data and Materials: Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
Ethics Approval: Not applicable.
Conflicts of Interest: The authors declare no conflicts of interest.
Nomenclature
| Phy-APMR | Physics-air pollution map reconstruction |
| WHO | World Health Organization |
| NNs | Neural networks |
| PINNs | Physics-informed NNs |
| CNN | Convolutional NNs |
| DL | Deep learning |
| ASUS | Adaptive short-time update sampling |
| PDEs | Partial differential equations |
| Pak-EPA | Pakistan Environmental Protection Agency |
| TROPOMI | Tropospheric Monitoring Instrument |
| MODIS | Moderate Resolution Imaging Spectroradiometer |
| L3 | Daily Level 3 |
| GEE | Google Earth Engine |
| AOD | Aerosol Optical Depth |
| MAIAC | Multi-Angle Implementation of Atmospheric Correction |
| ECMWF | European Centre for Medium-Range Weather Forecasts |
| BLH | Boundary layer height |
| RH | Relative humidity |
| SP | Surface pressure |
| SSR | Surface net solar radiation |
| STR | Surface net thermal radiation |
| IoT | Internet of Things |
| UVB | Downward UV radiation at the surface |
| T2M | 2-m temperature |
| U10 | 10-m U wind |
| V10 | 10-m V wind |
| GO3 | Surface O3 mass mixing ratio |
| CAMS | Copernicus Atmosphere Monitoring Service |
| FE | Feature encoder |
| LeakyReLU | Leaky Rectified Linear Unit |
| SA | Self-attention |
| NOx | Nitrogen oxides |
| VOCs | Volatile organic compounds |
| SO2 | Sulfur dioxide |
| CO | Carbon monoxide |
| OH | Hydroxyl radicals |
| MSE | Mean squared error |
| Phy-APMR-WS | Phy-APMR wind speed |
| RAR | Residual-based adaptive refinement |
| CV | Cross-validation |
| Coefficient of determination | |
| RMSE | Root mean squared error |
| MAE | Mean absolute error |
| MAPE | Mean Absolute Percentage Error |
| PSA | Parametric spline approximation |
| PIDT | Physics-informed difference technique |
| FFNN | Feedforward NN |
| SPR | Stochastic process regression |
| HMSS | Hybrid model-enabled sensing system |
| Model_base | Baseline model |
| Model_wa | Attention module |
| Model_wi | Interaction module |
References
1. Sigsgaard T, Hoffmann B. Assessing the health burden from air pollution. Science. 2024;384(6691):33–4. doi:10.1126/science.abo3801. [Google Scholar] [PubMed] [CrossRef]
2. World Health Organization (WHO). Ambient (outdoor) air quality and health; 2018. [cited 2025 Jul 26]. Available from: http://www.who.int/news-room/fact-sheets/detail/ambient-(outdoor)-air-quality-and-health. [Google Scholar]
3. Relvas H, Lopes D, Armengol JM. Empowering communities: advancements in air quality monitoring and citizen engagement. Urban Clim. 2025;60(1):102344. doi:10.1016/j.uclim.2025.102344. [Google Scholar] [CrossRef]
4. Morales-García J, Ramos-Sorroche E, Balderas-Díaz S, Guerrero-Contreras G, Muñoz A, Santa J, et al. Reducing pollution health impact with air quality prediction assisted by mobility data. IEEE J Biomed Health Inform. 2025;29(12):9210–20. doi:10.1109/JBHI.2024.3508466. [Google Scholar] [PubMed] [CrossRef]
5. Bhardwaj A, Balashankar A, Iyer S, Soans N, Sudarshan A, Pande R, et al. Comprehensive monitoring of air pollution hotspots using sparse sensor networks. ACM J Comput Sustain Soc. 2025;3(4):1–36. doi:10.1145/3748821. [Google Scholar] [CrossRef]
6. Birenboim A, Helbich M, Kwan M-P. Advances in portable sensing for urban environments: understanding cities from a mobility perspective. Comput Environ Urban Syst. 2021;88(12):101650. doi:10.1016/j.compenvurbsys.2021.101650. [Google Scholar] [CrossRef]
7. Hasenfratz D, Saukh O, Walser C, Hueglin C, Fierz M, Arn T, et al. Deriving high-resolution urban air pollution maps using mobile sensor nodes. Pervasive Mob Comput. 2015;16:268–85. doi:10.1016/j.pmcj.2014.11.008. [Google Scholar] [CrossRef]
8. Pleim J, Ran L. Surface flux modeling for air quality applications. Atmosphere. 2011;2(3):271–302. doi:10.3390/atmos2030271. [Google Scholar] [CrossRef]
9. Zhang H, Linford JC, Sandu A, Sander R. Chemical mechanism solvers in air quality models. Atmosphere. 2011;2(3):510–32. doi:10.3390/atmos2030510. [Google Scholar] [CrossRef]
10. Arshi O, Mondal S. Advancements in sensors and actuators technologies for smart cities: a comprehensive review. Smart Constr Sustain Cities. 2023;1(1):18. doi:10.1007/s44268-023-00022-2. [Google Scholar] [CrossRef]
11. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys. 2019;378:686–707. doi:10.1016/j.jcp.2018.10.045. [Google Scholar] [CrossRef]
12. Karniadakis GE, Kevrekidis IG, Lu L, Perdikaris P, Wang S, Yang L. Physics-informed machine learning. Nat Rev Phys. 2021;3(6):422–40. doi:10.1038/s42254-021-00314-5. [Google Scholar] [CrossRef]
13. Li B, Hu Q, Gao M, Liu T, Zhang C, Liu C. Physics-informed neural network improving the WRF-CHEM results of air pollution using satellite-based remote sensing data. Atmos Environ. 2023;311:120031. doi:10.1016/j.atmosenv.2023.120031. [Google Scholar] [CrossRef]
14. de Villeroché A, Le Guen V, Mouradi RS, Massin P, Bocquet M, Farchi A, et al. Physics-informed neural networks for atmospheric flow modeling of pollutant dispersion in industrial sites. Air Qual Atmos Health. 2026;19(3):38. doi:10.1007/s11869-026-01934-5. [Google Scholar] [CrossRef]
15. Bagkis E, Hassani A, Schneider P, DeSouza P, Shetty S, Kassandros T, et al. Evolving trends in application of low-cost air quality sensor networks: challenges and future directions. npj Clim Atmos Sci. 2025;8(1):335. doi:10.1038/s41612-025-01216-4. [Google Scholar] [CrossRef]
16. Zoest V, Osei FB, Stein A, Hoek G. Calibration of low-cost NO2 sensors in an urban air quality network. Atmos Environ. 2019;210:66–75. doi:10.1016/j.atmosenv.2019.04.048; [Google Scholar] [CrossRef]
17. Lawrence S, Bhathmanabhan S. Harnessing deep learning for air pollution forecasting: trends, techniques, and future prospects. Artif Intell Rev. 2026;59(3):104. doi:10.1007/s10462-026-11496-8. [Google Scholar] [CrossRef]
18. Li T, Yang Q, Wang Y, Wu J. Joint estimation of PM2.5 and O3 over China using a knowledge-informed neural network. Geosci Front. 2023;14(2):101499. doi:10.1016/j.gsf.2022.101499. [Google Scholar] [CrossRef]
19. Yang H, Yang R, Ding L, Du S, Li M, Zhang B. Enhanced air pollution spatiotemporal forecast model using frequency domain convolution and attention mechanism. Eng Appl Artif Intell. 2025;162(2):112538. doi:10.1016/j.engappai.2025.112538. [Google Scholar] [CrossRef]
20. Shi R, Luo J, Zhou N, Liu Y, Hong C, Zhang X-P, et al. Phy-APMR: a physics-informed air pollution map reconstruction approach with mobile crowd-sensing for fine-grained measurement. Build Environ. 2025;272(2):112634. doi:10.1016/j.buildenv.2025.112634. [Google Scholar] [CrossRef]
21. Zhou N, Hong WJ, Wang HD, Zheng JF, Wang QH, Song YL, et al. STeP-Diff: spatio-temporal physics-informed diffusion models for mobile fine-grained pollution forecasting. arXiv:2512.04385. 2025. [Google Scholar]
22. Perry N, Pedersen PP, Christensen CN, Nussli M, Heinonen S, Dagallier LG, et al. Detecting urban PM2.5 hotspots with mobile sensing and Gaussian process regression. npj Clean Air. 2025;1–38. doi:10.1038/s44407-025-00038-1. [Google Scholar] [CrossRef]
23. Guo Q, He Z, Li S, Li X, Meng J, Hou Z, et al. Air pollution forecasting using artificial and wavelet neural networks with meteorological conditions. Aerosol Air Qual Res. 2020;20(6):1429–39. doi:10.4209/aaqr.2020.03.0097. [Google Scholar] [CrossRef]
24. Chen G, Chen S, Li D, Chen C. A hybrid deep learning air pollution prediction approach based on neighborhood selection and spatio-temporal attention. Sci Rep. 2025;15(1):3685. doi:10.1038/s41598-025-88086-1. [Google Scholar] [PubMed] [CrossRef]
25. Khan S, Munawar S, Miandad M, Bajwa N, Basharat M, Chohan A. Spatiotemporal variations of key air pollutants (O3, NO2, CO) in major urban centres of Pakistan using Sentinel-5P satellite data. Environ Qual. 2026. doi:10.12775/EQ.2026.012. [Google Scholar] [CrossRef]
26. Bilal M, Mhawish A, Nichol JE, Qiu Z, Nazeer M, Ali MA, et al. Air pollution scenario over Pakistan: characterization and ranking of extremely polluted cities using long-term concentrations of aerosols and trace gases. Remote Sens Environ. 2021;264:112617. doi:10.1016/j.rse.2021.112617. [Google Scholar] [CrossRef]
27. Liu T, Chen G, Lin Z, Deng Y, Chen J. Synergistic contributions of typical photochemical reactive species to ozone pollution in the southeastern coastal city of China. J Hazard Mater. 2025;497(10):139722. doi:10.1016/j.jhazmat.2025.139722. [Google Scholar] [PubMed] [CrossRef]
28. Souza AD, Oliveira-Júnior JFD, Cardoso KRA, Fernandes WA, Pavao HG. The impact of meteorological variables on particulate matter concentrations. Atmos. 2025;16(7):875. doi:10.3390/atmos16070875. [Google Scholar] [CrossRef]
29. Ma J, Chen Z, Wang J, Wang Y, Li L. Diffusion simulation, health risks, ozone and secondary organic aerosol formation potential of gaseous pollutants from rural comprehensive waste treatment plant. Chemosphere. 2022;286:131857. doi:10.1016/j.chemosphere.2021.131857. [Google Scholar] [PubMed] [CrossRef]
30. Jamil MS, Jamil MA, Mazhar A, Ikram A, Ahmed A, Munawar U. Smart environment monitoring system by employing wireless sensor networks on vehicles for pollution free smart cities. Procedia Eng. 2015;107(2):480–4. doi:10.1016/j.proeng.2015.06.106. [Google Scholar] [CrossRef]
31. Maas AL, Hannun AY, Ng AY. Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th International Conference on Machine Learning (ICML); 2013 Jun 16–21; Atlanta, GA, USA. [Google Scholar]
32. Mohyuddin S, Alam K, Zeb B, Khokhar MF, Mir KA, Wexler AS, et al. Characterization and source identification of PM2.5 during intense haze episodes in an urban environment of Lahore. Atmos Environ X. 2024;23(5):100276. doi:10.1016/j.aeaoa.2024.100276. [Google Scholar] [CrossRef]
33. Yu X, Wong MS, Lee K-H. Attention mechanism augmented random forest model for multiple air pollutants estimation. Int J Appl Earth Obs Geoinf. 2025;141(5):104661. doi:10.1016/j.jag.2025.104661. [Google Scholar] [CrossRef]
34. Yang Q, Yuan Q, Gao M, Li T. A new perspective to satellite-based retrieval of ground-level air pollution: simultaneous estimation of multiple pollutants based on physics-informed multi-task learning. Sci Total Environ. 2023;857(3):159542. doi:10.1016/j.scitotenv.2022.159542. [Google Scholar] [PubMed] [CrossRef]
35. Stojičić S, Cisar P, Radovanović R, Petrović N, Blagojević M, Srećković M, et al. One aspect of environmental engineering–modelling the air pollution effects using advection-diffusion. Acta Polytech Hung. 2024;2. doi:10.12700/aph.21.11.2024.11.15. [Google Scholar] [CrossRef]
36. Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv:1412.6980. 2014. [Google Scholar]
37. Lu L, Meng X, Mao Z, Karniadakis GE. DeepXDE: a deep learning library for solving differential equations. SIAM Rev. 2021;63(1):208–28. doi:10.1137/19M1274067. [Google Scholar] [CrossRef]
38. Cheng Y, Li X, Li Z, Jiang S, Li Y, Jia J, et al. AirCloud: a cloud-based air-quality monitorisng system for everyone. In: Proceedings of the 12th ACM Conference on Embedded Network Sensor Systems (SenSys); 2014 Nov 3–6; Memphis, Tennessee. p. 251–65. doi:10.1145/2668332.2668346. [Google Scholar] [CrossRef]
39. Liu N, Liu X, Lin P-T, Wang Y, Zhang L. Enhanced air quality inference via multi-view learning with mobile sensing memory. IEEE Access. 2022;10:36616–28. doi:10.1109/ACCESS.2022.3164506. [Google Scholar] [CrossRef]
40. Chen X, Xu S, Liu X, Xu X, Noh HY, Zhang L, et al. Adaptive hybrid model-enabled sensing system (HMSS) for mobile fine-grained air pollution estimation. IEEE Trans Mob Comput. 2022;21(6):1927–44. doi:10.1109/TMC.2020.3034270. [Google Scholar] [CrossRef]
41. Dovrou E, Bates KH, Moch JM, Mickley LJ, Jacob DJ, Keutsch FN. Catalytic role of formaldehyde in particulate matter formation. Proc Natl Acad Sci U S A. 2022;119(6):e2113265119. doi:10.1073/pnas.2113265119. [Google Scholar] [PubMed] [CrossRef]
42. Jiang K, Xing R, Luo Z, Huang W, Yi F, Men Y, et al. Pollutant emissions from biomass burning: a review on emission characteristics, environmental impacts, and research perspectives. Particuology. 2024;85(20):296–309. doi:10.1016/j.partic.2023.07.012. [Google Scholar] [CrossRef]
43. Su Z, Yang L, Chen Y, Ni R, Wang W, Hu H, et al. Analysis of synergistic changes in PM2.5 and O3 concentrations based on structural equation model study. Atmos. 2024;15(11):1374. doi:10.3390/atmos15111374. [Google Scholar] [CrossRef]
44. Gowri L, Sriya NS, Sowmya M, Lakshmi ARH, Amirtharajan R. Explainable AI for urban air quality: SHAP interpretation of stacked ensemble AQI forecast. Theor Appl Climatol. 2025;156(10):533. doi:10.1007/s00704-025-05741-3. [Google Scholar] [CrossRef]
45. Temkov S, Cavkovski P, Lameski P, Zdravevski E, Herzog MA, Trajkovik V. Air pollution data: a dataset gathered through a crowd sensing platform. Data Brief. 2025;61(3):111683. doi:10.1016/j.dib.2025.111683. [Google Scholar] [PubMed] [CrossRef]
46. Vahdat-Nejad H, Asef M. Architecture design of the air pollution mapping system by mobile crowd sensing. IET Wirel Sensor Syst. 2018;8(6):268–75. doi:10.1049/iet-wss.2018.5130. [Google Scholar] [CrossRef]
47. Tam BCM, Tang S, Cardoso A. A Multimodal approach to improve the prediction capabilities of deep learning models for multivariate time series—a case of subtropical urban air quality. Environ Eng Sci. 2026;43(3):97–109. doi:10.1177/15579018251410731. [Google Scholar] [CrossRef]
48. Bajwa AU, Sheikh HA. Contribution of road transport to pakistan’s air pollution in the urban environment. Air. 2023;1(4):237–57. doi:10.3390/air1040018. [Google Scholar] [CrossRef]
49. Afshar-Mohajer N, Wu C-Y. Use of a drone-based sensor as a field-ready technique for short-term concentration mapping of air pollutants: a modeling study. Atmos Environ. 2023;294(2):119476. doi:10.1016/j.atmosenv.2022.119476. [Google Scholar] [CrossRef]
50. Mokhtari I, Bechkit W, Assenine MS, Rivano H. Navigating the Smog: a cooperative multi-agent RL for accurate air pollution mapping through data assimilation. arXiv:2407.12539. 2024. [Google Scholar]
51. Xu X, Tong T, Zhang W, Meng L. Fine-grained prediction of PM2.5 concentration based on multisource data and deep learning. Atmos Pollut Res. 2020;11(10):1728–37. doi:10.1016/j.apr.2020.06.032. [Google Scholar] [CrossRef]
52. Bakirci M. Enhancing air pollution mapping with autonomous UAV networks for extended coverage and consistency. Atmos Res. 2024;306(6):107480. doi:10.1016/j.atmosres.2024.107480. [Google Scholar] [CrossRef]
53. Rahman MM, Nayeem MEH, Ahmed MS, Tanha KA, Sakib MSA. AirNet: predictive machine learning model for air quality forecasting using a web interface. Environ Syst Res. 2024;13(1):12. doi:10.1186/s40068-024-00378-z. [Google Scholar] [CrossRef]
54. Rao W, Li J, Guo H. Air quality assessment in Beijing based on cloud model. Sci Rep. 2025;15(1):21994. doi:10.1038/s41598-025-05751-1. [Google Scholar] [PubMed] [CrossRef]
55. Ram RS, Venkatachalam K, Masud M, Abouhawwash M. Air pollution prediction using dual graph convolution LSTM technique. Intell Autom Soft Comput. 2022;33(3):1639–52. doi:10.32604/iasc.2022.023962. [Google Scholar] [CrossRef]
56. Ye Y, Cao Y, Dong Y, Yan H. A graph neural network and Transformer-based model for PM2.5 prediction through spatiotemporal correlation. Environ Model Softw. 2025;191(23):106501. doi:10.1016/j.envsoft.2025.106501. [Google Scholar] [CrossRef]
57. Shakir M, Kumaran U, Rakesh N. Enhanced temporal attention-based LSTM model for air quality forecasting. Intl J Info Technol. 2025. doi:10.1007/s41870-025-02668-w. [Google Scholar] [CrossRef]
58. Cai S, Wang Z, Wang S, Perdikaris P, Karniadakis GE. Physics-informed neural networks for heat transfer problems. J Heat Transf. 2021;143(6):061003. doi:10.1115/1.4049441. [Google Scholar] [CrossRef]
59. Zou Z, Meng X, Karniadakis GE. Correcting model misspecification in physics-informed neural networks (PINNs). J Comput Phys. 2024;505(23):112918. doi:10.1016/j.jcp.2024.112918. [Google Scholar] [CrossRef]
60. Cuomo S, Di Cola VS, Giampaolo F, Rozza G, Raissi M, Piccialli F. Scientific machine learning through physics-informed neural networks: where we are and what’s next. J Sci Comput. 2022;92(3):88. doi:10.1007/s10915-022-01939-z. [Google Scholar] [CrossRef]
61. Yang L, Zhang D, Karniadakis GE. Physics-informed generative adversarial networks for stochastic differential equations. SIAM J Sci Comput. 2020;42(1):459–84. doi:10.1137/18M122540. [Google Scholar] [CrossRef]
62. Zhang D, Guo L, Karniadakis GE. Learning in modal space: solving time-dependent stochastic PDEs using physics-informed neural networks. arXiv:1905.01205. 2019. [Google Scholar]
63. Hennigh O, Narasimhan S, Nabian MA, Subramaniam A, Tangsali K, Rietmann M, et al. NVIDIA SimNetTM: an AI-accelerated multi-physics simulation framework. In: International Conference on Computational Science (ICCS); 2021 Jun 16–18; Krakow, Poland. p. 1–12. [Google Scholar]
64. Charco JL, Espín-Riofrio C, Yanza-Montalván A, Cruz-Chóez A, Quevedo-Sacoto A. Forecasting PM2.5 concentrations in ambient air using a transformer-based neural network. In: Information and communication technologies. Cham, Switzerland: Springer; 2023. p. 1–12. doi:10.1007/978-3-031-45438-7_10. [Google Scholar] [CrossRef]
65. Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing. 2003;50:159–75. doi:10.1016/s0925-2312(01)00702-0. [Google Scholar] [CrossRef]
66. Dhingra S, Madda RB, Gandomi AH, Patan R, Daneshmand M. Internet of Things mobile-air pollution monitoring system (IoTMobair). IEEE Internet Things J. 2019;6(3):5577–84. doi:10.1109/JIOT.2019.2903821. [Google Scholar] [CrossRef]
67. Ali S, Glass T, Parr B, Potgieter J, Alam F. Low-cost sensor with IoT LoRaWAN connectivity and machine learning-based calibration for air pollution monitoring. IEEE Trans Instrum Meas. 2021;70:5500511. doi:10.1109/TIM.2020.3034109. [Google Scholar] [CrossRef]
Cite This Article
Copyright © 2026 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools