Open Access
ARTICLE
Computationally Efficient Gradient-Aware Hyperspectral Image Denoising Using Center-Difference Convolutional Networks
1 Department of Computer Science and Information Technology, University of Kamalia, Kamalia, Pakistan
2 Department of Communication and Cyber Security, Bahuddin Zakariya University, Multan, Pakistan
3 Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
4 Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia
5 Department of Computer Science and Engineering, University of Hafr Al-Batin, Hafar Al-Batin, Riyadh, Saudi Arabia
6 Department of Computer Science & Information Technology, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
7 Department of Computer Science and Engineering, Soonchunhyang University, Asan, Republic of Korea
* Corresponding Authors: Muhammad Umer. Email: ; Yongwon Cho. Email:
(This article belongs to the Special Issue: Emerging Artificial Intelligence Technologies and Applications-II)
Computer Modeling in Engineering & Sciences 2026, 147(3), 41 https://doi.org/10.32604/cmes.2026.078738
Received 07 January 2026; Accepted 27 April 2026; Issue published 30 June 2026
Abstract
Hyperspectral image (HSI) denoising is a crucial preprocessing step that significantly enhances the performance of downstream applications, such as object detection and classification. Whereas deep neural networks have achieved remarkable performance in HSI denoising, many existing models rely mostly on vanilla convolutions, which often fail to capture fine-grained noise patterns and structural details in real-time HSIs. To address these limitations, we propose a novel Center-Difference Convolutional Network (CDCN) designed to effectively suppress various noise types while preserving the inherent structure of HSIs. By leveraging center-difference convolution (CDC), our model captures both gradient and intensity information in the spatial domain, enabling better discrimination of subtle noise characteristics. The CDCN architecture processes 3D HSI cubes through separable 3D convolutions, efficiently extracting spatial-spectral features with minimal computational overhead. Additionally, a spatial-spectral attention mechanism is integrated to further refine feature representation. We evaluate the proposed method on one simulated dataset (Kennedy Space Center) and two real-world datasets (Pavia Center and Houston-2018). Experimental results demonstrate that CDCN consistently outperforms existing state-of-the-art approaches, achieving superior denoising performance while maintaining spectral-spatial information. Ablation studies also validate the effectiveness of CDC and attention mechanisms in enhancing denoising capability over standard convolutional baselines.Keywords
Hyperspectral images (HSIs) contain high spectral resolution features, widely employed in various remote sensing applications, including object recognition, unmixing, and classification [1,2]. Due to photon effects, sensor malfunctions, and atmospheric impacts, HSIs often exhibit noise, including Gaussian noise, stripe noise, random noise, and dead pixels [3–5]. Such noises can dramatically affect the interpretation of information. Hence, it is crucial to implement a pre-processing denoising step before analyzing HSI.
In recent decades, various techniques have been implemented to reduce noise from HSI. Most methods were initially designed for red-green-blue (RGB) or grayscale images, ignoring the spectral dimension, relying upon band-wise modeling, such as nonlocal self-similarity (NLSS) methods, block matching 3D filtering [6], and weighted nuclear norm [7] (WNNM). These techniques treat each band like a 2D image, thus distorting the spectral information. Although many researchers have recently utilized combined spatial-spectral information to remove noise from HSI, these optimization-based methods require tuning parameters for each HSI [8], which makes the denoising operation time-consuming and limits its applicability in real-world operating environments. Besides, these strategies could produce spectral distortion in complex scenes [9,10]. Consequently, research is focusing on models that can handle different kinds of HS data and perform well in complex scenarios. HSI-DeNet [11] demonstrated the effectiveness of convolutional networks for learning spatial-spectral features, while residual CNN-based methods further improved representational capability [12]. A trainable sparse coding model was introduced in [13], and deep spatial-spectral representation learning was explored in [14]. Global reasoning-based networks [15] and hybrid noise modeling approaches [16] enhanced performance under complex noise conditions. In addition, quasi-recurrent architectures have shown strong capability in modeling spectral dependencies [17]. Most of these methods process the spatial-spectral information of HSIs using different technologies, such as 2D convolutions [11], 3D convolutions (i.e., convolutions performed simultaneously on both spatial and spectral dimensions), and hybrid models with both 2D and 3D filters [16]. However, the HSI datasets are ecologically sensitive, resulting in fine-grained and complex noise. Although vanilla convolutional neural network (CNN)-based approaches can provide deep semantic features, the related features cannot generally provide fine-grained information, and therefore, they are less sensitive to HSI noise.
To overcome the limitation of vanilla convolution for HSI denoising, we propose a CDC-based architecture. The CDC exploits gradient features through a center difference strategy, enabling a sensitive awareness of noise, which mitigates the limitation of vanilla convolution (which is unable to extract fine-grain noise). More specifically, a novel CDC network (CDCN) is proposed to exploit spatial-spectral information for HSI denoising, which covers the fine-grain information and extracts intrinsic information (e.g., noise) more effectively than a vanilla CNN in complex environments.
The contribution of this paper is three-fold:
1. To accurately capture noise patterns, the CDCN is proposed, which completely learns the mapping between the noisy image and the clean image using the CDC strategy. CDCN can achieve better noise removal than vanilla convolution by simultaneously extracting the gradient and intensity features in the CDC. Focusing more attention on the center of the image, the CDC also contributes to denoising performance because the center contains more information.
2. To improve the feature extraction capabilities of the CDCN, separable convolutions are utilized to extract spatial and spectral features so that it can leverage on both spectral and spatial features for denoising. Based on separable convolutions, a feature learning module is designed that concurrently learns spatial and spectral features.
3. To explore the most relevant features for denoising, a spatial-spectral attention module is adopted in the CDCN.
The remainder of the paper is organized as follows. Section 2 describes the related works for HSI denoising. Section 3 outlines the proposed methodology. Section 4 presents a preliminary evaluation of the proposed methodology, while the complete experimental results of the proposed and baseline models are discussed in Section 5. In Section 6, a complete discussion is being made why the proposed method performs better than the rest of the models, and finally conclusions with future work directions are made in Section 7.
To solve the HSI denoising problem, various methods have been proposed, which can be broadly categorized into two classes: classical and deep learning methods.
Classic methods can be categorized into spatial and transform domain methods.
These approaches remove noise in the spatial domain, better preserving spatial and spectral information. These methods generally work on reasonable priors or assumptions. For spatial-domain approaches, the noisy HSI is mapped to a clean one. For instance, non-local [18–20] total variation [21], low-rank models [22–25], and sparse representation [26,27] techniques belong to spatial-domain methods. Yuan et al. [21] presented a spatial-spectral adaptive total variation method to minimize the noise from HS images. Chen et al. [9] proposed block-matching 4D filtering (BM4D) for HSI denoising. Similarly, a sparse-based representation technique has been proposed by Li et al. [27] to reduce noise in HSIs, utilizing both inter-band and intra-band structures in a spatial-spectral distributed sparse representation strategy. The low-rank tensor approximation model (LRTAM) proposed by Renard et al. [28] is also a typical spatial-domain method. LRTAM performs low-rank approximation on the spatial dimension after spectral dimensionality reduction. Similarly, Zhang et al. [23] proposed a low-rank matrix recovery (LRMR) algorithm for HSI denoising. The spatial-spectral total-variation regularized low-rank tensor factorization (SSTV-LRTF) has been introduced by Fan et al. [29] to minimize the mixture of noises from HSI. Xie and Li [10] introduced a non-convex low-ranked regularizer known as weighted-Schatten p-norm. Zhao and Yang [30] introduced a sparse spectral domain coding to capture local and global redundancy and correlation (RAC). Latexier and Bourennane [31] exploited the multidimensional Wiener filter (MWF) for HSI denoising.
2.1.2 Transform-Domain Methods
Transform-domain methods separate the clean data from the noisy ones in the transformed domain. Different transformation techniques, such as principal component analysis (PCA), the wavelet transform, and the Fourier transform, are used to transform the data into another space, where clean data are separated from noise. For example, a hybrid denoising model for spatial and spectral domains (HSSNR) has been proposed by Othman and Qian [32] utilizing a wavelet shrinkage technique for noise reduction. Atkinson et al. [33] introduced an estimation-based strategy to get the HSI in the noisy observation using the discrete Fourier transform (DFT) and wavelets to overcome the shortcomings of the Wiener filter. A method based on the first-order roughness penalty (FORP) has been presented by Rasti et al. [34]. These techniques have several primary functions, including feature representation of the noisy HSI and its mapping to the clean one. However, parameters in these methods have to be tuned for each HSI to achieve good efficiency of the underlying models. Besides, these approaches show sensitivity and instability for several bands of HSIs. Ashraf et al. [35] proposed MAOTformer to remove the noise from HSI.
However, both spatial and transform-domain methods have some limitations. Spatial domain methods may struggle to consider variations in the geometrical structure and relationships, causing a performance reduction for HSI denoising. Instead, transform-domain methods often require a deep tuning of parameters, even exhibiting sensitivity to some specific HS bands. Moreover, classical methods are limited by the engineering design in the feature extraction phase and usually work thanks to the design of some priors and/or assumptions that may be unsuitable in certain cases.
2.2 Deep Learning-Based Methods
Deep learning-based methods have recently demonstrated their effectiveness for various tasks, such as HSI classification [36,37], caption generation [38], and sharpening [39,40]. For image denoising, early works [41] exploited CNNs for feature extraction. The performance of these methods is considered sufficient for natural images. However, there is still room for improvement in the case of HSI (thanks to the proper consideration of the crucial spectral dimension of HSIs).
Recent research has exploited spatial and spectral information. The well-known 3D-DnCNN [41] has been utilized for HSI denoising, which enables consideration of three adjacent bands simultaneously with a 3D CNN. Although this method produces good results by retaining the spatial-spectral correlation among adjacent bands, the high number of HS spectral bands represents a significant limitation for this approach. Maffei et al. [42] claimed that traditional frameworks could not handle the high correlations among adjacent bands in a high spectral dimensionality. As a result, low-quality denoising is performed for HSIs. Leveraging on rich information in the spectral dimension, Yuan et al. [12] proposed a residual technique for HSIs to remove noise, where spatial and spectral information is simultaneously fed to the network. Moreover, multilevel representation and multiscale features are used for the final restoration. Dong et al. [14] introduced a spatial-spectral strategy to minimize the noise from HSIs. They introduced a modified 3D U-Net with separable filters, making the method computationally efficient. Zhang et al. [16] presented a gradient network to remove the hybrid noise from spatial-spectral pixels of HSIs. Furthermore, Cao et al. [15] introduced a deep spatial-spectral global reasoning framework for HSI denoising.
To explore long-range dependency for HSI denoising, many attention mechanisms have been integrated into HSI denoising networks. Kan et al. [43] addressed the high-frequency features of HSIs. They claimed that such features have more noise and introduced an attention-based octave network to reduce the noise from such features of HSIs. Wang et al. [44] presented a cross-attention-based network to minimize the noise from HSIs. This network used the attention module with a group of convolutions to extract the features by focusing on the most relevant bands. A CNN and a transformer-based strategy have been introduced by Gong et al. [45] to remove the noise in HSIs. Murugesan et al. [46] introduced an attention-based U-Net to extract the noise features. A mixed attention network for HSI denoising is proposed by Lai and Fu [47], who utilized a multi-head recurrent spectral attention mechanism to integrate inter-spectral features across all spectral bands. Although different technologies, such as spectral-spatial feature extraction and attention, have been adopted for HSI denoising, almost all the proposed networks are based on vanilla convolution. However, vanilla convolution cannot exploit high-frequency features, thus limiting its capability to extract fine-grained features and complex noise. On the other hand, the proposed network based on CDC is sensitive to high-frequency features by the gradient information, enabling the ability to extract fine-grained features and complex noise. Duan et al. [48] proposed a linear attention Mamba (LaMamba) to remove the noise from the HSI 3D selective scan mechanism, which was designed to obtain the spatial spectral continuous sequences with the help of six bidirectional scan orders. Similarly, Nachimuthu et al. [49] introduced a novel SqueezeNet-based denoising framework that leverages Fire modules for efficient feature extraction with fewer parameters.
The proposed model is designed to reduce noise from HSIs by utilizing spatial and spectral information. We introduce a CDCN inspired by a center difference strategy [50] applied to the vanilla convolutional network for better generalization and representation. CDC is used to provide better results in extracting local structural properties. In contrast to the conventional convolution, CDC adds another term of center-of-differentiate, which directly represents the difference in intensity between the central pixel and the pixel differences. Such an operation improves the description of local gradient information, which is especially useful in differentiating between structural features and noise. Noise in HSI can be difficult to notice since it exists as subtle variations within spatial regions and spectral bands, and even fine spatial structures can easily be destroyed in the process of denoising. CDC focuses on the local intensity differences to enable the network to capture spatial discontinuities like edges and textures and reduce noise elements. Unlike the initial structure of the CDCN, which uses the CDC as a part of a traditional 2D convolutional network, the proposed architecture utilizes the CDC as a redesigned feature extractor together with separable convolution layers, skip connections, and spatial-spectral attention to utilize the accurate spatial-spectral properties of HSI.
In Eq. (1), given a 3D cube,
where
As shown in Fig. 1, our CDCN consists of four parts: 1) spatial-spectral feature extraction module; 2) attention module; 3) CDC-based denoising block module; and 4) fully connection layer. First, the proposed CDCN takes as input the simulated noisy bands which is represented by n-th (for spatial information) and adjacent bands (for spectral information) to obtain spatial and spectral features. Adjacent bands help to enhance the quality of the image and increase the correlation among the different HS bands for subsequent analysis. Afterwards, spatial and spectral features are concatenated and passed to the attention mechanism to enhance the extracted features and to exploit spectral-spatial dependency for HSI denoising. Thus, the attention-enhanced spatial-spectral features are ready for the denoising module, where nine blocks based on CDC are employed to perform the denoising operation. Furthermore, the outputs of all the blocks are concatenated and enhanced again by the attention module. Finally, these enhanced features are passed to the fully connected layer for predicting noise,

Figure 1: Flow diagram of the CDCN architecture for noise removal from HSIs. Concat is the concatenation operator.
3.3 Spatial-Spectral Feature Extraction Module
HS images contain abundant spectral information, which can be used to improve the denoising process. More specifically, HS images have high similarity and correlation in the textural characteristics and surface features. Accordingly, to properly utilize the high correlation, the proposed method takes a band with its adjacent bands as input. The details of spectral bands formulation are shared from Eqs. (2) to (6).
Let the n-th band be denoted as
where
More specifically, as shown in Fig. 2, this module consists of two branches to extract the spectral and spatial features using different convolution kernels. The upper branch of the module uses vanilla 2D convolutions to extract the spatial information of the current band by using the 2D convolution, which is defined as The spatial branch applies 2D convolution defined as
where

Figure 2: Spatial-spectral extraction module. Conv indicates the convolution operation, Concat is the concatenation operator, and K represents the number of adjacent bands.
Similarly, the lower branch extracts the spatial-spectral information from a 3D cube. To reduce the computational complexity of 3D convolution, a separable convolution strategy is utilized to achieve 3D convolution. Besides, separable 3D convolution can better deal with the structural dissimilarity between spectral and inter-spatial features than vanilla 3D convolutions. That is because the vanilla 3D kernels cannot manage different structural information in a proper way. Moreover, the separable 3D convolution can reduce the number of parameters to stabilize the training of the network. In separable 3D convolution, the 1-D kernel extracts the spectral information, while the 2D kernel focuses on the spatial dimension of the HSI. So, for spectral-spatial extraction, separable 3D convolution is adopted. First, spectral convolution is applied as
followed by spatial convolution
where
After that, the extracted spectral and spatial information is then concatenated and fed into the attention mechanism to acquire enhanced features.
An attention module is proposed that contains channel attention and spatial attention to improve the feature representation. Specifically, the channel attention module helps the network emphasize informative spectral channels while suppressing less relevant ones, which is particularly important in hyperspectral imagery where different spectral bands contribute differently to the denoising process. Meanwhile, the spatial attention module focuses on significant spatial regions and improves the representation of important structural details such as edges and textures. These mechanisms enable the network to effectively capture both spectral dependencies and spatial structures, thereby improving the overall denoising performance, which is presented in Fig. 3. The first part learns the channel attention maps,
where

Figure 3: (a) Attention mechanism, (b) illustration of the channel attention module, and (c) illustration of the spatial attention module.
This module combines feature maps concerning the spatial domain by utilizing the average and max-pooling layers. Features are forwarded by the attention module via a shared multi-layer perceptron (MLP), which has only a hidden layer, with 16 input channels. Channel attention maps,
where
In this module, the attention mechanism focuses on the spatial domain. Features maps
where
The center of the image contains important information as claimed by [50]. This phenomenon is particularly remarkable for HSIs. Some bands at the beginning and at the end of the HSI are corrupted by noise, as mentioned in [12,42,52]. Therefore, we adopted a center difference strategy in our denoising module to reduce the noise from the HSI. CDC (center-difference convolution) modifies the standard convolution by incorporating the intensity difference between the central pixel and its neighboring pixels and makes the denoising operation efficient by giving more attention to the important features of the image. Our model removes the noise from HSIs better than vanilla convolution as discussed in Section 4. As far as we know, the proposed architecture is the first one trying to apply the center difference convolution as a denoising model for HSIs.
For an input feature map
where
Convolution operations are performed on the center of the image by setting the kernel size to

Figure 4: Composition of the CDCN block. Concat is the concatenation operator.

Figure 5: Center difference convolution operation. Conv indicates the convolution operation.
The denoising module shown in Fig. 1 consists of several blocks. Each block extracts the noisy features using a U-Net-like structure, as shown in Fig. 4. The downsampling operation is followed by the CDC scheme. The downsampling is performed by the average pooling layer, which reshapes the HSI,
3.6 Multi-Stage Feature Representation for Reconstruction
Different levels of features are indirectly connected to the layers at various depths layers. To effectively exploit these hierarchical features without making the direct attenuation, it is better to combine these several feature maps with the final denoising [53]. Consequently, as given in Fig. 1, features of multiple CDC blocks are concatenated to obtain a multi-stage feature representation in our model.
These multi-stage representations can be considered as multilevel skip connections [54], effectively overcoming the vanishing gradient problem [41]. The concatenation operation is shared in Eq. (11).
where
For reproducibility, we provide explicit architectural specifications of the proposed network. The spatial feature extraction module employs three parallel convolution layers with kernel sizes of
Downsampling is performed using
Multi-level skip connections are introduced between the encoder and decoder stages to preserve spatial information and alleviate the vanishing gradient problem.
Vanilla Convolution: The basic operation in CNN-based architectures is the 2D spatial feature extraction. Here we present a short review of vanilla convolution. It contains two steps for feature extraction. In the first stage, the sampling is performed from the input features map,
where
Center Difference Convolution (CDC): A central difference technique is integrated into the convolution to improve the generalization and representation of the vanilla convolution. The CDC also contains two stages similar to that in the vanilla convolution, which are sampling and aggregation, as shown in Fig. 5. The sampling stage is the same as for the vanilla convolution, whereas the aggregation stage is different. The CDC network gives more attention to the center of the image and aggregates the center-oriented values by a difference strategy. Hence, (12) becomes:
where if
where
4.1 Network Computational Complexity Analysis
Let the input feature map be of size
For a vanilla convolution layer, the computational complexity is
The proposed center-difference convolution (CDC) introduces an additional subtraction operation inside the aggregation stage. However, this operation is performed within the same receptive field and does not introduce extra nested loops over spatial or channel dimensions.
Therefore, the computational complexity of CDC remains
which is the same order as vanilla convolution. The difference lies only in a constant factor due to the additional center-difference computation.
Since our denoising module stacks
Thus, the proposed model maintains the same asymptotic complexity as standard CNN-based architectures while providing improved denoising capability.
Data are normalized before feeding them into the network to make the information more appropriate. HS images have spatial and spectral dimensions, containing redundant spectral information, which cannot be ignored for denoising. Therefore, we train our model in an end-to-end fashion by employing a 3D spatial-spectral cube to solve the noise problem, thus benefiting from the high number of spectral bands. More specifically, we made a spectral-based data cube,
After building the framework for HSI denoising, a loss function has been defined for the whole network. The
where
To test the effectiveness of our CDCN model in removing noise from HSIs, we performed experiments on both simulated and real data. The related images are shown in Fig. 6. The performance of the proposed network is compared with existing solutions, i.e., LRMR [23], DnCNN [41], MemNet [55], DeNet [11], GradNet [56], ENCAM [57], HSIDwRD [58], and MAN [47], SST [59] SERT [60] and SSIT [61]. Qualitative and quantitative experiments have been provided using different quality metrics for performance assessment.

Figure 6: HSIs used in our experiments with pseudo color representation: (a) Kennedy space center shown in pseudo color with (43, 21, 11) bands, (b) Houston-2018 shown in pseudo color with (2, 3, 35) bands, and (c) Pavia center shown in pseudo with (97, 3, 2) bands.
To quantitatively assess the performance, three metrics have been used, i.e., the peak signal-to-noise ratio (PSNR), the structural similarity index measurement (SSIM), and the spectral angle mapper (SAM). These indicators are widely exploited to measure the performance of HSI denoising.
The PSNR metric can be defined as in Eq. (19):
where
Instead, the SSIM index is defined as:
where
Finally, the SAM quality metric is as follows:
where
For the simulated experiments, the Kennedy Space Center (KSC) dataset (that is publicly available) has been used, having a size of
Case 1: In the first case, we added noise of equal intensity for the different spectral bands. For instance, we set


Case 2: We added noise with random intensities
Case 3: In this case, we utilized the Gaussian noise to generate the noisy image for the simulated experiments. Gaussian noise distributions for the different bands have been used in this case.
For simulated data, one model is trained for all the test cases. More specifically, there are three training stages. In the first stage, we added the noise levels
Fig. 7 show the spectral response curves at three different spatial locations, i.e., pixel (30, 30), (50, 50), and (70, 70), under the high noise intensity of

Figure 7: (a) Pixel (30, 30), (b) (50, 50), and (c) (70, 70) denoising curves achieved at
Overall, the outputs achieved from different spatial locations confirm that the proposed method achieves good and consistent denoising performance. It efficiently removes high-intensity noise while preserving spectral information, which is crucial for reliable HSI analysis.
First, we measured the performance of our model according to the first case, where we added the noise with the same intensity to the different bands. The experimental results of the first case of

Figure 8: Denoised images for the KSC dataset under noise level

Figure 9: Close-ups for the denoised images for the KSC dataset under noise level

Figure 10: (a) PSNR, (b) SSIM, and (c) SAM values at noise level
Table 2 and Fig. 11 present the results about the second case, where we performed the experiments using

Figure 11: Denoised images for the KSC dataset for Case 2: (a) Pseudo color of noisy image with (43, 21, 11) bands, (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) proposed CDCN.

Figure 12: Close-ups for the denoised images for the KSC dataset under noise level

Figure 13: (a) PSNR, (b) SSIM, and (c) SAM values at noise level
Table 2 and Fig. 14 report the outcomes of the Gaussian noise (Case 3), where our model shows again superior performance with respect to the benchmark. MAN obtained the lowest quantitative performance, Whereas LRMR, DnCNN, MemNet could not perform well on Gaussian noise. On the other hand DeNet, GradNet, HSIDwRD and SST produced the better results, Similarly ENCAM generated highest SSIM i.e.,

Figure 14: Denoised images for the KSC dataset for Case 3: (a) Pseudo-color noisy image with (43, 21, 11) bands, (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) Proposed CDCN.

Figure 15: Close-ups for the denoised images for the KSC dataset under noise level
For HSI interpretation, the quality of the spectral signatures is essential because they enable us to distinguish the ground objects based on their physical properties. To check the effectiveness of the experimental results, each experiment was repeated three times, and the final results are reported as mean
5.4 Training Convergence Analysis
Fig. 10 shows the training development of various HSI denoising models over epochs with respect to the metrics of PSNR, SSIM and SAM at
It is also possible to note that most of the methods improve their performance at a rapid rate during the initial training epochs and then gradually stabilize as the models approach convergence. It is worth noting that, the proposed CDCN model is always characterized by better PSNR and SSIM values with lower SAM values than the rival approaches. Furthermore, it is observed that the performance curves of CDCN stabilize after a few epochs, which shows that the network is optimized and converges stably. Those observations indicate that the CDCN framework can learn discriminative spectral spatial representations and produce the best denoising results in the course of training.
The training curves on the random noise of different methods are shown in Fig. 13, it can be noted that the deep learning based methods achieve the incremental improvement in the denoising of the results as the number of training runs. The SST, SERT SSIT and CDCN models demonstrate consistent stability of convergence in all evaluation measures. specifically, the PSNR curves increase steadily trend and attains a competitive progress when we compare with other techniques. In the same way, the SSIM training curves also increases steadily throughout training which means that the suggested model is effective in terms of preserving structural information and eliminating the noise. The spectral distortion during the training process is relatively low with regard to the proposed method in terms of the SAM metric. These findings indicate that the proposed architecture is capable of effectively learning strong spatial-spectral representations even when there is a noise randomly distributed.
Fig. 16 presents the convergence behavior of different comparing methods undr the Gaussian noise setting, as the number of epochs increases all the methods improve their denoising efficiency, however some models could not improve their learning such as DnCNN and MemNet on the different metrics such as PSNR, SSIM and SAM. SST learning performance is not satisfactory as in the

Figure 16: (a) PSNR, (b) SSIM, and (c) SAM values at noise level
In real cases, HSI datasets contain different types of noises. Therefore, two real-world datasets have been selected to assess the effectiveness of our proposed network, i.e., Houston-2018 (HT) and Pavia Center (PC). Different methods have been used to minimize the noise from these two real datasets, at the end to checkout the effectiveness of the denoising, additionally the classification experiments were performed using the SVM classifier on the denoised HS Images, obtained from the denoisers. For this purpose two well known classification metrics were used i.e., Over all accuracy (OA), refers to the proportion of correctly classified samples out of total samples and the other was Kappa cofficient which measure the agreement between predicted values and the actuial classified values, while accounting for the possibility of agreement occurring by chance. Finally, the quantitative and the qualitative results of the all the methods were listed in Tables 3 and 4 for the comparison.


The HT dataset has been collected by combining the efforts of the IEEE Geoscience and Remote Sensing Society (GRSS) and Houston University. This dataset was introduced first in January 2018 for a data fusion contest. HT [63] was acquired by the ITRES CASI 1500 instrument. We took the image of size

Figure 17: Outcomes for the HT dataset: (a) pseudo-color noisy image (bands 2, 3, and 35), (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) proposed CDCN. Close-ups are indicated with yellow rectangles.

Figure 18: Outcomes for the HT dataset: (a) noisy image (band 2), (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) proposed CDCN. Close-ups are indicated with red rectangles.

Figure 19: (a) PSNR, (b) SSIM, and (c) SAM curves generated during the noise removal from real data (HT).

Figure 20: Classification maps for the HT dataset using an SVM classifier before and after noise removal: (a) ground-truth, (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) proposed CDCN.
The ROSIS sensor collected the PC dataset over Pavia, a city in northern Italy. It has a size of

Figure 21: Outcomes for the PC dataset: (a) pseudo-color noisy image (bands 97, 3, and 2), (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT and (m) proposed CDCN. Close-ups are indicated with yellow rectangles.

Figure 22: Outcomes for the PC dataset: (a) noisy image (band 2), (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT, and (m) proposed CDCN.Close-ups are indicated with red rectangles.
We considered 102 spectral bands for our experiments removing the noise from the HSI. The classification results are reported in Table 4. The proposed method got the highest OA and kappa coefficient values. The classification maps for the compared approaches are shown in Fig. 23.

Figure 23: Classification maps for the PC dataset using an SVM classifier before and after denoising: (a) ground-truth, (b) LRMR, (c) DnCNN, (d) MemNet, (e) DeNet, (f) GradNet, (g) ENCAM, (h) HSIDwRD, (i) MAN, (j) SST, (k) SERT, (l) SSIT, and (m) proposed CDCN.
5.6 Statistical Significance Analysis
To evaluate the statistical significance of the proposed CDCN framework, a two-sample t-test is performed by comparing its overall accuracy (OA) with the mean OA of competing methods across both datasets. For statistical analysis, the results are taken from Tables 3 and 4. The statistical analysis is conducted using aggregated performance metrics, treating different methods as independent samples.
The computed p-values for both datasets are shared in the Table 5 with a significance threshold of 0.05, indicating that the performance improvements achieved by the proposed CDCN framework are statistically significant and not due to random variation.

The effectiveness of the CDCN structure is discussed in this subsection. We introduced the CDC-based architecture with an attention module in the HSI denoising domain. For the ablation study, we used the Houston dataset to measure the effectiveness of each module of our proposed architecture. The results of the ablation study are reported in Table 6:
1. CDCN w/o attention refers to the proposed CDCN without the attention module.
2. CDCN w/o CDC means that only vanilla convolutions are adopted, whereas the CDC has been removed. This model can also be viewed as the proposed CDCN with
3. CDCN is the proposed network.

The comparison with CDCN w/o CDC and the proposed CDCN shows that using the CDC property produced better results on two metrics (i.e., SSIM and SAM) but decreased the PSNR value. Comparing CDCN w/o CDC and the proposed CDCN, we have vanilla convolutions replace the CDC in the proposed network to get the CDCN w/o CDC approach. Experimental results show that CNCD w/o CDC produced worse results than the proposed CDCN, considering both SSIM and SAM indexes, thus proving the effectiveness of the proposed CDC module. Hence, the outcomes show that our method (which applies attention and CDC) is superior to traditional CNN-based solutions.
The computational efficiency of the proposed framework is analyzed by measuring the average running time of different methods on simulated data. The results are summarized in Table 7. All experiments are conducted using the hardware configuration described in the experimental setup, which includes an NVIDIA Tesla T4 GPU with 16 GB GPU memory, and the models were implemented using the PyTorch deep learning framework.

Deep learning-based approaches generally demonstrate faster processing compared with traditional optimization-based methods. As shown in Table 7, HSIDwRD achieves the shortest running time of 2.03 s, while the proposed CDCN model requires approximately 5.11 s per image during inference. Based on this inference time, the corresponding processing speed is approximately 0.20 frames per second (FPS).
Although the proposed model does not strictly satisfy real-time processing requirements, it demonstrates competitive computational efficiency compared with several existing deep learning models while achieving superior denoising performance. The proposed CDCN method ranks among the top-performing approaches in terms of efficiency while delivering improved qualitative and quantitative results due to the integration of attention mechanisms and structural feature extraction. This indicates that the proposed framework achieves a favorable balance between computational cost and reconstruction quality.
Table 7 also provides the model complexity of all the comparing methods and the proposed model in terms of parameter count and GFLOPs. It is proved that there a trade-off between representational capacity and computational efficiency. Models such as MemNet, LRMR, and ENCAM have higher parameter counts, i.e, 2.90M, 2.78M, and 2.56M, respectively, which may enhance feature extraction capability but at the cost of increased computational burden. This is further reflected in their GFLOPs, where ENCAM and MemNet attain 6.12 and 5.84 GFLOPs, respectively.
In contrast, lightweight models such as HSIDwRD and CDCN achieve remarkable efficiency. HSIDwRD has the smallest number of parameters (0.25M), while CDCN requires the lowest computational cost (0.10 GFLOPs). Notably, these models maintain a favorable balance between complexity and efficiency, making them promising algorithms for HSI denoising. These observations show the importance of designing architectures that balance performance and efficiency, especially for practical applications where both memory and computational constraints are critical considerations.
We performed the experiments on simulated and real datasets using the CDCN-based network instead of employing the simple vanilla convolutional layers. Seven different noise levels were used to train the model, and the performance was compared with eleven well-known classic, deep learning and transformer based algorithms using the same quality indexes, i.e., PSNR, SSIM, and SAM. We can conclude that CDCN performed better than the compared approaches, effectively capturing the details of the fine noise and retaining the structure of the HS image. The numerical results show that CDCN removes most of the noise during the noise-removing process. However, with random noise, CDCN showed lower SSIM values than the ENCAM algorithm. On the other side, CDCN got a clear superiority in terms of numeric results on all the comparing methods. Another point to discuss is related to the ablation study. Indeed, the proposed CDCN denoised the HSI better with the attention mechanism than the vanilla CNN with the attention module. The last point is instead about the model’s efficiency in terms of inference efficiency, number of parameters, and computational cost (GFLOPs), computed in Table 7. It can be noted that CDCN requires

Figure 24: Proposed CDCN model training loss curve.
This research proposed a deep-learning-based architecture to reduce noise in HSIs. CNN-based methods provide deep semantic features, but these networks cannot extract fine-grained information in environmentally sensitive cases. HSIs are ecologically sensitive. We presented a novel center difference convolution network to address such issues, exploiting both spatial and spectral information of HSIs. The proposed method extracted features by giving a weightage to the center of the image because most of the information is present there. Therefore, the proposed model extracted features using center difference convolutions. Furthermore, this research utilizes an attention mechanism with CDCN to refine the HSI features in the spatial-spectral domain. The results are further validated using an ablation study and a computational analysis, pointing out that our solution is also cost-efficient.
One major limitation of the proposed framework is that it is evaluated primarily on publicly available datasets, which may not fully represent the diversity and variability encountered in real-world scenarios. The proposed framework achieves an inference speed of approximately 0.20 FPS with the current implementation, which is not suitable for real-world deployment. We will aim to make the proposed framework lightweight. Secondly, although the model demonstrates strong predictive capability, deep learning models may still suffer from limited interpretability, which can affect their practical adoption in certain application domains.
Future work will focus on addressing these limitations by extending the framework to larger and more diverse datasets collected from multiple sources to improve generalization capability. In addition, future studies may explore lightweight model architectures and model compression techniques to reduce computational complexity and facilitate real-time deployment. Lastly, incorporating advanced explainable artificial intelligence (XAI) techniques could improve the interpretability of the model.
Acknowledgement: This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00218176) and the Soonchunhyang University Research Fund.
Funding Statement: This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00218176) and the Soonchunhyang University Research Fund.
Author Contributions:: The authors confirm contribution to the paper as follows: Conceptualization, Muhammad Umer; methodology, Mahmood Ashraf, Muhammad Umer and Raed Alharthi; software, Mahmood Ashraf, Nuha Zamzami, Muhammad Umer, Shtwai Alsubai and Raed Alharthi; validation, Yunyoung Nam; formal analysis, Yunyoung Nam and Yongwon Cho; investigation, Nuha Zamzami; data curation, Muhammad Umer and Shtwai Alsubai; writing—original draft preparation, Mahmood Ashraf, Muhammad Umer, Nuha Zamzami and Shtwai Alsubai; writing—review and editing, Raed Alharthi, Yunyoung Nam and Yongwon Cho; visualization, Mahmood Ashraf; supervision, Yunyoung Nam and Yongwon Cho; project administration, Yunyoung Nam and Yongwon Cho; funding acquisition, Yunyoung Nam and Yongwon Cho. All authors reviewed and approved the final version of the manuscript.
Availability of Data and Materials: The datasets utilized in this research are publicly available and can be accessed via the URL:
2. PC: https://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes#Pavia_Centre_scene
3. HT: https://github.com/YuxiangZhang-BIT/Data-CSHSI?tab=readme-ov-file
The specific details regarding the datasets, including any necessary information on how to access or cite them, can be found at the provided URL(s). All data used in this study can be freely obtained from the aforementioned source.
Ethics Approval: Not applicable.
Conflicts of Interest: The authors declare no conflicts of interest.
References
1. Harsanyi JC, Chang C-I. Hyperspectral image classification and dimensionality reduction: an orthogonal subspace projection approach. IEEE Trans Geosci Remote Sens. 1994;32(4):779–85. [Google Scholar]
2. Su H, Cai Y, Du Q. Firefly-algorithm-inspired framework with band selection and extreme learning machine for hyperspectral image classification. IEEE J Sel Top Appl Earth Obs Remote Sens. 2016;10(1):309–20. doi:10.1109/jstars.2016.2591004. [Google Scholar] [CrossRef]
3. Su H, Du Q, Chen G, Du P. Optimized hyperspectral band selection using particle swarm optimization. IEEE J Sel Top Appl Earth Obs Remote Sens. 2014;7(6):2659–70. doi:10.1109/jstars.2014.2312539. [Google Scholar] [CrossRef]
4. Plaza A, Benediktsson JA, Boardman JW, Brazile J, Bruzzone L, Camps-Valls G, et al. Recent advances in techniques for hyperspectral image processing. Remote Sens Environ. 2009;113:S110–22. doi:10.1016/j.rse.2007.07.028. [Google Scholar] [CrossRef]
5. Lu X, Wang Y, Yuan Y. Graph-regularized low-rank representation for destriping of hyperspectral images. IEEE Trans Geosci Remote Sens. 2013;51(7):4009–18. doi:10.1109/tgrs.2012.2226730. [Google Scholar] [CrossRef]
6. Dabov K, Foi A, Katkovnik V, Egiazarian K. Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans Image Process. 2007;16(8):2080–95. doi:10.1109/tip.2007.901238. [Google Scholar] [PubMed] [CrossRef]
7. Gu S, Zhang L, Zuo W, Feng X. Weighted nuclear norm minimization with application to image denoising. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, NJ, USA: IEEE; 2014. p. 2862–9. [Google Scholar]
8. Ye M, Qian Y, Zhou J. Multitask sparse nonnegative matrix factorization for joint spectral-spatial hyperspectral imagery denoising. IEEE Trans Geosci Remote Sens. 2014;53(5):2621–39. doi:10.1109/tgrs.2014.2363101. [Google Scholar] [CrossRef]
9. Chen Y, Cao X, Zhao Q, Meng D, Xu Z. Denoising hyperspectral image with non-iid noise structure. IEEE Trans Cybern. 2017;48(3):1054–66. doi:10.1109/tcyb.2017.2677944. [Google Scholar] [PubMed] [CrossRef]
10. Xie W, Li Y. Hyperspectral imagery denoising by deep learning with trainable nonlinearity function. IEEE Geosci Remote Sens Lett. 2017;14(11):1963–7. doi:10.1109/lgrs.2017.2743738. [Google Scholar] [CrossRef]
11. Chang Y, Yan L, Fang H, Zhong S, Liao W. Hsi-denet: hyperspectral image restoration via convolutional neural network. IEEE Trans Geosci Remote Sens. 2018;57(2):667–82. [Google Scholar]
12. Yuan Q, Zhang Q, Li J, Shen H, Zhang L. Hyperspectral image denoising employing a spatial-spectral deep residual convolutional neural network. IEEE Trans Geosci Remote Sens. 2018;57(2):1205–18. doi:10.1109/tgrs.2018.2865197. [Google Scholar] [CrossRef]
13. Bodrito T, Zouaoui A, Chanussot J, Mairal J. A trainable spectral-spatial sparse coding model for hyperspectral image restoration. Adv Neural Inf Process Syst. 2021;34:5430–42. [Google Scholar]
14. Dong W, Wang H, Wu F, Shi G, Li X. Deep spatial-spectral representation learning for hyperspectral image denoising. IEEE Trans Comput Imaging. 2019;5(4):635–48. doi:10.1109/tci.2019.2911881. [Google Scholar] [CrossRef]
15. Cao X, Fu X, Xu C, Meng D. Deep spatial-spectral global reasoning network for hyperspectral image denoising. IEEE Trans Geosci Remote Sens. 2021;60:55047114. doi:10.1109/tgrs.2021.3069241. [Google Scholar] [CrossRef]
16. Zhang Q, Yuan Q, Li J, Liu X, Shen H, Zhang L. Hybrid noise removal in hyperspectral imagery with a spatial-spectral gradient network. IEEE Trans Geosci Remote Sens. 2019;57(10):7317–29. doi:10.1109/tgrs.2019.2912909. [Google Scholar] [CrossRef]
17. Wei K, Fu Y, Huang H. 3-D quasi-recurrent neural network for hyperspectral image denoising. IEEE Trans Neural Netw Learn Syst. 2020;32(1):363–75. doi:10.1109/TNNLS.2020.2978756. [Google Scholar] [PubMed] [CrossRef]
18. Deng C, Li L, He Z, Li J, Zhu Y. Monte carlo non-local means method for hyperspectral image denoising. In: Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium. Piscataway, NJ, USA: IEEE; 2018. p. 4772–5. [Google Scholar]
19. Zhang R, Yang L, Liu G, Feng X. Weighted nuclear norms of transformed tensors for nonlocal hyperspectral image denoising. IEEE Geosci Remote Sens Lett. 2022;19:16011305. doi:10.1109/lgrs.2022.3186877. [Google Scholar] [CrossRef]
20. Wang Z, Ng MK, Zhuang L, Gao L, Zhang B. Nonlocal self-similarity-based hyperspectral remote sensing image denoising with 3D convolutional neural network. IEEE Trans Geosci Remote Sens. 2022;60:5531617. [Google Scholar]
21. Yuan Q, Zhang L, Shen H. Hyperspectral image denoising employing a spectral-spatial adaptive total variation model. IEEE Trans Geosci Remote Sens. 2012;50(10):3660–77. doi:10.1109/tgrs.2012.2185054. [Google Scholar] [CrossRef]
22. Bourennane S, Fossati C, Juan J. Improvement of classification based on noise and spectral dimensionality reduction for hyperspectral image. Geosci Remote Sens. 2018;1(1):9–17. doi:10.23977/geors.2018.11012. [Google Scholar] [CrossRef]
23. Zhang H, He W, Zhang L, Shen H, Yuan Q. Hyperspectral image restoration using low-rank matrix recovery. IEEE Trans Geosci Remote Sens. 2013;52(8):4729–43. doi:10.1109/tgrs.2013.2284280. [Google Scholar] [CrossRef]
24. Xie S, Wang S, Song C, Wang X. Hyperspectral image reconstruction based on spatial-spectral domains low-rank sparse representation. Remote Sens. 2022;14(17):4184. doi:10.3390/rs14174184. [Google Scholar] [CrossRef]
25. Chen Y, Huang T-Z, He W, Zhao X-L, Zhang H, Zeng J. Hyperspectral image denoising using factor group sparsity-regularized nonconvex low-rank approximation. IEEE Trans Geosci Remote Sens. 2021;60:5515916. [Google Scholar]
26. Gao Q, Lim S, Jia X. Spectral-spatial hyperspectral image classification using a multiscale conservative smoothing scheme and adaptive sparse representation. IEEE Trans Geosci Remote Sens. 2019;57(10):7718–30. doi:10.1109/tgrs.2019.2915809. [Google Scholar] [CrossRef]
27. Li J, Yuan Q, Shen H, Zhang L. Noise removal from hyperspectral image with joint spectral-spatial distributed sparse representation. IEEE Trans Geosci Remote Sens. 2016;54(9):5425–39. doi:10.1109/tgrs.2016.2564639. [Google Scholar] [CrossRef]
28. Renard N, Bourennane S, Blanc-Talon J. Denoising and dimensionality reduction using multilinear tools for hyperspectral images. IEEE Geosci Remote Sens Lett. 2008;5(2):138–42. doi:10.1109/lgrs.2008.915736. [Google Scholar] [CrossRef]
29. Fan H, Li C, Guo Y, Kuang G, Ma J. Spatial-spectral total variation regularized low-rank tensor decomposition for hyperspectral image denoising. IEEE Trans Geosci Remote Sens. 2018;56(10):6196–213. [Google Scholar]
30. Zhao Y-Q, Yang J. Hyperspectral image denoising via sparse representation and low-rank constraint. IEEE Trans Geosci Remote Sens. 2014;53(1):296–308. doi:10.1109/igarss.2013.6721354. [Google Scholar] [CrossRef]
31. Letexier D, Bourennane S. Noise removal from hyperspectral images by multidimensional filtering. IEEE Trans Geosci Remote Sens. 2008;46(7):2061–9. [Google Scholar]
32. Othman H, Qian S-E. Noise reduction of hyperspectral imagery using hybrid spatial-spectral derivative-domain wavelet shrinkage. IEEE Trans Geosci Remote Sens. 2006;44(2):397–408. doi:10.1109/tgrs.2005.860982. [Google Scholar] [CrossRef]
33. Atkinson I, Kamalabadi F, Jones DL. Wavelet-based hyperspectral image estimation. In: Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium. Piscataway, NJ, USA: IEEE; 2003. p. 743–5. [Google Scholar]
34. Rasti B, Sveinsson JR, Ulfarsson MO, Benediktsson JA. Hyperspectral image denoising using first order spectral roughness penalty in wavelet domain. IEEE J Sel Top Appl Earth Obs Remote Sens. 2013;7(6):2458–67. doi:10.1109/jstars.2013.2272879. [Google Scholar] [CrossRef]
35. Ashraf M, Chen L, Zhou X, Rakha MA. A joint architecture of mixed-attention transformer and octave module for hyperspectral image denoising. IEEE J Sel Top Appl Earth Obs Remote Sens. 2024;17:4331–49. [Google Scholar]
36. Wambugu N, Chen Y, Xiao Z, Tan K, Wei M, Liu X, et al. Hyperspectral image classification on insufficient-sample and feature learning using deep neural networks: a review. Int J Appl Earth Obs Geoinf. 2021;105(12):102603. doi:10.1016/j.jag.2021.102603. [Google Scholar] [CrossRef]
37. Sellami A, Tabbone S. Deep neural networks-based relevant latent representation learning for hyperspectral image classification. Pattern Recognit. 2022;121(1):108224. doi:10.1016/j.patcog.2021.108224. [Google Scholar] [CrossRef]
38. Das R, Singh TD. Assamese news image caption generation using attention mechanism. Multimed Tools Appl. 2022;81(7):10051–69. doi:10.1007/s11042-022-12042-8. [Google Scholar] [CrossRef]
39. Chen L, Lai Z, Vivone G, Jeon G, Chanussot J, Yang X. ArbRPN: a bidirectional recurrent pansharpening network for multispectral images with arbitrary numbers of bands. IEEE Trans Geosci Remote Sens. 2022;60:5406418. [Google Scholar]
40. Chen L, Vivone G, Nie Z, Chanussot J, Yang X. Spatial data augmentation: improving the generalization of neural networks for pansharpening. IEEE Trans Geosci Remote Sens. 2023;61:5401711. [Google Scholar]
41. Zhang K, Zuo W, Chen Y, Meng D, Zhang L. Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process. 2017;26(7):3142–55. [Google Scholar] [PubMed]
42. Maffei A, Haut JM, Paoletti ME, Plaza J, Bruzzone L, Plaza A. A single model CNN for hyperspectral image denoising. IEEE Trans Geosci Remote Sens. 2019;58(4):2516–29. [Google Scholar]
43. Kan Z, Li S, Hou M, Fang L, Zhang Y. Attention-based octave network for hyperspectral image denoising. IEEE J Sel Top Appl Earth Obs Remote Sens. 2021;15:1089–1102. [Google Scholar]
44. Wang Z, Shao Z, Huang X, Wang J, Lu T. SSCAN: a spatial-spectral cross attention network for hyperspectral image denoising. IEEE Geosci Remote Sens Lett. 2021;19:5508805. [Google Scholar]
45. Gong Z, Gao F, Dong J, Qi L. Hyperspectral image denoising based on parallel cross-fusion network. In: Proceedings of the 2022 IEEE International Geoscience and Remote Sensing Symposium. Piscataway, NJ, USA: IEEE; 2022. p. 1528–31. [Google Scholar]
46. Murugesan R, Nachimuthu N, Prakash G. Attention based deep convolutional U-Net with CSA optimization for hyperspectral image denoising. Infrared Phys Technol. 2023;129(10):104531. doi:10.1016/j.infrared.2022.104531. [Google Scholar] [CrossRef]
47. Lai Z, Fu Y. Mixed attention network for hyperspectral image denoising. arXiv:2301.11525. 2023. [Google Scholar]
48. Duan P, Luo Y, Kang X, Li S. Lamamba: linear attention mamba for hyperspectral image denoising. IEEE Trans Geosci Remote Sens. 2025;63:5527113. [Google Scholar]
49. Nachimuthu N, Murugesan R, Dharmalingam M, Prakash G. Revolutionizing hyper spectral image denoising: a squeezenet paradigm. Sci Rep. 2026;16:7419. [Google Scholar] [PubMed]
50. Yu Z, Qin Y, Li X, Wang Z, Zhao C, Lei Z. Multi-modal face anti-spoofing based on central difference networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway, NJ, USA: IEEE; 2020. p. 2766–74. [Google Scholar]
51. Woo S, Park J, Lee J-Y, Kweon IS. CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV). Cham, Switzerland: Springer; 2018. p. 3–19. [Google Scholar]
52. Shi Q, Tang X, Yang T, Liu R, Zhang L. Hyperspectral image denoising using a 3-D attention denoising network. IEEE Trans Geosci Remote Sens. 2021;59(12):10348–63. [Google Scholar]
53. Jin KH, McCann MT, Froustey E, Unser M. Deep convolutional neural network for inverse problems in imaging. IEEE Trans Image Process. 2017;26(9):4509–22. doi:10.1109/tip.2017.2713099. [Google Scholar] [PubMed] [CrossRef]
54. Wei Y, Yuan Q, Shen H, Zhang L. Boosting the accuracy of multispectral image pansharpening by learning a deep residual network. IEEE Geosci Remote Sens Lett. 2017;14(10):1795–9. doi:10.1109/lgrs.2017.2736020. [Google Scholar] [CrossRef]
55. Tai Y, Yang J, Liu X, Xu C. MemNet: a persistent memory network for image restoration. In: Proceedings of the IEEE International Conference on Computer Vision. Piscataway, NJ, USA: IEEE; 2017. p. 4549–57. [Google Scholar]
56. Liu Y, Anwar S, Zheng L, Tian Q. GradNet image denoising. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Piscataway, NJ, USA: IEEE; 2020. p. 2140–9. [Google Scholar]
57. Ma H, Liu G, Yuan Y. Enhanced non-local cascading network with attention mechanism for hyperspectral image denoising. In: Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Piscataway, NJ, USA: IEEE; 2020. p. 2448–52. [Google Scholar]
58. Zhang T, Fu Y, Li C. Hyperspectral image denoising with realistic data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway, NJ, USA: IEEE; 2021. p. 2228– 37. [Google Scholar]
59. Li M, Fu Y, Zhang Y. Spatial-spectral transformer for hyperspectral image denoising. Proc AAAI Conf Artif Intell. 2023;37(1):1368–76. doi:10.1609/aaai.v37i1.25221. [Google Scholar] [CrossRef]
60. Li M, Liu J, Fu Y, Zhang Y, Dou D. Spectral enhanced rectangle transformer for hyperspectral image denoising. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway, NJ, USA: IEEE; 2023. p. 5805–14. [Google Scholar]
61. Chen Z, Liu C, Zhou J. SSIT: a spatial-spectral interactive transformer for hyperspectral image denoising. Sci Remote Sens. 2025;12:100276. [Google Scholar]
62. Su H, Zhao B, Du Q, Du P, Xue Z. Multifeature dictionary learning for collaborative representation classification of hyperspectral imagery. IEEE Trans Geosci Remote Sens. 2018;56(4):2467–84. doi:10.1109/tgrs.2017.2781805. [Google Scholar] [CrossRef]
63. Le Saux B, Yokoya N, Hansch R, Prasad S. 2018 IEEE GRSS data fusion contest: multimodal land use classification [technical committees]. IEEE Geosci Remote Sens Mag. 2018;6(1):52–4. [Google Scholar]
Cite This Article
Copyright © 2026 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools