TY  - EJOU
AU  - Wei, Daoming 
AU  - Ge, Fangyan 
AU  - Zhang, Bopeng 
AU  - Zhao, Zhiqiang 
AU  - Li, Dequan 
AU  - Xi, Lizong 
AU  - Hu, Jinrong 
AU  - Wang, Xin 

TI  - CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features
T2  - Computers, Materials \& Continua

PY  - 2025
VL  - 83
IS  - 3
SN  - 1546-2226

AB  - Accurate cloud classification plays a crucial role in aviation safety, climate monitoring, and localized weather forecasting. Current research has been focusing on machine learning techniques, particularly deep learning based model, for the types identification. However, traditional approaches such as convolutional neural networks (CNNs) encounter difficulties in capturing global contextual information. In addition, they are computationally expensive, which restricts their usability in resource-limited environments. To tackle these issues, we present the Cloud Vision Transformer (CloudViT), a lightweight model that integrates CNNs with Transformers. The integration enables an effective balance between local and global feature extraction. To be specific, CloudViT comprises two innovative modules: Feature Extraction (E_Module) and Downsampling (D_Module). These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension. Overall, the CloudViT includes 0.93 × 10<sup>6</sup> parameters, which decreases more than ten times compared to the SOTA (State-of-the-Art) model CloudNet. Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT. It achieves classification accuracies of 98.45% and 100%, respectively. Moreover, the efficiency and scalability of CloudViT make it an ideal candidate for deployment in mobile cloud observation systems, enabling real-time cloud image classification. The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification. It holds significant potential for both optimizing performance and facilitating practical deployment scenarios.
KW  - Image classification; ground-based cloud images; lightweight neural networks; attention mechanism; deep learning; vision transformer

DO  - 10.32604/cmc.2025.061402