TY - EJOU AU - Wei, Daoming AU - Ge, Fangyan AU - Zhang, Bopeng AU - Zhao, Zhiqiang AU - Li, Dequan AU - Xi, Lizong AU - Hu, Jinrong AU - Wang, Xin TI - CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features T2 - Computers, Materials \& Continua PY - 2025 VL - 83 IS - 3 SN - 1546-2226 AB - Accurate cloud classification plays a crucial role in aviation safety, climate monitoring, and localized weather forecasting. Current research has been focusing on machine learning techniques, particularly deep learning based model, for the types identification. However, traditional approaches such as convolutional neural networks (CNNs) encounter difficulties in capturing global contextual information. In addition, they are computationally expensive, which restricts their usability in resource-limited environments. To tackle these issues, we present the Cloud Vision Transformer (CloudViT), a lightweight model that integrates CNNs with Transformers. The integration enables an effective balance between local and global feature extraction. To be specific, CloudViT comprises two innovative modules: Feature Extraction (E_Module) and Downsampling (D_Module). These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension. Overall, the CloudViT includes 0.93 × 106 parameters, which decreases more than ten times compared to the SOTA (State-of-the-Art) model CloudNet. Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT. It achieves classification accuracies of 98.45% and 100%, respectively. Moreover, the efficiency and scalability of CloudViT make it an ideal candidate for deployment in mobile cloud observation systems, enabling real-time cloud image classification. The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification. It holds significant potential for both optimizing performance and facilitating practical deployment scenarios. KW - Image classification; ground-based cloud images; lightweight neural networks; attention mechanism; deep learning; vision transformer DO - 10.32604/cmc.2025.061402