Open Access iconOpen Access

ARTICLE

crossmark

CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features

Daoming Wei1, Fangyan Ge2, Bopeng Zhang1, Zhiqiang Zhao3, Dequan Li3,*, Lizong Xi4, Jinrong Hu5,*, Xin Wang6

1 National Key Laboratory of Intelligent Spatial Information, Beijing, 100029, China
2 School of Artificial Intelligence, Neijiang Normal University, Neijiang, 641100, China
3 CMA Cloud-Precipitation Physics and Weather Modification Key Laboratory, Beijing, 100081, China
4 Gansu Weather Modification Office, Lanzhou, 730020, China
5 School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
6 Department of Epidemiology and Biostatistics, School of Public Health, University at Albany, State University of New York, New York, NY 12144, USA

* Corresponding Authors: Dequan Li. Email: email; Jinrong Hu. Email: email

Computers, Materials & Continua 2025, 83(3), 5729-5746. https://doi.org/10.32604/cmc.2025.061402

Abstract

Accurate cloud classification plays a crucial role in aviation safety, climate monitoring, and localized weather forecasting. Current research has been focusing on machine learning techniques, particularly deep learning based model, for the types identification. However, traditional approaches such as convolutional neural networks (CNNs) encounter difficulties in capturing global contextual information. In addition, they are computationally expensive, which restricts their usability in resource-limited environments. To tackle these issues, we present the Cloud Vision Transformer (CloudViT), a lightweight model that integrates CNNs with Transformers. The integration enables an effective balance between local and global feature extraction. To be specific, CloudViT comprises two innovative modules: Feature Extraction (E_Module) and Downsampling (D_Module). These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension. Overall, the CloudViT includes 0.93 × 106 parameters, which decreases more than ten times compared to the SOTA (State-of-the-Art) model CloudNet. Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT. It achieves classification accuracies of 98.45% and 100%, respectively. Moreover, the efficiency and scalability of CloudViT make it an ideal candidate for deployment in mobile cloud observation systems, enabling real-time cloud image classification. The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification. It holds significant potential for both optimizing performance and facilitating practical deployment scenarios.

Keywords

Image classification; ground-based cloud images; lightweight neural networks; attention mechanism; deep learning; vision transformer

Cite This Article

APA Style
Wei, D., Ge, F., Zhang, B., Zhao, Z., Li, D. et al. (2025). CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features. Computers, Materials & Continua, 83(3), 5729–5746. https://doi.org/10.32604/cmc.2025.061402
Vancouver Style
Wei D, Ge F, Zhang B, Zhao Z, Li D, Xi L, et al. CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features. Comput Mater Contin. 2025;83(3):5729–5746. https://doi.org/10.32604/cmc.2025.061402
IEEE Style
D. Wei et al., “CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features,” Comput. Mater. Contin., vol. 83, no. 3, pp. 5729–5746, 2025. https://doi.org/10.32604/cmc.2025.061402



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 300

    View

  • 116

    Download

  • 0

    Like

Share Link