CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features

Daoming Wei; Fangyan Ge; Bopeng Zhang; Zhiqiang Zhao; Dequan Li; Lizong Xi; Jinrong Hu; Xin Wang

doi:10.32604/cmc.2025.061402

Open Access icon Open Access

ARTICLE

CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features

Daoming Wei¹, Fangyan Ge², Bopeng Zhang¹, Zhiqiang Zhao³, Dequan Li^3,*, Lizong Xi⁴, Jinrong Hu^5,*, Xin Wang⁶

1 National Key Laboratory of Intelligent Spatial Information, Beijing, 100029, China
2 School of Artificial Intelligence, Neijiang Normal University, Neijiang, 641100, China
3 CMA Cloud-Precipitation Physics and Weather Modification Key Laboratory, Beijing, 100081, China
4 Gansu Weather Modification Office, Lanzhou, 730020, China
5 School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China
6 Department of Epidemiology and Biostatistics, School of Public Health, University at Albany, State University of New York, New York, NY 12144, USA

* Corresponding Authors: Dequan Li. Email: email ; Jinrong Hu. Email: email

Computers, Materials & Continua 2025, 83(3), 5729-5746. https://doi.org/10.32604/cmc.2025.061402

Received 23 November 2024; Accepted 20 March 2025; Issue published 19 May 2025

Abstract

Accurate cloud classification plays a crucial role in aviation safety, climate monitoring, and localized weather forecasting. Current research has been focusing on machine learning techniques, particularly deep learning based model, for the types identification. However, traditional approaches such as convolutional neural networks (CNNs) encounter difficulties in capturing global contextual information. In addition, they are computationally expensive, which restricts their usability in resource-limited environments. To tackle these issues, we present the Cloud Vision Transformer (CloudViT), a lightweight model that integrates CNNs with Transformers. The integration enables an effective balance between local and global feature extraction. To be specific, CloudViT comprises two innovative modules: Feature Extraction (E_Module) and Downsampling (D_Module). These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension. Overall, the CloudViT includes 0.93 × 10⁶ parameters, which decreases more than ten times compared to the SOTA (State-of-the-Art) model CloudNet. Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT. It achieves classification accuracies of 98.45% and 100%, respectively. Moreover, the efficiency and scalability of CloudViT make it an ideal candidate for deployment in mobile cloud observation systems, enabling real-time cloud image classification. The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification. It holds significant potential for both optimizing performance and facilitating practical deployment scenarios.

Keywords

Image classification; ground-based cloud images; lightweight neural networks; attention mechanism; deep learning; vision transformer

Cite This Article

APA Style

Wei, D., Ge, F., Zhang, B., Zhao, Z., Li, D. et al. (2025). CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features. Computers, Materials & Continua, 83(3), 5729–5746. https://doi.org/10.32604/cmc.2025.061402

Vancouver Style

Wei D, Ge F, Zhang B, Zhao Z, Li D, Xi L, et al. CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features. Comput Mater Contin. 2025;83(3):5729–5746. https://doi.org/10.32604/cmc.2025.061402

IEEE Style

D. Wei et al., “CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features,” Comput. Mater. Contin., vol. 83, no. 3, pp. 5729–5746, 2025. https://doi.org/10.32604/cmc.2025.061402

BibTex EndNote RIS

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

CloudViT: A Lightweight Ground-Based Cloud Image Classification Model with the Ability to Capture Global Features

Abstract

Keywords

Cite This Article

1713

639

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link