Open Access iconOpen Access

ARTICLE

crossmark

Multi-Stage Vision Transformer and Knowledge Graph Fusion for Enhanced Plant Disease Classification

Wafaa H. Alwan1,*, Sabah M. Alturfi2

1 Computer Science, Faculty of Computer Science & Information Technology, University of Karbala, Karbala, 56001, Iraq
2 Law College, Karbala University, Karbala, 56001, Iraq

* Corresponding Author: Wafaa H. Alwan. Email: email

Computer Systems Science and Engineering 2025, 49, 419-434. https://doi.org/10.32604/csse.2025.064195

Abstract

Plant diseases pose a significant challenge to global agricultural productivity, necessitating efficient and precise diagnostic systems for early intervention and mitigation. In this study, we propose a novel hybrid framework that integrates EfficientNet-B8, Vision Transformer (ViT), and Knowledge Graph Fusion (KGF) to enhance plant disease classification across 38 distinct disease categories. The proposed framework leverages deep learning and semantic enrichment to improve classification accuracy and interpretability. EfficientNet-B8, a convolutional neural network (CNN) with optimized depth and width scaling, captures fine-grained spatial details in high-resolution plant images, aiding in the detection of subtle disease symptoms. In parallel, ViT, a transformer-based architecture, effectively models long-range dependencies and global structural patterns within the images, ensuring robust disease pattern recognition. Furthermore, KGF incorporates domain-specific metadata, such as crop type, environmental conditions, and disease relationships, to provide contextual intelligence and improve classification accuracy. The proposed model was rigorously evaluated on a large-scale dataset containing diverse plant disease images, achieving outstanding performance with a 99.7% training accuracy and 99.3% testing accuracy. The precision and F1-score were consistently high across all disease classes, demonstrating the framework’s ability to minimize false positives and false negatives. Compared to conventional deep learning approaches, this hybrid method offers a more comprehensive and interpretable solution by integrating self-attention mechanisms and domain knowledge. Beyond its superior classification performance, this model opens avenues for optimizing metadata dependency and reducing computational complexity, making it more feasible for real-world deployment in resource-constrained agricultural settings. The proposed framework represents an advancement in precision agriculture, providing scalable, intelligent disease diagnosis that enhances crop protection and food security.

Keywords

Plant disease classification; EfficientNet-B8; vision transformer; knowledge graph fusion; precision agriculture; deep learning; contextual metadata

Cite This Article

APA Style
Alwan, W.H., Alturfi, S.M. (2025). Multi-Stage Vision Transformer and Knowledge Graph Fusion for Enhanced Plant Disease Classification. Computer Systems Science and Engineering, 49(1), 419–434. https://doi.org/10.32604/csse.2025.064195
Vancouver Style
Alwan WH, Alturfi SM. Multi-Stage Vision Transformer and Knowledge Graph Fusion for Enhanced Plant Disease Classification. Comput Syst Sci Eng. 2025;49(1):419–434. https://doi.org/10.32604/csse.2025.064195
IEEE Style
W. H. Alwan and S. M. Alturfi, “Multi-Stage Vision Transformer and Knowledge Graph Fusion for Enhanced Plant Disease Classification,” Comput. Syst. Sci. Eng., vol. 49, no. 1, pp. 419–434, 2025. https://doi.org/10.32604/csse.2025.064195



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 115

    View

  • 67

    Download

  • 0

    Like

Share Link