Open Access iconOpen Access

ARTICLE

Enhanced Plant Species Identification through Metadata Fusion and Vision Transformer Integration

Hassan Javed1, Labiba Gillani Fahad1, Syed Fahad Tahir2,*, Mehdi Hassan2, Hani Alquhayz3

1 Department of Computer Science, National University of Computer & Emerging Sciences, Islamabad, 44000, Pakistan
2 Department of Computer Science, Air University, Islamabad, 44000, Pakistan
3 Department of Computer Science and Information, College of Science in Zulfi, Majmaah University, Al-Majmaah, 11952, Saudi Arabia

* Corresponding Author: Syed Fahad Tahir. Email: email

Computers, Materials & Continua 2025, 85(2), 3981-3996. https://doi.org/10.32604/cmc.2025.064359

Abstract

Accurate plant species classification is essential for many applications, such as biodiversity conservation, ecological research, and sustainable agricultural practices. Traditional morphological classification methods are inherently slow, labour-intensive, and prone to inaccuracies, especially when distinguishing between species exhibiting visual similarities or high intra-species variability. To address these limitations and to overcome the constraints of image-only approaches, we introduce a novel Artificial Intelligence-driven framework. This approach integrates robust Vision Transformer (ViT) models for advanced visual analysis with a multi-modal data fusion strategy, incorporating contextual metadata such as precise environmental conditions, geographic location, and phenological traits. This combination of visual and ecological cues significantly enhances classification accuracy and robustness, proving especially vital in complex, heterogeneous real-world environments. The proposed model achieves an impressive 97.27% of test accuracy, and Mean Reciprocal Rank (MRR) of 0.9842 that demonstrates strong generalization capabilities. Furthermore, efficient utilization of high-performance GPU resources (RTX 3090, 18 GB memory) ensures scalable processing of high-dimensional data. Comparative analysis consistently confirms that our metadata fusion approach substantially improves classification performance, particularly for morphologically similar species, and through principled self-supervised and transfer learning from ImageNet, the model adapts efficiently to new species, ensuring enhanced generalization. This comprehensive approach holds profound practical implications for precise conservation initiatives, rigorous ecological monitoring, and advanced agricultural management.

Keywords

Vision transformers (ViTs); transformers; machine learning; deep learning; plant species; classification; multi-organ

Cite This Article

APA Style
Javed, H., Fahad, L.G., Tahir, S.F., Hassan, M., Alquhayz, H. (2025). Enhanced Plant Species Identification through Metadata Fusion and Vision Transformer Integration. Computers, Materials & Continua, 85(2), 3981–3996. https://doi.org/10.32604/cmc.2025.064359
Vancouver Style
Javed H, Fahad LG, Tahir SF, Hassan M, Alquhayz H. Enhanced Plant Species Identification through Metadata Fusion and Vision Transformer Integration. Comput Mater Contin. 2025;85(2):3981–3996. https://doi.org/10.32604/cmc.2025.064359
IEEE Style
H. Javed, L. G. Fahad, S. F. Tahir, M. Hassan, and H. Alquhayz, “Enhanced Plant Species Identification through Metadata Fusion and Vision Transformer Integration,” Comput. Mater. Contin., vol. 85, no. 2, pp. 3981–3996, 2025. https://doi.org/10.32604/cmc.2025.064359



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 786

    View

  • 463

    Download

  • 0

    Like

Share Link