Home / Journals / CMC / Online First / doi:10.32604/cmc.2025.074409
Special Issues
Table of Content

Open Access

REVIEW

A Review of Foundation Models for Multi-Task Agricultural Question Answering

Changxu Zhao1, Jianping Liu1,*, Xiaofeng Wang1, Wei Sun2, Libo Liu3, Haiyu Ren1, Pan Liu1, Qiantong Wang1
1 School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China
2 Institute of Plant Protection, Ningxia Academy of Agriculture and Forestry Sciences, Yinchuan, 750002, China
3 School of Information Engineer, Ningxia University, Yinchuan, 750021, China
* Corresponding Author: Jianping Liu. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.074409

Received 10 October 2025; Accepted 12 December 2025; Published online 06 January 2026

Abstract

Foundation models are reshaping artificial intelligence, yet their deployment in specialised domains such as agricultural question answering (AQA) still faces challenges including data scarcity and barriers to domain-specific knowledge. To systematically review recent progress in this area, this paper adopts a task–paradigm perspective and examines applications across three major AQA task families. For text-based QA, we analyse the strengths and limitations of retrieval-based, generative, and hybrid approaches built on large language models, revealing a clear trend toward hybrid paradigms that balance precision and flexibility. For visual diagnosis, we discuss techniques such as cross-modal alignment and prompt-driven generation, which are pushing systems beyond simple pest and disease recognition toward deeper causal reasoning. For multimodal reasoning, we show how the fusion of heterogeneous data—including text, images, speech, and sensor streams—enables comprehensive decision-making for diagnosis, monitoring, and yield prediction. To address the lack of unified benchmarks, we further propose a standardised evaluation protocol and a diagnostic taxonomy specifically designed to characterise agriculture-specific errors. Finally, we outline a concrete AQA roadmap that emphasises safety alignment, hallucination control, and lightweight deployment, aiming to guide future systems toward greater efficiency, trustworthiness, and sustainability.

Keywords

Foundation models; agricultural question answering; multimodal learning; large language models; smart agriculture; artificial intelligence
  • 93

    View

  • 14

    Download

  • 1

    Like

Share Link