Home / Journals / CMC / Online First / doi:10.32604/cmc.2026.075400
Special Issues
Table of Content

Open Access

ARTICLE

From Algorithm to Expert: RLHF-Guided Vision-Language Model for 3D-EEM Fluorescence Spectroscopy Matching

Chenglong Lu1, Jiehui Li1, Tonglin Chen1,2,*, Changhua Zhou1, Yixin Fan1, Xinlin Ren1, Ziyi Ju1, Wei Wang1
1 College of Computer Science and Artificial Intelligence, Fudan University, Shanghai, China
2 The China Railway 24th Bureau Group Corporation Limited, Shanghai, China
* Corresponding Author: Tonglin Chen. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.075400

Received 31 October 2025; Accepted 21 January 2026; Published online 13 February 2026

Abstract

Existing methods for tracing water pollution sources typically integrate three-dimensional excitation-emission matrix (3D-EEM) fluorescence spectroscopy with similarity-based matching algorithms. However, these approaches exhibit high error rates in borderline cases and necessitate expert manual review, which limits scalability and introduces inconsistencies between algorithmic outputs and expert judgment. To address these limitations, we propose a large vision-language model (VLM) designed as an “expert agent” to automatically refine similarity scores, ensuring alignment with expert decisions and overcoming key application bottlenecks. The model consists of two core components: (1) rule-based similarity calculation module generate initial spectral similarity scores, and (2) pre-trained large vision-language model fine-tuned via supervised learning and reinforcement learning with human feedback (RLHF) to emulate expert assessments. To facilitate training and evaluation, we introduce two expert-annotated datasets, Spec1k and SpecReason, which capture both quantitative corrections and qualitative reasoning patterns, allowing the model to emulate expert decision-making processes. Experimental results demonstrate that our method achieves 81.45% source attribution accuracy, 38.24% higher than rule-based and machine learning baselines. Real-world deployment further validates its effectiveness.

Keywords

Vision-language model; reinforcement learning with human feedback; pollution source tracing; 3D fluorescence spectroscopy
  • 132

    View

  • 33

    Download

  • 0

    Like

Share Link