From Algorithm to Expert: RLHF-Guided Vision-Language Model for 3D-EEM Fluorescence Spectroscopy Matching

Chenglong Lu¹, Jiehui Li¹, Tonglin Chen¹,²,*, Changhua Zhou¹, Yixin Fan¹, Xinlin Ren¹, Ziyi Ju¹, Wei Wang¹
1 College of Computer Science and Artificial Intelligence, Fudan University, Shanghai, China
2 The China Railway 24th Bureau Group Corporation Limited, Shanghai, China
* Corresponding Author: Tonglin Chen. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.075400

Received 31 October 2025; Accepted 21 January 2026; Published online 13 February 2026

Abstract

Existing methods for tracing water pollution sources typically integrate three-dimensional excitation-emission matrix (3D-EEM) fluorescence spectroscopy with similarity-based matching algorithms. However, these approaches exhibit high error rates in borderline cases and require manual expert review, which limits scalability and introduces inconsistencies between algorithmic outputs and expert judgment. To address these limitations, we propose a large vision-language model (VLM) designed as an “expert agent” that automatically refines similarity scores so that they align with expert decisions, removing a key bottleneck in practical application. The model consists of two core components: (1) a rule-based similarity calculation module that generates initial spectral similarity scores, and (2) a pre-trained large vision-language model fine-tuned via supervised learning and reinforcement learning with human feedback (RLHF) to emulate expert assessments. To facilitate training and evaluation, we introduce two expert-annotated datasets, Spec1k and SpecReason, which capture both quantitative corrections and qualitative reasoning patterns, allowing the model to emulate expert decision-making processes. Experimental results demonstrate that our method achieves 81.45% source attribution accuracy, 38.24% higher than rule-based and machine learning baselines. Real-world deployment further validates its effectiveness.

Keywords

Vision-language model; reinforcement learning with human feedback; pollution source tracing; 3D fluorescence spectroscopy
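
The first component described in the abstract, a rule-based module that produces an initial similarity score, can be illustrated with a minimal sketch. The abstract does not state which similarity measure the authors use, so cosine similarity is assumed here purely for illustration. The function names `eem_cosine_similarity` and `needs_refinement` and the borderline band of 0.6 to 0.9 are hypothetical and do not come from the paper.

```python
import math
from typing import Sequence

Matrix = Sequence[Sequence[float]]


def eem_cosine_similarity(a: Matrix, b: Matrix) -> float:
    """Cosine similarity between two EEMs of identical shape.

    Assumption: each EEM is a 2D grid of fluorescence intensities
    (excitation x emission). The metric is illustrative only.
    """
    flat_a = [v for row in a for v in row]
    flat_b = [v for row in b for v in row]
    if len(flat_a) != len(flat_b):
        raise ValueError("EEMs must have the same shape")
    dot = sum(x * y for x, y in zip(flat_a, flat_b))
    norm_a = math.sqrt(sum(x * x for x in flat_a))
    norm_b = math.sqrt(sum(y * y for y in flat_b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero spectrum carries no shape information
    return dot / (norm_a * norm_b)


def needs_refinement(score: float, low: float = 0.6, high: float = 0.9) -> bool:
    """Flag borderline scores for a second-stage review.

    The thresholds are hypothetical placeholders, not values from the paper.
    """
    return low <= score < high


if __name__ == "__main__":
    sample = [[1.0, 2.0], [3.0, 4.0]]
    candidate = [[2.0, 4.0], [6.0, 8.0]]  # same spectral shape, doubled intensity
    score = eem_cosine_similarity(sample, candidate)
    print(f"similarity={score:.3f}, borderline={needs_refinement(score)}")
```

In a pipeline of the kind the abstract outlines, scores that fall in the borderline band would be passed to the fine-tuned VLM for adjustment rather than to a human reviewer. That second stage is not sketched here because the abstract gives no detail on its interface.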
