Open Access iconOpen Access

ARTICLE

crossmark

Image Enhancement Combined with LLM Collaboration for Low-Contrast Image Character Recognition

Qin Qin1, Xuan Jiang1,*, Jinhua Jiang1, Dongfang Zhao1, Zimei Tu1, Zhiwei Shen2

1 School of Intelligent Manufacturing and Control Engineering, Shanghai Polytechnic University, Shanghai, 201209, China
2 School of Electrical Engineering and Telecommunications, UNSW Sydney, Sydney, NSW 2052, Australia

* Corresponding Author: Xuan Jiang. Email: email

Computers, Materials & Continua 2025, 85(3), 4849-4867. https://doi.org/10.32604/cmc.2025.067919

Abstract

The effectiveness of industrial character recognition on cast steel is often compromised by factors such as corrosion, surface defects, and low contrast, which hinder the extraction of reliable visual information. The problem is further compounded by the scarcity of large-scale annotated datasets and complex noise patterns in real-world factory environments. This makes conventional OCR techniques and standard deep learning models unreliable. To address these limitations, this study proposes a unified framework that integrates adaptive image preprocessing with collaborative reasoning among LLMs. A Biorthogonal 4.4 (bior4.4) wavelet transform is adaptively tuned using DE to enhance character edge clarity, suppress background noise, and retain morphological structure, thereby improving input quality for subsequent recognition. A structured three-round debate mechanism is further introduced within a multi-agent architecture, employing GPT-4o and Gemini-2.0-flash as role-specialized agents to perform complementary inference and achieve consensus. The proposed system is evaluated on a proprietary dataset of 48 high-resolution images collected under diverse industrial conditions. Experimental results show that the combination of DE-based enhancement and multi-agent collaboration consistently outperforms traditional baselines and ablated models, achieving an F1-score of 94.93% and an LCS accuracy of 93.30%. These results demonstrate the effectiveness of integrating signal processing with multi-agent LLM reasoning to achieve robust and interpretable OCR in visually complex and data-scarce industrial environments.

Keywords

Low-contrast images; differential evolution (DE); wavelet transform; multi-agent systems; large language models (LLMs)

Cite This Article

APA Style
Qin, Q., Jiang, X., Jiang, J., Zhao, D., Tu, Z. et al. (2025). Image Enhancement Combined with LLM Collaboration for Low-Contrast Image Character Recognition. Computers, Materials & Continua, 85(3), 4849–4867. https://doi.org/10.32604/cmc.2025.067919
Vancouver Style
Qin Q, Jiang X, Jiang J, Zhao D, Tu Z, Shen Z. Image Enhancement Combined with LLM Collaboration for Low-Contrast Image Character Recognition. Comput Mater Contin. 2025;85(3):4849–4867. https://doi.org/10.32604/cmc.2025.067919
IEEE Style
Q. Qin, X. Jiang, J. Jiang, D. Zhao, Z. Tu, and Z. Shen, “Image Enhancement Combined with LLM Collaboration for Low-Contrast Image Character Recognition,” Comput. Mater. Contin., vol. 85, no. 3, pp. 4849–4867, 2025. https://doi.org/10.32604/cmc.2025.067919



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 556

    View

  • 174

    Download

  • 0

    Like

Share Link