A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention

Zaid Derea; Beiji Zou; Xiaoyan Kui; Alaa Thobhani; Amr Abdussalam

doi:10.32604/cmes.2025.059586

Open Access icon Open Access

ARTICLE

A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention

Zaid Derea^1,2, Beiji Zou¹, Xiaoyan Kui^1,*, Alaa Thobhani¹, Amr Abdussalam³

1 School of Computer Science and Engineering, Central South University, Changsha, 410083, China
2 College of Computer Science and Information Technology, Wasit University, Wasit, 52001, Iraq
3 Electronic Engineering and Information Science Department, University of Science and Technology of China, Hefei, 230026, China

* Corresponding Author: Xiaoyan Kui. Email: email

(This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)

Computer Modeling in Engineering & Sciences 2025, 142(3), 2841-2867. https://doi.org/10.32604/cmes.2025.059586

Received 12 October 2024; Accepted 10 January 2025; Issue published 03 March 2025

Abstract

Enhancing website security is crucial to combat malicious activities, and CAPTCHA (Completely Automated Public Turing tests to tell Computers and Humans Apart) has become a key method to distinguish humans from bots. While text-based CAPTCHAs are designed to challenge machines while remaining human-readable, recent advances in deep learning have enabled models to recognize them with remarkable efficiency. In this regard, we propose a novel two-layer visual attention framework for CAPTCHA recognition that builds on traditional attention mechanisms by incorporating Guided Visual Attention (GVA), which sharpens focus on relevant visual features. We have specifically adapted the well-established image captioning task to address this need. Our approach utilizes the first-level attention module as guidance to the second-level attention component, incorporating two LSTM (Long Short-Term Memory) layers to enhance CAPTCHA recognition. Our extensive evaluation across four diverse datasets—Weibo, BoC (Bank of China), Gregwar, and Captcha 0.3—shows the adaptability and efficacy of our method. Our approach demonstrated impressive performance, achieving an accuracy of 96.70% for BoC and 95.92% for Webo. These results underscore the effectiveness of our method in accurately recognizing and processing CAPTCHA datasets, showcasing its robustness, reliability, and ability to handle varied challenges in CAPTCHA recognition.

Keywords

Text-based CAPTCHA image recognition; guided visual attention; web security; computer vision

Cite This Article

APA Style

Derea, Z., Zou, B., Kui, X., Thobhani, A., Abdussalam, A. (2025). A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention. Computer Modeling in Engineering & Sciences, 142(3), 2841–2867. https://doi.org/10.32604/cmes.2025.059586

Vancouver Style

Derea Z, Zou B, Kui X, Thobhani A, Abdussalam A. A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention. Comput Model Eng Sci. 2025;142(3):2841–2867. https://doi.org/10.32604/cmes.2025.059586

IEEE Style

Z. Derea, B. Zou, X. Kui, A. Thobhani, and A. Abdussalam, “A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention,” Comput. Model. Eng. Sci., vol. 142, no. 3, pp. 2841–2867, 2025. https://doi.org/10.32604/cmes.2025.059586

BibTex EndNote RIS

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention

Abstract

Keywords

Cite This Article

1484

633

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link