Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading

Yan-Hao Huang; Yu-Tse Huang

doi:10.32604/cmc.2026.076800

Open Access

ARTICLE

Yan-Hao Huang^*, Yu-Tse Huang

Department of Green Energy and Information Technology, National Taitung University, Taitung, Taiwan

* Corresponding Author: Yan-Hao Huang. Email: email

Computers, Materials & Continua 2026, 87(3), 45 https://doi.org/10.32604/cmc.2026.076800

Received 26 November 2025; Accepted 19 January 2026; Issue published 09 April 2026

Abstract

Diabetic Retinopathy (DR) is a common microvascular complication of diabetes that progressively damages the retinal blood vessels and, without timely treatment, can lead to irreversible vision loss. In clinical practice, DR is typically diagnosed by ophthalmologists through visual inspection of fundus images, a process that is time-consuming and prone to inter- and intra-observer variability. Recent advances in artificial intelligence, particularly Convolutional Neural Networks (CNNs) and Transformer-based models, have shown strong potential for automated medical image classification and decision support. In this study, we propose a Position-Wise Attention-Enhanced Vision Transformer (PWAE-ViT), which integrates a positional attention module into the standard ViT architecture to strengthen spatial positional information and feature representation across image patches. The proposed module encourages the network to better model local and global contextual relationships, thereby improving DR grading performance. To evaluate the robustness of our model, we conducted experiments on two public retinal fundus image datasets: APTOS-2019 and the Indian Diabetic Retinopathy Image Dataset (IDRiD). The proposed PWAE-ViT consistently outperforms the baseline ViT model, achieving classification accuracies of 84% and 62% on the APTOS-2019 and IDRiD datasets, respectively. These results demonstrate more accurate and reliable DR severity classification, offering a promising tool to assist clinicians in screening and diagnosis.
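
The abstract does not specify the internals of the positional attention module, so the following is only an illustrative sketch of one common form of position attention, not the authors' PWAE-ViT implementation. It assumes that each image patch is already embedded as a vector, that every patch position attends to all other positions through a scaled dot-product softmax, and that the attended context is added back to the original token with a residual weight. The function name `position_attention`, the `scale` parameter, and the omission of learned query/key/value projections are all assumptions made for brevity.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def position_attention(tokens, scale=1.0):
    """Hypothetical position attention over patch tokens.

    tokens: list of N patch embeddings, each a list of D floats.
    scale:  assumed residual weight on the attended context.
    Returns a new N x D list; the input is not modified.
    """
    n = len(tokens)
    d = len(tokens[0])
    out = []
    for i in range(n):
        # Similarity of position i to every position j (scaled dot product).
        scores = [
            sum(a * b for a, b in zip(tokens[i], tokens[j])) / math.sqrt(d)
            for j in range(n)
        ]
        weights = softmax(scores)
        # Context vector: attention-weighted average of all patch tokens.
        context = [
            sum(weights[j] * tokens[j][k] for j in range(n)) for k in range(d)
        ]
        # Residual connection keeps the original token information.
        out.append([x + scale * c for x, c in zip(tokens[i], context)])
    return out


if __name__ == "__main__":
    patches = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three toy patch embeddings
    for row in position_attention(patches, scale=0.5):
        print([round(v, 3) for v in row])
```

In a full ViT, a module of this kind would typically operate on the patch tokens alongside the standard positional embeddings and multi-head self-attention; where exactly the authors insert their module is not stated in the abstract.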

Keywords

Transformer; attention mechanisms; medical imaging; diabetic retinopathy

Cite This Article

APA Style

Huang, Y., Huang, Y. (2026). Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading. Computers, Materials & Continua, 87(3), 45. https://doi.org/10.32604/cmc.2026.076800

Vancouver Style

Huang Y, Huang Y. Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading. Comput Mater Contin. 2026;87(3):45. https://doi.org/10.32604/cmc.2026.076800

IEEE Style

Y. Huang and Y. Huang, “Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading,” Comput. Mater. Contin., vol. 87, no. 3, pp. 45, 2026. https://doi.org/10.32604/cmc.2026.076800

Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
