Open Access iconOpen Access

ARTICLE

Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading

Yan-Hao Huang*, Yu-Tse Huang

Department of Green Energy and Information Technology, National Taitung University, Taitung, Taiwan

* Corresponding Author: Yan-Hao Huang. Email: email

Computers, Materials & Continua 2026, 87(3), 45 https://doi.org/10.32604/cmc.2026.076800

Abstract

Diabetic Retinopathy (DR) is a common microvascular complication of diabetes that progressively damages the retinal blood vessels and, without timely treatment, can lead to irreversible vision loss. In clinical practice, DR is typically diagnosed by ophthalmologists through visual inspection of fundus images, a process that is time-consuming and prone to inter- and intra-observer variability. Recent advances in artificial intelligence, particularly Convolutional Neural Networks (CNNs) and Transformer-based models, have shown strong potential for automated medical image classification and decision support. In this study, we propose a Position-Wise Attention-Enhanced Vision Transformer (PWAE-ViT), which integrates a positional attention module into the standard ViT architecture to strengthen spatial positional information and feature representation across image patches. The proposed module encourages the network to better model local and global contextual relationships, thereby improving DR grading performance. To evaluate the robustness of our model, experiments were conducted on two public retinal fundus image datasets: APTOS-2019 and Indian Diabetic Retinopathy Image Dataset (IDRiD). The proposed PWAE-ViT consistently outperforms the baseline ViT model, achieving classification accuracies of 84% and 62% on the APTOS-2019 and IDRiD datasets, respectively. These results demonstrate more accurate and reliable DR severity classification, offering a promising tool to assist clinicians in screening and diagnosis.

Keywords

Transformer; attention mechanisms; medical imaging; diabetic retinopathy

Cite This Article

APA Style
Huang, Y., Huang, Y. (2026). Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading. Computers, Materials & Continua, 87(3), 45. https://doi.org/10.32604/cmc.2026.076800
Vancouver Style
Huang Y, Huang Y. Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading. Comput Mater Contin. 2026;87(3):45. https://doi.org/10.32604/cmc.2026.076800
IEEE Style
Y. Huang and Y. Huang, “Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading,” Comput. Mater. Contin., vol. 87, no. 3, pp. 45, 2026. https://doi.org/10.32604/cmc.2026.076800



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 639

    View

  • 333

    Download

  • 0

    Like

Share Link