Home / Journals / CMC / Online First / doi:10.32604/cmc.2026.076800
Special Issues
Table of Content

Open Access

ARTICLE

Position-Wise Attention-Enhanced Vision Transformer for Diabetic Retinopathy Grading

Yan-Hao Huang*, Yu-Tse Huang
Department of Green Energy and Information Technology, National Taitung University, Taitung, Taiwan
* Corresponding Author: Yan-Hao Huang. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.076800

Received 26 November 2025; Accepted 19 January 2026; Published online 04 February 2026

Abstract

Diabetic Retinopathy (DR) is a common microvascular complication of diabetes that progressively damages the retinal blood vessels and, without timely treatment, can lead to irreversible vision loss. In clinical practice, DR is typically diagnosed by ophthalmologists through visual inspection of fundus images, a process that is time-consuming and prone to inter- and intra-observer variability. Recent advances in artificial intelligence, particularly Convolutional Neural Networks (CNNs) and Transformer-based models, have shown strong potential for automated medical image classification and decision support. In this study, we propose a Position-Wise Attention-Enhanced Vision Transformer (PWAE-ViT), which integrates a positional attention module into the standard ViT architecture to strengthen spatial positional information and feature representation across image patches. The proposed module encourages the network to better model local and global contextual relationships, thereby improving DR grading performance. To evaluate the robustness of our model, experiments were conducted on two public retinal fundus image datasets: APTOS-2019 and Indian Diabetic Retinopathy Image Dataset (IDRiD). The proposed PWAE-ViT consistently outperforms the baseline ViT model, achieving classification accuracies of 84% and 62% on the APTOS-2019 and IDRiD datasets, respectively. These results demonstrate more accurate and reliable DR severity classification, offering a promising tool to assist clinicians in screening and diagnosis.

Keywords

Transformer; attention mechanisms; medical imaging; diabetic retinopathy
  • 68

    View

  • 12

    Download

  • 0

    Like

Share Link