Home / Journals / CMC / Online First / doi:10.32604/cmc.2026.075316
Special Issues
Table of Content

Open Access

REVIEW

Attention-Based Medical Image Analysis: Architectures, Applications, and Future Directions

Xinjie Yao1, Junjie Zhu2, Tao Hong3,4, Dengyu Zhao5, Weikai Liu6, Guangsheng Xie7,*
1 Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
2 School of Artificial Intelligence, Xiangyang Polytechnic University, Xiangyang, China
3 China Nuclear Power Engineering Co., Ltd., Beijing, China
4 CNNC Engineering Research Center for Fuel Reprocessing, Beijing, China
5 School of Mathematics, Tianjin University, Tianjin, China
6 Qingdao Institute of Software, College of Computer Science and Technology, China University of Petroleum (East China), Qingdao, China
7 Faculty of Surveying and Information Engineering, West Yunnan University of Applied Sciences, Yunnan, China
* Corresponding Author: Guangsheng Xie. Email: email
(This article belongs to the Special Issue: Attention Mechanism-based Complex System Pattern Intelligent Recognition and Accurate Prediction)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.075316

Received 29 October 2025; Accepted 13 February 2026; Published online 18 May 2026

Abstract

The attention mechanism, as a key technology for enhancing the performance of deep learning, is gaining increasingly widespread attention in medical image analysis due to its ability to focus on critical features and suppress redundant information. In recent years, the continuous evolution of attention methods has significantly improved their accuracy and robustness in key medical tasks such as lesion detection, tissue segmentation, and multimodal fusion, providing crucial support for building reliable clinical decision support systems. This paper systematically reviews the advances in attention-based methods for medical image analysis, comparing their performance with mainstream models like U-shaped convolutional neural network (UNet), convolutional neural network (CNN), and Vision Transformer (ViT) across multiple tasks, and summarizes various evaluation metrics, including the Dice coefficient, area under the ROC (Receiver Operating Characteristic) curve (AUC), and mean Average Precision (mAP). The review highlights that attention mechanisms bring significant improvements across core tasks such as segmentation, classification, detection, registration, and multimodal fusion: the Dice coefficient increases by 5%–12% in segmentation tasks, AUC improves by 3%–8% in classification tasks, mAP rises by 7%–15% in detection tasks, alignment accuracy enhances by 10%–20% in registration tasks, and retrieval accuracy reaches 85%–95% in multimodal fusion. The design characteristics and performance gains of key architectures such as channel attention, spatial attention, and hybrid attention are further analyzed. Nevertheless, current research still faces critical challenges, including scarce annotated data, limited cross-center generalization, complexity in multimodal fusion, insufficient model interpretability, and high computational costs. Accordingly, future research directions are proposed to promote the in-depth development and clinical translation of attention mechanisms in medical image analysis.

Keywords

Attention mechanism; medical imaging; deep learning; multimodal fusion; explainability
  • 41

    View

  • 8

    Download

  • 0

    Like

Share Link