Open Access

ARTICLE

A Novel Attention-Based Parallel Blocks Deep Architecture for Human Action Recognition

Yasir Khan Jadoon1, Yasir Noman Khalid1, Muhammad Attique Khan2, Jungpil Shin3,*, Fatimah Alhayan4, Hee-Chan Cho5, Byoungchol Chang6,*

1 Department of Computer Engineering, HITEC University, Taxila, 47080, Pakistan
2 Department of AI, Prince Mohammad bin Fahd University, Al-Khobar, 31952, Saudi Arabia
3 Department of Computer Science and Engineering, University of Aizu, Aizuwakamatsu, Fukushima, 965-0006, Japan
4 Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia
5 Center for Computational Social Science, Hanyang University, Seoul, 01000, Republic of Korea
6 Department of Computer Science, Hanyang University, Seoul, 01000, Republic of Korea

* Corresponding Authors: Jungpil Shin. Email: email; Byoungchol Chang. Email: email

(This article belongs to the Special Issue: Machine Learning and Deep Learning-Based Pattern Recognition)

Computer Modeling in Engineering & Sciences 2025, 144(1), 1143-1164. https://doi.org/10.32604/cmes.2025.066984

Abstract

Real-time surveillance depends on recognizing the variety of actions that humans perform. Human Action Recognition (HAR) is a technique that recognizes human actions from a video stream. The wide variation in human actions makes them difficult to recognize accurately. This paper presents a novel deep neural network architecture, Attention RB-Net, for HAR that takes video frames as input. The proposed architecture is built on a distinctive arrangement of residual blocks with several filter sizes. Features are extracted from each frame through a series of operations whose parameters are defined in the Attention-based Residual Bottleneck (Attention-RB) deep convolutional neural network (DCNN) architecture. A fully connected layer receives the attention-based feature matrix and performs the final classification. Several hyperparameters of the proposed model are initialized using Bayesian Optimization (BO) and are later used in the trained model during testing. In testing, features are extracted from the self-attention layer and passed to neural network classifiers for the final action classification. The proposed architecture was validated on two highly cited datasets, HMDB51 and UCF101, on which it obtained average accuracies of 87.70% and 97.30%, respectively. The DCNN architecture is compared with state-of-the-art (SOTA) methods, including pre-trained models, inside blocks, and recently published techniques, and it performs better.
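
The attention and classification stage described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give layer sizes or weights, so the token count, feature dimension, random projection matrices, and the mean-pooling step before the fully connected layer are all assumptions made for illustration. It shows only the general idea of scaled dot-product self-attention over a frame's feature matrix, followed by a fully connected layer with softmax.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the maximum along the axis for numerical stability.
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)

def self_attention(features, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a (tokens, dim) feature matrix."""
    q, k, v = features @ w_q, features @ w_k, features @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ v, weights

def classify(attended, w_fc, b_fc):
    """Mean-pool the attended tokens (an assumption), then apply a fully connected layer."""
    pooled = attended.mean(axis=0)
    return softmax(pooled @ w_fc + b_fc)

# Hypothetical sizes: 49 spatial tokens of 64 features; 51 classes as in HMDB51.
rng = np.random.default_rng(0)
num_tokens, dim, num_classes = 49, 64, 51
features = rng.standard_normal((num_tokens, dim))  # stand-in for CNN features of one frame
w_q, w_k, w_v = (rng.standard_normal((dim, dim)) * 0.1 for _ in range(3))
w_fc = rng.standard_normal((dim, num_classes)) * 0.1
b_fc = np.zeros(num_classes)

attended, weights = self_attention(features, w_q, w_k, w_v)
probs = classify(attended, w_fc, b_fc)
print(attended.shape, probs.shape)
```

In a trained model the projection and classifier weights would be learned rather than random, and the input features would come from the residual bottleneck blocks rather than a random generator.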

Keywords

Human action recognition; self-attention; video streams; residual bottleneck; classification; neural networks


Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.