Two-Stream Deep Learning Architecture-Based Human Action Recognition

Faheem Shehzad; Muhammad Khan; Muhammad Asfand; Muhammad Sharif; Majed Alhaisoni; Usman Tariq; Arnab Majumdar; Orawit Thinnukool

doi:10.32604/cmc.2023.028743

Open Access icon Open Access

ARTICLE

Two-Stream Deep Learning Architecture-Based Human Action Recognition

Faheem Shehzad¹, Muhammad Attique Khan², Muhammad Asfand E. Yar³, Muhammad Sharif¹, Majed Alhaisoni⁴, Usman Tariq⁵, Arnab Majumdar⁶, Orawit Thinnukool^7,*

1 Department of Computer Science, COMSATS University Islamabad, Wah Campus, Pakistan
2 Department of Computer Science, HITEC University, Taxila, Pakistan
3 Department of Computer Science, Bahria University, Islamabad, Pakistan
4 Computer Sciences Department, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, 11671, Saudi Arabia
5 College of Computer Engineering and Science, Prince Sattam Bin Abdulaziz University, Al-Kharaj, 11942, Saudi Arabia
6 Faculty of Engineering, Imperial College London, London, SW7 2AZ, UK
7 College of Arts, Media, and Technology, Chiang Mai University, Chiang Mai, 50200, Thailand

* Corresponding Author: Orawit Thinnukool. Email: email

Computers, Materials & Continua 2023, 74(3), 5931-5949. https://doi.org/10.32604/cmc.2023.028743

Received 16 February 2022; Accepted 06 May 2022; Issue published 28 December 2022

Abstract

Human action recognition (HAR) based on Artificial intelligence reasoning is the most important research area in computer vision. Big breakthroughs in this field have been observed in the last few years; additionally, the interest in research in this field is evolving, such as understanding of actions and scenes, studying human joints, and human posture recognition. Many HAR techniques are introduced in the literature. Nonetheless, the challenge of redundant and irrelevant features reduces recognition accuracy. They also faced a few other challenges, such as differing perspectives, environmental conditions, and temporal variations, among others. In this work, a deep learning and improved whale optimization algorithm based framework is proposed for HAR. The proposed framework consists of a few core stages i.e., frames initial preprocessing, fine-tuned pre-trained deep learning models through transfer learning (TL), features fusion using modified serial based approach, and improved whale optimization based best features selection for final classification. Two pre-trained deep learning models such as InceptionV3 and Resnet101 are fine-tuned and TL is employed to train on action recognition datasets. The fusion process increases the length of feature vectors; therefore, improved whale optimization algorithm is proposed and selects the best features. The best selected features are finally classified using machine learning (ML) classifiers. Four publicly accessible datasets such as Ut-interaction, Hollywood, Free Viewpoint Action Recognition using Motion History Volumes (IXMAS), and centre of computer vision (UCF) Sports, are employed and achieved the testing accuracy of 100%, 99.9%, 99.1%, and 100% respectively. Comparison with state of the art techniques (SOTA), the proposed method showed the improved accuracy.

Keywords

Human action recognition; deep learning; transfer learning; fusion of multiple features; features optimization

Cite This Article

APA Style

Shehzad, F., Khan, M.A., Yar, M.A.E., Sharif, M., Alhaisoni, M. et al. (2023). Two-stream deep learning architecture-based human action recognition. Computers, Materials & Continua, 74(3), 5931-5949. https://doi.org/10.32604/cmc.2023.028743

Vancouver Style

Shehzad F, Khan MA, Yar MAE, Sharif M, Alhaisoni M, Tariq U, et al. Two-stream deep learning architecture-based human action recognition. Comput Mater Contin. 2023;74(3):5931-5949 https://doi.org/10.32604/cmc.2023.028743

IEEE Style

F. Shehzad et al., "Two-Stream Deep Learning Architecture-Based Human Action Recognition," Comput. Mater. Contin., vol. 74, no. 3, pp. 5931-5949. 2023. https://doi.org/10.32604/cmc.2023.028743

BibTex EndNote RIS

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Two-Stream Deep Learning Architecture-Based Human Action Recognition

Abstract

Keywords

Cite This Article

1466

501

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link