Advances in Action Recognition: Algorithms, Applications, and Emerging Trends

Submission Deadline: 01 October 2025 (closed)

Views: 2658

Guest Editors

Prof. Muhammad Shahid Anwar

Email: shahidanwar786@gachon.ac.kr

Affiliation: Department of AI and Software, Gachon University, 13120, South Korea

Research Interests: HCI, Immersive technology, QoE, Metaverse

Prof. Ikram Syed

Email: ikram@hufs.ac.kr

Affiliation: Department of Information & Communication Engineering, Hankuk University of Foreign Studies, Yongin, 17035, South Korea

Research Interests: Machine Learning, HCI, Internet of Things

Summary

Action recognition is a critical area of research within computer vision and artificial intelligence, focused on the automatic identification and interpretation of human actions in videos or images. This technology has vast applications, from surveillance and security to healthcare, human-computer interaction, sports analytics, autonomous vehicles, and entertainment. Recent advances in deep learning, sensor fusion, and multimodal analysis have significantly enhanced the accuracy and efficiency of action recognition systems, opening new possibilities and challenges in both academic research and industry applications.

This special issue aims to bring together cutting-edge research on the latest developments, challenges, and future directions in action recognition. It will serve as a platform for researchers and practitioners to share innovative methods, present novel applications, and discuss technical challenges and potential solutions in this rapidly evolving field.

Topics of Interest:

We invite high-quality submissions on topics including, but not limited to, the following:

- Deep learning architectures (CNNs, RNNs, GNNs, Transformers) and learning techniques for action recognition.

- Multimodal action recognition using data fusion from RGB, depth, skeletal data, audio, etc.

- Real-time and efficient action recognition models for edge devices and resource-constrained environments.

- 3D and skeleton-based action recognition, including techniques leveraging human pose estimation and motion dynamics.

- Weakly supervised, zero-shot, and few-shot learning approaches for recognizing actions with limited or no labeled data.

- Action recognition in Extended Reality (XR) environments: Virtual Reality (VR), Augmented Reality (AR), and Mixed Reality (MR) applications.

- Applications in healthcare, sports analytics, entertainment, security, autonomous systems, etc.

- Development of new datasets, benchmarks, and evaluation metrics for action recognition.

- Explainability and interpretability of action recognition models, including visualization techniques and ethical considerations.

Keywords

Action Recognition; Deep Learning; Multimodal Analysis; 3D Vision; Extended Reality (XR); Real-Time Processing; Zero-Shot Learning; Human Activity Recognition; Sensor Fusion; Explainable AI

Published Papers

Open Access

ARTICLE

A Hybrid Deep Learning Approach for IoT-Enabled Human Activity Recognition and Advanced Analytics
Shtwai Alsubai, Abdullah Al Hejaili, Najib Ben Aoun, Amina Salhi, Vincent Karovič
CMC-Computers, Materials & Continua, DOI:10.32604/cmc.2026.074057
(This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
Abstract: The concept of Human Activity Recognition (HAR) is integral to applications based on Internet of Things (IoT)-enabled devices, particularly in healthcare, fitness tracking, and smart environments. The streams of data from wearable sensors are rich in information, yet their high dimensionality and variability pose a significant challenge to proper classification. To address this problem, this paper proposes hybrid architectures that integrate traditional machine learning models with a deep neural network (DNN) to deliver improved performance and enhanced capabilities for HAR tasks. Multi-sensor HAR data were used to systematically test several hybrid models, including: RF +…

Views: 225; Downloads: 52
Open Access

ARTICLE

DyLoRA-TAD: Dynamic Low-Rank Adapter for End-to-End Temporal Action Detection
Jixin Wu, Mingtao Zhou, Di Wu, Wenqi Ren, Jiatian Mei, Shu Zhang
CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.072964
(This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
Abstract: End-to-end Temporal Action Detection (TAD) has achieved remarkable progress in recent years, driven by innovations in model architectures and the emergence of Video Foundation Models (VFMs). However, existing TAD methods that perform full fine-tuning of pretrained video models often incur substantial computational costs, which become particularly pronounced when processing long video sequences. Moreover, the need for precise temporal boundary annotations makes data labeling extremely expensive. In low-resource settings where annotated samples are scarce, direct fine-tuning tends to cause overfitting. To address these challenges, we introduce Dynamic Low-Rank Adapter (DyLoRA), a lightweight fine-tuning framework tailored specifically…

Views: 1233; Downloads: 258
Open Access

ARTICLE

Human Motion Prediction Based on Multi-Level Spatial and Temporal Cues Learning
Jiayi Geng, Yuxuan Wu, Wenbo Lu, Pengxiang Su, Amel Ksibi, Wei Li, Zaffar Ahmed Shaikh, Di Gai
CMC-Computers, Materials & Continua, Vol.85, No.2, pp. 3689-3707, 2025, DOI:10.32604/cmc.2025.066944
(This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
Abstract: Predicting human motion based on historical motion sequences is a fundamental problem in computer vision, which is at the core of many applications. Existing approaches primarily focus on encoding spatial dependencies among human joints while ignoring the temporal cues and the complex relationships across non-consecutive frames. These limitations hinder the model’s ability to generate accurate predictions over longer time horizons and in scenarios with complex motion patterns. To address the above problems, we proposed a novel multi-level spatial and temporal learning model, which consists of a Cross Spatial Dependencies Encoding Module (CSM) and a Dynamic…

Views: 1210; Downloads: 614
Open Access

ARTICLE

Skeleton-Based Action Recognition Using Graph Convolutional Network with Pose Correction and Channel Topology Refinement
Yuxin Gao, Xiaodong Duan, Qiguo Dai
CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 701-718, 2025, DOI:10.32604/cmc.2025.060137
(This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
Abstract: Graph convolutional network (GCN) as an essential tool in human action recognition tasks have achieved excellent performance in previous studies. However, most current skeleton-based action recognition using GCN methods use a shared topology, which cannot flexibly adapt to the diverse correlations between joints under different motion features. The video-shooting angle or the occlusion of the body parts may bring about errors when extracting the human pose coordinates with estimation algorithms. In this work, we propose a novel graph convolutional learning framework, called PCCTR-GCN, which integrates pose correction and channel topology refinement for skeleton-based human action…

Views: 2226; Downloads: 2770
