Special Issues
Table of Content

Advances in Action Recognition: Algorithms, Applications, and Emerging Trends

Submission Deadline: 01 October 2025 (closed) View: 2243 Submit to Journal

Guest Editors

Prof. Muhammad Shahid Anwar

Email: shahidanwar786@gachon.ac.kr

Affiliation: Department of AI and Software, Gachon University, 13120, South Korea

Homepage:

Research Interests: HCI, Immersive technology, QoE, Metaverse

图片3.png


Prof. Ikram Syed

Email: ikram@hufs.ac.kr

Affiliation: Dept Information & Communication Engineering, Hankuk University of Foreign Studies, Yongin, 17035, South Korea.

Homepage:

Research Interests: Machine Learning, HCI, Internet of Things

图片4.png


Summary

Action recognition is a critical area of research within computer vision and artificial intelligence, focused on the automatic identification and interpretation of human actions in videos or images. This technology has vast applications, from surveillance and security to healthcare, human-computer interaction, sports analytics, autonomous vehicles, and entertainment. Recent advances in deep learning, sensor fusion, and multimodal analysis have significantly enhanced the accuracy and efficiency of action recognition systems, opening new possibilities and challenges in both academic research and industry applications.

 

The special issue aims to bring together cutting-edge research contributions that address the latest developments, challenges, and future directions in the field of action recognition. This special issue will serve as a comprehensive platform for researchers and practitioners to share innovative methods, present novel applications, and discuss the technical challenges and potential solutions in the rapidly evolving landscape of action recognition.

 

Topics of Interest:

We invite high-quality submissions on, but not limited to, the following topics:

 

- Deep learning architectures (CNNs, RNNs, GNNs, Transformers) and learning techniques for action recognition.

- Multimodal action recognition using data fusion from RGB, depth, skeletal data, audio, etc.

- Real-time and efficient action recognition models for edge devices and resource-constrained environments.

- 3D and skeleton-based action recognition, including techniques leveraging human pose estimation and motion dynamics.

- Weakly supervised, zero-shot, and few-shot learning approaches for recognizing actions with limited or no labeled data.

- Action recognition in Extended Reality (XR) environments: Virtual Reality (VR), Augmented Reality (AR), and Mixed Reality (MR) applications.

- Applications in healthcare, sports analytics, entertainment, security, autonomous systems, etc.

- Development of new datasets, benchmarks, and evaluation metrics for action recognition.

- Explainability and interpretability of action recognition models, including visualization techniques and ethical considerations.


Keywords

Action Recognition; Deep Learning; Multimodal Analysis; 3D Vision; Extended Reality (XR); Real-Time Processing; Zero-Shot Learning; Human Activity Recognition; Sensor Fusion; Explainable AI

Published Papers


  • Open Access

    ARTICLE

    Human Motion Prediction Based on Multi-Level Spatial and Temporal Cues Learning

    Jiayi Geng, Yuxuan Wu, Wenbo Lu, Pengxiang Su, Amel Ksibi, Wei Li, Zaffar Ahmed Shaikh, Di Gai
    CMC-Computers, Materials & Continua, Vol.85, No.2, pp. 3689-3707, 2025, DOI:10.32604/cmc.2025.066944
    (This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
    Abstract Predicting human motion based on historical motion sequences is a fundamental problem in computer vision, which is at the core of many applications. Existing approaches primarily focus on encoding spatial dependencies among human joints while ignoring the temporal cues and the complex relationships across non-consecutive frames. These limitations hinder the model’s ability to generate accurate predictions over longer time horizons and in scenarios with complex motion patterns. To address the above problems, we proposed a novel multi-level spatial and temporal learning model, which consists of a Cross Spatial Dependencies Encoding Module (CSM) and a Dynamic… More >

  • Open Access

    ARTICLE

    Skeleton-Based Action Recognition Using Graph Convolutional Network with Pose Correction and Channel Topology Refinement

    Yuxin Gao, Xiaodong Duan, Qiguo Dai
    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 701-718, 2025, DOI:10.32604/cmc.2025.060137
    (This article belongs to the Special Issue: Advances in Action Recognition: Algorithms, Applications, and Emerging Trends)
    Abstract Graph convolutional network (GCN) as an essential tool in human action recognition tasks have achieved excellent performance in previous studies. However, most current skeleton-based action recognition using GCN methods use a shared topology, which cannot flexibly adapt to the diverse correlations between joints under different motion features. The video-shooting angle or the occlusion of the body parts may bring about errors when extracting the human pose coordinates with estimation algorithms. In this work, we propose a novel graph convolutional learning framework, called PCCTR-GCN, which integrates pose correction and channel topology refinement for skeleton-based human action… More >

Share Link