Submission Deadline: 31 July 2026 View: 211 Submit to Special Issue
Prof. Rui Yao
Email: ruiyao@cumt.edu.cn
Affiliation: School of Computer Sciences and Technology, China University of Mining and Technology, Xuzhou, 221116, China
Research Interests: computer vision, pattern recognition, deep learning, and artificial intelligence
Prof. Huanlong Zhang
Email: zhl_lit@163.com
Affiliation: College of Electric and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 450000, China
Research Interests: pattern recognition, machine learning, image processing, computer vision, and intelligent man-machine systems
Assoc. Prof. Hancheng Zhu
Email: zhuhancheng@cumt.edu.cn
Affiliation: School of Computer Sciences and Technology, China University of Mining and Technology, Xuzhou, 221116, China
Research Interests: computer vision, visual aesthetics assessment and enhancement
Dr. Kunyang Sun
Email: kunyang_sun@cumt.edu.cn
Affiliation: School of Computer Sciences and Technology, China University of Mining and Technology, Xuzhou, 221116, China
Research Interests: object detection,segmentation,6G communication
With the explosive growth of video data across surveillance, autonomous driving, robotics, and multimedia platforms, robust object tracking and scene-understanding have become central to modern intelligent systems and interactive applications.
This special issue aims to bring together cutting-edge research on video object tracking and scene understanding from both theoretical and applied perspectives. We invite manuscripts that make significant advances in modeling temporal dynamics, leveraging multimodal inputs (e.g., RGB, T, D, event), exploiting self-supervised or few-shot regimes, and integrating scene semantics and context into tracking pipelines. The scope includes new architectures, generative and adversarial strategies, domain adaptation, and real-world deployment, with emphasis on both algorithmic novelty and system-level performance.
Suggested Themes
- Multimodal video tracking (RGB-T, RGB-D, thermal, event-based)
- Transformer, graph, and generative model frameworks for video tracking
- Self-supervised, few-shot, or zero-shot tracking approaches
- Robust and adversarial tracking in complex scenarios (occlusion, lighting, deformation)
- Scene-understanding and semantic context integration for tracking (behaviour prediction, crowd analytics)
- Lightweight, real-time tracking systems for edge/embedded platforms
- Applications in autonomous driving, UAVs, robotics, smart surveillance, and multimedia analytics


Submit a Paper
Propose a Special lssue