TY  - EJOU
AU  - Qu, Jing 
AU  - Li, Yanmei 
AU  - Liu, Changrong 
AU  - Wang, Wen 
AU  - Fu, Weiping 

TI  - Prediction of Assembly Intent for Human-Robot Collaboration Based on Video Analytics and Hidden Markov Model
T2  - Computers, Materials \& Continua

PY  - 2025
VL  - 84
IS  - 2
SN  - 1546-2226

AB  - Despite the gradual transformation of traditional manufacturing by the Human-Robot Collaboration Assembly (HRCA), challenges remain in the robot’s ability to understand and predict human assembly intentions. This study aims to enhance the robot’s comprehension and prediction capabilities of operator assembly intentions by capturing and analyzing operator behavior and movements. We propose a video feature extraction method based on the Temporal Shift Module Network (TSM-ResNet50) to extract spatiotemporal features from assembly videos and differentiate various assembly actions using feature differences between video frames. Furthermore, we construct an action recognition and segmentation model based on the Refined-Multi-Scale Temporal Convolutional Network (Refined-MS-TCN) to identify assembly action intervals and accurately acquire action categories. Experiments on our self-built reducer assembly action dataset demonstrate that our network can classify assembly actions frame by frame, achieving an accuracy rate of 83%. Additionally, we develop a Hidden Markov Model (HMM) integrated with assembly task constraints to predict operator assembly intentions based on the probability transition matrix and assembly task constraints. The experimental results show that our method for predicting operator assembly intentions can achieve an accuracy of 90.6%, which is a 13.3% improvement over the HMM without task constraints.
KW  - Human-robot collaboration assembly; assembly intent prediction; video feature extraction; action recognition and segmentation; HMM

DO  - 10.32604/cmc.2025.065895