Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (29)
  • Open Access

    ARTICLE

    An Attention-Based 6D Pose Estimation Network for Weakly Textured Industrial Parts

    Song Xu1,2,*, Liang Xuan1,2, Yifeng Li1,2, Qiang Zhang1,2

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-19, 2026, DOI:10.32604/cmc.2025.070472 - 09 December 2025

    Abstract The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts. In the industrial robot production scenarios, the 6D pose estimation of industrial parts mainly faces two challenges: one is the loss of information and interference caused by occlusion and stacking in the sorting scenario, the other is the difficulty of feature extraction due to the weak texture of industrial parts. To address the above problems, this paper proposes an attention-based pixel-level voting network for 6D pose estimation of weakly textured industrial parts, namely CB-PVNet. On the… More >

  • Open Access

    ARTICLE

    VMHPE: Human Pose Estimation for Virtual Maintenance Tasks

    Shuo Zhang, Hanwu He, Yueming Wu*

    CMC-Computers, Materials & Continua, Vol.85, No.1, pp. 801-826, 2025, DOI:10.32604/cmc.2025.066540 - 29 August 2025

    Abstract Virtual maintenance, as an important means of industrial training and education, places strict requirements on the accuracy of participant pose perception and assessment of motion standardization. However, existing research mainly focuses on human pose estimation in general scenarios, lacking specialized solutions for maintenance scenarios. This paper proposes a virtual maintenance human pose estimation method based on multi-scale feature enhancement (VMHPE), which integrates adaptive input feature enhancement, multi-scale feature correction for improved expression of fine movements and complex poses, and multi-scale feature fusion to enhance keypoint localization accuracy. Meanwhile, this study constructs the first virtual maintenance-specific… More >

  • Open Access

    ARTICLE

    Evaluating Method of Lower Limb Coordination Based on Spatial-Temporal Dependency Networks

    Xuelin Qin1, Huinan Sang2, Shihua Wu2, Shishu Chen2, Zhiwei Chen2, Yongjun Ren2,*

    CMC-Computers, Materials & Continua, Vol.85, No.1, pp. 1959-1980, 2025, DOI:10.32604/cmc.2025.066266 - 29 August 2025

    Abstract As an essential tool for quantitative analysis of lower limb coordination, optical motion capture systems with marker-based encoding still suffer from inefficiency, high costs, spatial constraints, and the requirement for multiple markers. While 3D pose estimation algorithms combined with ordinary cameras offer an alternative, their accuracy often deteriorates under significant body occlusion. To address the challenge of insufficient 3D pose estimation precision in occluded scenarios—which hinders the quantitative analysis of athletes’ lower-limb coordination—this paper proposes a multimodal training framework integrating spatiotemporal dependency networks with text-semantic guidance. Compared to traditional optical motion capture systems, this work… More >

  • Open Access

    REVIEW

    Monocular 3D Human Pose Estimation for REBA Ergonomics: A Critical Review of Recent Advances

    Ahmad Mwfaq Bataineh1,2,*, Ahmad Sufril Azlan Mohamed1

    CMC-Computers, Materials & Continua, Vol.84, No.1, pp. 93-124, 2025, DOI:10.32604/cmc.2025.064250 - 09 June 2025

    Abstract Advancements in deep learning have considerably enhanced techniques for Rapid Entire Body Assessment (REBA) pose estimation by leveraging progress in three-dimensional human modeling. This survey provides an extensive overview of recent advancements, particularly emphasizing monocular image-based methodologies and their incorporation into ergonomic risk assessment frameworks. By reviewing literature from 2016 to 2024, this study offers a current and comprehensive analysis of techniques, existing challenges, and emerging trends in three-dimensional human pose estimation. In contrast to traditional reviews organized by learning paradigms, this survey examines how three-dimensional pose estimation is effectively utilized within musculoskeletal disorder (MSD)… More >

  • Open Access

    ARTICLE

    Self-Supervised Monocular Depth Estimation with Scene Dynamic Pose

    Jing He1, Haonan Zhu2, Chenhao Zhao1, Minrui Zhao3,*

    CMC-Computers, Materials & Continua, Vol.83, No.3, pp. 4551-4573, 2025, DOI:10.32604/cmc.2025.062437 - 19 May 2025

    Abstract Self-supervised monocular depth estimation has emerged as a major research focus in recent years, primarily due to the elimination of ground-truth depth dependence. However, the prevailing architectures in this domain suffer from inherent limitations: existing pose network branches infer camera ego-motion exclusively under static-scene and Lambertian-surface assumptions. These assumptions are often violated in real-world scenarios due to dynamic objects, non-Lambertian reflectance, and unstructured background elements, leading to pervasive artifacts such as depth discontinuities (“holes”), structural collapse, and ambiguous reconstruction. To address these challenges, we propose a novel framework that integrates scene dynamic pose estimation into… More >

  • Open Access

    ARTICLE

    Hourglass-GCN for 3D Human Pose Estimation Using Skeleton Structure and View Correlation

    Ange Chen, Chengdong Wu*, Chuanjiang Leng

    CMC-Computers, Materials & Continua, Vol.82, No.1, pp. 173-191, 2025, DOI:10.32604/cmc.2024.059284 - 03 January 2025

    Abstract Previous multi-view 3D human pose estimation methods neither correlate different human joints in each view nor model learnable correlations between the same joints in different views explicitly, meaning that skeleton structure information is not utilized and multi-view pose information is not completely fused. Moreover, existing graph convolutional operations do not consider the specificity of different joints and different views of pose information when processing skeleton graphs, making the correlation weights between nodes in the graph and their neighborhood nodes shared. Existing Graph Convolutional Networks (GCNs) cannot extract global and deep-level skeleton structure information and view… More >

  • Open Access

    ARTICLE

    DAUNet: Detail-Aware U-Shaped Network for 2D Human Pose Estimation

    Xi Li1,2, Yuxin Li2, Zhenhua Xiao3,*, Zhenghua Huang1, Lianying Zou1

    CMC-Computers, Materials & Continua, Vol.81, No.2, pp. 3325-3349, 2024, DOI:10.32604/cmc.2024.056464 - 18 November 2024

    Abstract Human pose estimation is a critical research area in the field of computer vision, playing a significant role in applications such as human-computer interaction, behavior analysis, and action recognition. In this paper, we propose a U-shaped keypoint detection network (DAUNet) based on an improved ResNet subsampling structure and spatial grouping mechanism. This network addresses key challenges in traditional methods, such as information loss, large network redundancy, and insufficient sensitivity to low-resolution features. DAUNet is composed of three main components. First, we introduce an improved BottleNeck block that employs partial convolution and strip pooling to reduce… More >

  • Open Access

    ARTICLE

    Human Interaction Recognition in Surveillance Videos Using Hybrid Deep Learning and Machine Learning Models

    Vesal Khean1, Chomyong Kim2, Sunjoo Ryu2, Awais Khan1, Min Kyung Hong3, Eun Young Kim4, Joungmin Kim5, Yunyoung Nam3,*

    CMC-Computers, Materials & Continua, Vol.81, No.1, pp. 773-787, 2024, DOI:10.32604/cmc.2024.056767 - 15 October 2024

    Abstract Human Interaction Recognition (HIR) was one of the challenging issues in computer vision research due to the involvement of multiple individuals and their mutual interactions within video frames generated from their movements. HIR requires more sophisticated analysis than Human Action Recognition (HAR) since HAR focuses solely on individual activities like walking or running, while HIR involves the interactions between people. This research aims to develop a robust system for recognizing five common human interactions, such as hugging, kicking, pushing, pointing, and no interaction, from video sequences using multiple cameras. In this study, a hybrid Deep… More >

  • Open Access

    ARTICLE

    Analyzing the Impact of Scene Transitions on Indoor Camera Localization through Scene Change Detection in Real-Time

    Muhammad S. Alam1,5,*, Farhan B. Mohamed1,3, Ali Selamat2, Faruk Ahmed4, AKM B. Hossain6,7

    Intelligent Automation & Soft Computing, Vol.39, No.3, pp. 417-436, 2024, DOI:10.32604/iasc.2024.051999 - 11 July 2024

    Abstract Real-time indoor camera localization is a significant problem in indoor robot navigation and surveillance systems. The scene can change during the image sequence and plays a vital role in the localization performance of robotic applications in terms of accuracy and speed. This research proposed a real-time indoor camera localization system based on a recurrent neural network that detects scene change during the image sequence. An annotated image dataset trains the proposed system and predicts the camera pose in real-time. The system mainly improved the localization performance of indoor cameras by more accurately predicting the camera More >

  • Open Access

    ARTICLE

    Abnormal Action Recognition with Lightweight Pose Estimation Network in Electric Power Training Scene

    Yunfeng Cai1, Ran Qin1, Jin Tang1, Long Zhang1, Xiaotian Bi1, Qing Yang2,*

    CMC-Computers, Materials & Continua, Vol.79, No.3, pp. 4979-4994, 2024, DOI:10.32604/cmc.2024.050435 - 20 June 2024

    Abstract Electric power training is essential for ensuring the safety and reliability of the system. In this study, we introduce a novel Abnormal Action Recognition (AAR) system that utilizes a Lightweight Pose Estimation Network (LPEN) to efficiently and effectively detect abnormal fall-down and trespass incidents in electric power training scenarios. The LPEN network, comprising three stages—MobileNet, Initial Stage, and Refinement Stage—is employed to swiftly extract image features, detect human key points, and refine them for accurate analysis. Subsequently, a Pose-aware Action Analysis Module (PAAM) captures the positional coordinates of human skeletal points in each frame. Finally, More >

Displaying 1-10 on page 1 of 29. Per Page