Open Access

ARTICLE

Towards Real-Time Multi-Person Pose Estimation via Feature Selection and Sharpening Mechanisms

Chengang Dong1,2, Yongkang Ding2, Jianwei Hu1,3,*

1 School of Mathematics and Statistics, Huangshan University, Huangshan, China
2 College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
3 Huangshan Technology Innovation Center for Digital Economy and Big Data Analysis, Huangshan, China

* Corresponding Author: Jianwei Hu. Email: email

Computer Modeling in Engineering & Sciences 2026, 146(3), 32 https://doi.org/10.32604/cmes.2026.079062

Abstract

Real-time multi-person pose estimation (MPE) with neural networks aims to detect multiple human instances and regress their joint coordinates simultaneously in dynamic scenes. However, high model complexity and limited expression of keypoint information leave both the efficiency and the accuracy of real-time MPE in need of improvement. To address these issues, this work develops FSEM-Pose, a real-time MPE model built on the YOLOv10 framework. First, FSEM-Pose upgrades the backbone of the baseline network with Feature Shuffling-Convolution (FS-Conv), which reduces the backbone size while retaining as much spatial information from the input image as possible. Second, FSEM-Pose incorporates a Feature Saliency Enhancement Module (FSEM) to strengthen the feature encoding of human keypoints, thereby improving the accuracy of pose estimation. Finally, FSEM-Pose improves inference efficiency through a lightweight head built on shared convolutional layers. The method achieves competitive accuracy and efficiency on the MS COCO 2017 and CrowdPose datasets: despite its lightweight design, it improves average precision (AP) by 2.1% and 2.5%, respectively.
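To give a rough sense of why shared convolutional layers make a head lighter, the sketch below counts the weights of a convolutional head under two layouts: one branch per feature scale, and one branch reused across all scales. It is illustrative arithmetic only, not the FSEM-Pose head. The channel width, the number of scales, the number of layers, and the helper names `conv_params` and `head_params` are all assumptions, since the abstract does not specify them.

```python
def conv_params(c_in: int, c_out: int, k: int, bias: bool = True) -> int:
    """Number of learnable parameters in a standard k x k 2D convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def head_params(channels: int, num_scales: int, num_layers: int,
                k: int = 3, shared: bool = False) -> int:
    """Parameters of a head made of `num_layers` convs per branch.

    Assumption: every scale has the same channel width, so one branch
    can be reused across scales when `shared` is True.
    """
    per_branch = num_layers * conv_params(channels, channels, k)
    # A shared head stores one branch; a per-scale head stores one per scale.
    return per_branch if shared else num_scales * per_branch


# Hypothetical sizes chosen only for illustration.
separate = head_params(channels=64, num_scales=3, num_layers=2)
shared = head_params(channels=64, num_scales=3, num_layers=2, shared=True)
print(f"per-scale head: {separate} parameters")
print(f"shared head:    {shared} parameters")
```

Under these assumed sizes the shared layout stores one third of the weights of the per-scale layout. The amount of computation per scale is unchanged by sharing alone, so any speed-up depends on further design choices that the abstract does not detail.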

Keywords

Pose estimation; feature sharpening; lightweight; YOLOv10

Cite This Article

APA Style
Dong, C., Ding, Y., Hu, J. (2026). Towards Real-Time Multi-Person Pose Estimation via Feature Selection and Sharpening Mechanisms. Computer Modeling in Engineering & Sciences, 146(3), 32. https://doi.org/10.32604/cmes.2026.079062
Vancouver Style
Dong C, Ding Y, Hu J. Towards Real-Time Multi-Person Pose Estimation via Feature Selection and Sharpening Mechanisms. Comput Model Eng Sci. 2026;146(3):32. https://doi.org/10.32604/cmes.2026.079062
IEEE Style
C. Dong, Y. Ding, and J. Hu, “Towards Real-Time Multi-Person Pose Estimation via Feature Selection and Sharpening Mechanisms,” Comput. Model. Eng. Sci., vol. 146, no. 3, pp. 32, 2026. https://doi.org/10.32604/cmes.2026.079062



Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.