Open Access iconOpen Access

ARTICLE

crossmark

Fusing Geometric and Temporal Deep Features for High-Precision Arabic Sign Language Recognition

Yazeed Alkhrijah1,2, Shehzad Khalid3, Syed Muhammad Usman4,*, Amina Jameel3, Danish Hamid5

1 King Salman Center for Disability Research (KSCDR), Riyadh, 11614, Saudi Arabia
2 Department of Electrical Engineering, Imam Mohammad ibn Saud Islamic University (IMSIU), Riyadh, 11623, Saudi Arabia
3 Department of Computer Engineering, Bahria University, Islamabad, 44000, Pakistan
4 Department of Computer Science, Bahria University, Islamabad, 44000, Pakistan
5 Department of Creative Technologies, Air University, Islamabad, 44000, Pakistan

* Corresponding Author: Syed Muhammad Usman. Email: email

(This article belongs to the Special Issue: Machine Learning and Deep Learning-Based Pattern Recognition)

Computer Modeling in Engineering & Sciences 2025, 144(1), 1113-1141. https://doi.org/10.32604/cmes.2025.068726

Abstract

Arabic Sign Language (ArSL) recognition plays a vital role in enhancing the communication for the Deaf and Hard of Hearing (DHH) community. Researchers have proposed multiple methods for automated recognition of ArSL; however, these methods face multiple challenges that include high gesture variability, occlusions, limited signer diversity, and the scarcity of large annotated datasets. Existing methods, often relying solely on either skeletal data or video-based features, struggle with generalization and robustness, especially in dynamic and real-world conditions. This paper proposes a novel multimodal ensemble classification framework that integrates geometric features derived from 3D skeletal joint distances and angles with temporal features extracted from RGB videos using the Inflated 3D ConvNet (I3D). By fusing these complementary modalities at the feature level and applying a majority-voting ensemble of XGBoost, Random Forest, and Support Vector Machine classifiers, the framework robustly captures both spatial configurations and motion dynamics of sign gestures. Feature selection using the Pearson Correlation Coefficient further enhances efficiency by reducing redundancy. Extensive experiments on the ArabSign dataset, which includes RGB videos and corresponding skeletal data, demonstrate that the proposed approach significantly outperforms state-of-the-art methods, achieving an average F1-score of 97% using a majority-voting ensemble of XGBoost, Random Forest, and SVM classifiers, and improving recognition accuracy by more than 7% over previous best methods. This work not only advances the technical state-of-the-art in ArSL recognition but also provides a scalable, real-time solution for practical deployment in educational, social, and assistive communication technologies. Even though this study is about Arabic Sign Language, the framework proposed here can be extended to different sign languages, creating possibilities for potentially worldwide applicability in sign language recognition tasks.

Keywords

Arabic sign language recognition; multimodal feature fusion; ensemble classification; skeletal data; inflated 3D ConvNet (I3D)

Cite This Article

APA Style
Alkhrijah, Y., Khalid, S., Usman, S.M., Jameel, A., Hamid, D. (2025). Fusing Geometric and Temporal Deep Features for High-Precision Arabic Sign Language Recognition. Computer Modeling in Engineering & Sciences, 144(1), 1113–1141. https://doi.org/10.32604/cmes.2025.068726
Vancouver Style
Alkhrijah Y, Khalid S, Usman SM, Jameel A, Hamid D. Fusing Geometric and Temporal Deep Features for High-Precision Arabic Sign Language Recognition. Comput Model Eng Sci. 2025;144(1):1113–1141. https://doi.org/10.32604/cmes.2025.068726
IEEE Style
Y. Alkhrijah, S. Khalid, S. M. Usman, A. Jameel, and D. Hamid, “Fusing Geometric and Temporal Deep Features for High-Precision Arabic Sign Language Recognition,” Comput. Model. Eng. Sci., vol. 144, no. 1, pp. 1113–1141, 2025. https://doi.org/10.32604/cmes.2025.068726



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1199

    View

  • 684

    Download

  • 0

    Like

Share Link