Open Access iconOpen Access

ARTICLE

SMOTE-Optimized Machine Learning Framework for Predicting Retention in Workforce Development Training

Abdulaziz Alshahrani*

Faculty of Computer and Information Systems, Islamic University of Madinah, P.O. Box 170, Madinah, 42351, Saudi Arabia

* Corresponding Author: Abdulaziz Alshahrani. Email: email

Computers, Materials & Continua 2025, 85(2), 4067-4090. https://doi.org/10.32604/cmc.2025.065211

Abstract

High dropout rates in short-term job skills training programs hinder workforce development. This study applies machine learning to predict program completion while addressing class imbalance challenges. A dataset of 6548 records with 24 demographic, educational, program-specific, and employment-related features was analyzed. Data preprocessing involved cleaning, encoding categorical variables, and balancing the dataset using the Synthetic Minority Oversampling Technique (SMOTE), as only 15.9% of participants were dropouts. six machine learning models—Logistic Regression, Random Forest, Support Vector Machine, K-Nearest Neighbors, Naïve Bayes, and XGBoost—were evaluated on both balanced and unbalanced datasets using an 80-20 train-test split. Performance was assessed using Accuracy, Precision, Recall, F1-score, and ROC-AUC. XGBoost achieved the highest performance on the balanced dataset, with an F1-score of 0.9200 and a ROC-AUC of 0.9684, followed by Random Forest. These findings highlight the potential of machine learning for early identification of dropout trainees, aiding in retention strategies for workforce training. The results support the integration of predictive analytics to optimize intervention efforts in short-term training programs.

Keywords

Predictive analytics; workforce training; machine learning; SMOTE

Cite This Article

APA Style
Alshahrani, A. (2025). SMOTE-Optimized Machine Learning Framework for Predicting Retention in Workforce Development Training. Computers, Materials & Continua, 85(2), 4067–4090. https://doi.org/10.32604/cmc.2025.065211
Vancouver Style
Alshahrani A. SMOTE-Optimized Machine Learning Framework for Predicting Retention in Workforce Development Training. Comput Mater Contin. 2025;85(2):4067–4090. https://doi.org/10.32604/cmc.2025.065211
IEEE Style
A. Alshahrani, “SMOTE-Optimized Machine Learning Framework for Predicting Retention in Workforce Development Training,” Comput. Mater. Contin., vol. 85, no. 2, pp. 4067–4090, 2025. https://doi.org/10.32604/cmc.2025.065211



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 544

    View

  • 408

    Download

  • 0

    Like

Share Link