Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique

Ali Algarni; Mahmoud Ragab; Wardah Alamri; Samih Mostafa

doi:10.32604/csse.2022.022152

Open Access icon Open Access

ARTICLE

Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique

Ali Algarni¹, Mahmoud Ragab^2,3,4,*, Wardah Alamri⁵, Samih M. Mostafa⁶

1 Statistics Department, Faculty of Science, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
2 Information Technology Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
3 Mathematics Department, Faculty of Science, Al-Azhar University, Naser City, 11884, Egypt
4 Centre of Artificial Intelligence for Precision Medicines, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
5 Educational Technology Department, Educational Graduate Studies Faculty, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
6 Computer Science Department, Faculty of Computers and Information, South Valley University, Qena, 83523, Egypt

* Corresponding Author: Mahmoud Ragab. Email: email

Computer Systems Science and Engineering 2022, 42(1), 303-318. https://doi.org/10.32604/csse.2022.022152

Received 29 July 2021; Accepted 30 August 2021; Issue published 02 December 2021

Abstract

The accuracy of the statistical learning model depends on the learning technique used which in turn depends on the dataset’s values. In most research studies, the existence of missing values (MVs) is a vital problem. In addition, any dataset with MVs cannot be used for further analysis or with any data driven tool especially when the percentage of MVs are high. In this paper, the authors propose a novel algorithm for dealing with MVs depending on the feature selection (FS) of similarity classifier with fuzzy entropy measure. The proposed algorithm imputes MVs in cumulative order. The candidate feature to be manipulated is selected using similarity classifier with Parkash’s fuzzy entropy measure. The predictive model to predict MVs within the candidate feature is the Bayesian Ridge Regression (BRR) technique. Furthermore, any imputed features will be incorporated within the BRR equation to impute the MVs in the next chosen incomplete feature. The proposed algorithm was compared against some practical state-of-the-art imputation methods by conducting an experiment on four medical datasets which were gathered from several databases repository with MVs generated from the three missingness mechanisms. The evaluation metrics of mean absolute error (MAE), root mean square error (RMSE) and coefficient of determination (R² score) were used to measure the performance. The results exhibited that performance vary depending on the size of the dataset, amount of MVs and the missingness mechanism type. Moreover, compared to other methods, the results showed that the proposed method gives better accuracy and less error in most cases.

Keywords

Bayesian ridge regression; fuzzy entropy measure; feature selection; imputation; missing values; missingness mechanisms; similarity classifier; medical dataset

Cite This Article

APA Style

Algarni, A., Ragab, M., Alamri, W., Mostafa, S.M. (2022). Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique. Computer Systems Science and Engineering, 42(1), 303–318. https://doi.org/10.32604/csse.2022.022152

Vancouver Style

Algarni A, Ragab M, Alamri W, Mostafa SM. Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique. Comput Syst Sci Eng. 2022;42(1):303–318. https://doi.org/10.32604/csse.2022.022152

IEEE Style

A. Algarni, M. Ragab, W. Alamri, and S. M. Mostafa, “Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique,” Comput. Syst. Sci. Eng., vol. 42, no. 1, pp. 303–318, 2022. https://doi.org/10.32604/csse.2022.022152

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Towards Improving Predictive Statistical Learning Model Accuracy by Enhancing Learning Technique

Abstract

Keywords

Cite This Article

2928

1914

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link