Open Access

ARTICLE

A New Hybrid Feature Selection Sequence for Predicting Breast Cancer Survivability Using Clinical Datasets

E. Jenifer Sweetlin*, S. Saudia

Centre for Information Technology and Engineering, Manonmaniam Sundaranar University, Tirunelveli, India

* Corresponding Author: E. Jenifer Sweetlin. Email: email

Intelligent Automation & Soft Computing 2023, 37(1), 343-367. https://doi.org/10.32604/iasc.2023.036742

Abstract

This paper proposes a hybrid feature selection sequence that combines filter and wrapper methods to improve the accuracy of Machine Learning (ML) based supervised classifiers in classifying the survivability of breast cancer patients into two classes, living and deceased, using the METABRIC and Surveillance, Epidemiology and End Results (SEER) datasets. The ML-based classifiers used in the analysis are Multiple Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, Support Vector Machine and Multilayer Perceptron. The workflow of the proposed ML algorithm sequence comprises the following stages: data cleaning, data balancing, feature selection via a filter and wrapper sequence, cross-validation-based training, testing and performance evaluation. The results are compared in terms of the following classification metrics: Accuracy, Precision, F1 score, True Positive Rate, True Negative Rate, False Positive Rate, False Negative Rate, Area under the Receiver Operating Characteristic curve, Area under the Precision-Recall curve and Matthews Correlation Coefficient. The comparison shows that, for every supervised classifier, the proposed feature selection sequence produces better results than all other feature selection sequences considered in the analysis.
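To make the threshold-based metrics named in the abstract concrete, the following minimal sketch computes them from the four cells of a binary confusion matrix using their standard textbook definitions. It is an illustration only and not the authors' code: the function name `binary_metrics`, the example counts, the choice of "deceased" as the positive class and the convention of returning 0.0 for a zero denominator are all assumptions. The two area-under-curve metrics are omitted because they require classifier scores rather than counts.

```python
from math import sqrt


def binary_metrics(tp, fp, tn, fn):
    """Return standard confusion-matrix metrics for a two-class problem."""

    def ratio(num, den):
        # Assumed convention: an undefined ratio (zero denominator) is 0.0.
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    tpr = ratio(tp, tp + fn)  # True Positive Rate (recall, sensitivity)
    mcc_den = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "precision": precision,
        "tpr": tpr,
        "tnr": ratio(tn, tn + fp),  # True Negative Rate (specificity)
        "fpr": ratio(fp, fp + tn),  # False Positive Rate = 1 - TNR
        "fnr": ratio(fn, fn + tp),  # False Negative Rate = 1 - TPR
        "f1": ratio(2 * precision * tpr, precision + tpr),
        "mcc": ratio(tp * tn - fp * fn, mcc_den),  # Matthews Correlation Coefficient
    }


# Hypothetical counts, with "deceased" treated as the positive class.
example = binary_metrics(tp=40, fp=10, tn=45, fn=5)
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```

Because the Matthews Correlation Coefficient uses all four cells, it can stay informative when the two classes are imbalanced, which is one common reason to report it alongside Accuracy.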

This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.