Open Access iconOpen Access

ARTICLE

A Machine Learning-Based Framework for Heart Disease Diagnosis Using a Comprehensive Patient Cohort

Saadia Tabassum1,2, Fazal Muhammad2, Muhammad Ayaz Khan3, Muhammad Uzair Khan2,4, Dawar Awan4, Neelam Gohar5, Shahid Khan6, Amal Al-Rasheed7,*

1 Department of Electronics Engineering Technology, Shuhada-e-APS University of Technology, Nowshera, 24170, Pakistan
2 Department of Electrical Engineering, University of Engineering & Technology, Mardan, 02323, Pakistan
3 Department of Business and Management, University of Chester, Chester, 01244, UK
4 Department of Electrical Engineering Technology, Shuhada-e-APS University of Technology, Nowshera, 24170, Pakistan
5 Department of Computer Science, Shaheed Benazir Bhutto Women University, Peshawar, 25000, Pakistan
6 Department of Electrical Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, 22060, Pakistan
7 Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, 11671, Saudi Arabia

* Corresponding Author: Amal Al-Rasheed. Email: email

Computers, Materials & Continua 2025, 84(1), 1253-1278. https://doi.org/10.32604/cmc.2025.065423

Abstract

Early and accurate detection of Heart Disease (HD) is critical for improving patient outcomes, as HD remains a leading cause of mortality worldwide. Timely and precise prediction can aid in preventive interventions, reducing fatal risks associated with misdiagnosis. Machine learning (ML) models have gained significant attention in healthcare for their ability to assist professionals in diagnosing diseases with high accuracy. This study utilizes 918 instances from publicly available UCI and Kaggle datasets to develop and compare the performance of various ML models, including Adaptive Boosting (AB), Naïve Bayes (NB), Extreme Gradient Boosting (XGB), Bagging, and Logistic Regression (LR). Before model training, data preprocessing techniques such as handling missing values, outlier detection using Isolation Forest, and feature scaling were applied to improve model performance. The evaluation was conducted using performance metrics, including accuracy, precision, recall, and F1-score. Among the tested models, XGB demonstrated the highest predictive performance, achieving an accuracy of 94.34% and an F1-score of 95.19%, surpassing other models and previous studies in HD prediction. LR closely followed with an accuracy of 93.08% and an F1-score of 93.99%, indicating competitive performance. In contrast, NB exhibited the lowest performance, with an accuracy of 88.05% and an F1-score of 89.02%, highlighting its limitations in handling complex patterns within the dataset. Although ML models show superior performance as compared to previous studies, some limitations exist, including the use of publicly available datasets, which may not fully capture real-world clinical variations, and the lack of feature selection techniques, which could impact model interpretability and robustness. Despite these limitations, the findings highlight the potential of ML-based frameworks for accurate and efficient HD detection, demonstrating their value as decision-support tools in clinical settings.

Keywords

Heart disease; machine learning; artificial intelligence; accuracy; prediction

Cite This Article

APA Style
Tabassum, S., Muhammad, F., Khan, M.A., Khan, M.U., Awan, D. et al. (2025). A Machine Learning-Based Framework for Heart Disease Diagnosis Using a Comprehensive Patient Cohort. Computers, Materials & Continua, 84(1), 1253–1278. https://doi.org/10.32604/cmc.2025.065423
Vancouver Style
Tabassum S, Muhammad F, Khan MA, Khan MU, Awan D, Gohar N, et al. A Machine Learning-Based Framework for Heart Disease Diagnosis Using a Comprehensive Patient Cohort. Comput Mater Contin. 2025;84(1):1253–1278. https://doi.org/10.32604/cmc.2025.065423
IEEE Style
S. Tabassum et al., “A Machine Learning-Based Framework for Heart Disease Diagnosis Using a Comprehensive Patient Cohort,” Comput. Mater. Contin., vol. 84, no. 1, pp. 1253–1278, 2025. https://doi.org/10.32604/cmc.2025.065423



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 357

    View

  • 111

    Download

  • 0

    Like

Share Link