Open Access
ARTICLE
Machine Learning Stroke Prediction in Smart Healthcare: Integrating Fuzzy K-Nearest Neighbor and Artificial Neural Networks with Feature Selection Techniques
1 Department Information System Study Program, Faculty of Science and Technology, Universitas Airlangga, Surabaya, 60286, Indonesia
2 School of Software, Northwestern Polytechnical University, Xi’an, 710072, China
3 Research Center for Quantum Engineering Design, Faculty of Science and Technology, Universitas Airlangga, Surabaya, 60286, Indonesia
4 School of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518061, China
5 Cybersecurity Center, Prince Mohammad Bin Fahd University, 617, Al Jawharah, Khobar, Dhahran, 34754, Saudi Arabia
6 Cardiff School of Technologies, Cardiff Metropolitan University, Western Avenue, Cardiff, CF5 2YB, UK
7 Instituto de Telecomunicações, Escola Superior de Tecnologia e Gestão de Águeda, Universidade de Aveiro, Águeda, 3750-127, Portugal
* Corresponding Authors: Ira Puspitasari. Email: ; Ivan Miguel Pires. Email:
(This article belongs to the Special Issue: Advances in Deep Learning and Neural Networks: Architectures, Applications, and Challenges)
Computers, Materials & Continua 2025, 82(3), 5115-5134. https://doi.org/10.32604/cmc.2025.062605
Received 22 December 2024; Accepted 03 February 2025; Issue published 06 March 2025
Abstract
This research explores the use of Fuzzy K-Nearest Neighbor (F-KNN) and Artificial Neural Networks (ANN) for predicting heart stroke incidents, focusing on the impact of feature selection methods, specifically Chi-Square and Best First Search (BFS). The study demonstrates that BFS significantly enhances the performance of both classifiers. With BFS preprocessing, the ANN model achieved an impressive accuracy of 97.5%, precision and recall of 97.5%, and an Receiver Operating Characteristics (ROC) area of 97.9%, outperforming the Chi-Square-based ANN, which recorded an accuracy of 91.4%. Similarly, the F-KNN model with BFS achieved an accuracy of 96.3%, precision and recall of 96.3%, and a Receiver Operating Characteristics (ROC) area of 96.2%, surpassing the performance of the Chi-Square F-KNN model, which showed an accuracy of 95%. These results highlight that BFS improves the ability to select the most relevant features, contributing to more reliable and accurate stroke predictions. The findings underscore the importance of using advanced feature selection methods like BFS to enhance the performance of machine learning models in healthcare applications, leading to better stroke risk management and improved patient outcomes.Keywords
Cite This Article

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.