A Study on the Explainability of Thyroid Cancer Prediction: SHAP Values and Association-Rule Based Feature Integration Framework

Sujithra Sankar^1,*, S. Sathyalakshmi²
1 Department of Computer Applications, Hindustan Institute of Technology and Science, Chennai, Tamil Nadu, India
2 Department of Computer Engineering, Hindustan Institute of Technology and Science, Chennai, Tamil Nadu, India
* Corresponding Author: Sujithra Sankar. Email: email
(This article belongs to the Special Issue: Intelligent Detection Methods for AI-Powered Healthcare and Enhanced Medical Insights)

Computers, Materials & Continua https://doi.org/10.32604/cmc.2024.048408

Received 06 December 2023; Accepted 29 February 2024; Published online 25 April 2024

Download PDF

Abstract

In the era of advanced machine learning techniques, the development of accurate predictive models for complex medical conditions, such as thyroid cancer, has shown remarkable progress. Accurate predictive models for thyroid cancer enhance early detection, improve resource allocation, and reduce overtreatment. However, the widespread adoption of these models in clinical practice demands predictive performance along with interpretability and transparency. This paper proposes a novel association-rule based feature-integrated machine learning model which shows better classification and prediction accuracy than present state-of-the-art models. Our study also focuses on the application of SHapley Additive exPlanations (SHAP) values as a powerful tool for explaining thyroid cancer prediction models. In the proposed method, the association-rule based feature integration framework identifies frequently occurring attribute combinations in the dataset. The original dataset is used in training machine learning models, and further used in generating SHAP values from these models. In the next phase, the dataset is integrated with the dominant feature sets identified through association-rule based analysis. This new integrated dataset is used in re-training the machine learning models. The new SHAP values generated from these models help in validating the contributions of feature sets in predicting malignancy. The conventional machine learning models lack interpretability, which can hinder their integration into clinical decision-making systems. In this study, the SHAP values are introduced along with association-rule based feature integration as a comprehensive framework for understanding the contributions of feature sets in modelling the predictions. The study discusses the importance of reliable predictive models for early diagnosis of thyroid cancer, and a validation framework of explainability. The proposed model shows an accuracy of 93.48%. Performance metrics such as precision, recall, F1-score, and the area under the receiver operating characteristic (AUROC) are also higher than the baseline models. The results of the proposed model help us identify the dominant feature sets that impact thyroid cancer classification and prediction. The features {calcification} and {shape} consistently emerged as the top-ranked features associated with thyroid malignancy, in both association-rule based interestingness metric values and SHAP methods. The paper highlights the potential of the rule-based integrated models with SHAP in bridging the gap between the machine learning predictions and the interpretability of this prediction which is required for real-world medical applications.

Keywords

Explainable AI; machine learning; clinical decision support systems; thyroid cancer; association-rule based framework; SHAP values; classification and prediction

Downloads
- Full-Text PDF
Citation Tools
- BibTex
- EndNote
- RIS

99

View
13

Download
0

Like

Brake Fault Diagnosis Through Machine Learning Approaches – A Review
Alamelu Manghai T.M., Jegadeeshwaran...
A Comparative Study of Bayes Classifiers for Blade Fault Diagnosis in Wind Turbines through Vibration Signals
A. Joshuva, V. Sugumaran
Condition Monitoring of Roller Bearing by K-Star Classifier and K-Nearest Neighborhood Classifier Using Sound Signal.
Rahul Kumar Sharma, V. Sugumaran,...
Use of Discrete Wavelet Features and Support Vector Machine for Fault Diagnosis of Face Milling Tool
C. K. Madhusudana, N. Gangadhar,...
Vibration Based Fault Diagnosis of a Hydraulic Brake System using Variational Mode Decomposition (VMD)
R. Jegadeeshwaran, V. Sugumaran,...

All issues

Online First

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

A Study on the Explainability of Thyroid Cancer Prediction: SHAP Values and Association-Rule Based Feature Integration Framework

Abstract

Keywords

99

13

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link