A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Xiaolei Ding, Wenjian Liu^*
Faculty of Data Science, City University of Macau, Macau, China
* Corresponding Author: Wenjian Liu. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.072458

Received 27 August 2025; Accepted 29 September 2025; Published online 25 May 2026

Download PDF

Abstract

Backdoor attacks pose a critical threat to the reliability and trustworthiness of machine learning models, as they allow adversaries to manipulate model behavior through the injection of malicious patterns during training. Existing defenses, such as data filtering, fine-tuning, and model pruning, often lack provable guarantees or require retraining from scratch, resulting in significant computational costs. In this work, we propose GTMU (Game-Theoretic Machine Unlearning), a novel backdoor removal framework that formulates the unlearning process as a repeated game between the defender and a virtual attacker. The defender aims to strategically remove poisoned contributions while preserving benign knowledge, whereas the virtual attacker attempts to maintain the backdoor’s effectiveness. We introduce a Stackelberg game formulation to determine optimal unlearning policies and integrate a Nash equilibrium-based update rule to balance model utility and security. Our method leverages influence function approximations to estimate per-sample contribution and employs a regret-minimization strategy to adaptively select unlearning candidates. Experimental evaluations on image classification benchmarks under various backdoor settings demonstrate that GTMU consistently achieves over 95% clean accuracy while reducing backdoor success rates to below 2%, outperforming state-of-the-art backdoor defense methods in both efficiency and robustness. The proposed approach offers a theoretically grounded and computationally efficient solution for secure model deployment in adversarial environments.

Keywords

Machine learning; backdoor defense; game theory

Downloads
- Full-Text PDF
Citation Tools
- BibTex
- EndNote
- RIS

157

View
51

Download
0

Like

Sailfish Optimizer with EfficientNet Model for Apple Leaf Disease Detection
Mazen Mushabab Alqahtani, Ashit...
Deep Learning-Based Program-Wide Binary Code Similarity for Smart Contracts
Yuan Zhuang, Baobao Wang, Jianguo...
Crops Leaf Diseases Recognition: A Framework of Optimum Deep Learning Features
Shafaq Abbas, Muhammad Attique...
Hybrid Mobile Cloud Computing Architecture with Load Balancing for Healthcare Systems
Ahyoung Lee, Jui Mhatre, Rupak...
Image-Based Automatic Energy Meter Reading Using Deep Learning
Muhammad Imran, Hafeez Anwar,...

All issues

Online First

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Abstract

Keywords

157

51

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link