A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Xiaolei Ding; Wenjian Liu

doi:10.32604/cmc.2025.072458

Open Access icon Open Access

ARTICLE

A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Xiaolei Ding, Wenjian Liu^*

Faculty of Data Science, City University of Macau, Macau, China

* Corresponding Author: Wenjian Liu. Email: email

Computers, Materials & Continua 2026, 88(2), 25 https://doi.org/10.32604/cmc.2025.072458

Received 27 August 2025; Accepted 29 September 2025; Issue published 15 June 2026

Abstract

Backdoor attacks pose a critical threat to the reliability and trustworthiness of machine learning models, as they allow adversaries to manipulate model behavior through the injection of malicious patterns during training. Existing defenses, such as data filtering, fine-tuning, and model pruning, often lack provable guarantees or require retraining from scratch, resulting in significant computational costs. In this work, we propose GTMU (Game-Theoretic Machine Unlearning), a novel backdoor removal framework that formulates the unlearning process as a repeated game between the defender and a virtual attacker. The defender aims to strategically remove poisoned contributions while preserving benign knowledge, whereas the virtual attacker attempts to maintain the backdoor’s effectiveness. We introduce a Stackelberg game formulation to determine optimal unlearning policies and integrate a Nash equilibrium-based update rule to balance model utility and security. Our method leverages influence function approximations to estimate per-sample contribution and employs a regret-minimization strategy to adaptively select unlearning candidates. Experimental evaluations on image classification benchmarks under various backdoor settings demonstrate that GTMU consistently achieves over 95% clean accuracy while reducing backdoor success rates to below 2%, outperforming state-of-the-art backdoor defense methods in both efficiency and robustness. The proposed approach offers a theoretically grounded and computationally efficient solution for secure model deployment in adversarial environments.

Keywords

Machine learning; backdoor defense; game theory

Cite This Article

APA Style

Ding, X., Liu, W. (2026). A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation. Computers, Materials & Continua, 88(2), 25. https://doi.org/10.32604/cmc.2025.072458

Vancouver Style

Ding X, Liu W. A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation. Comput Mater Contin. 2026;88(2):25. https://doi.org/10.32604/cmc.2025.072458

IEEE Style

X. Ding and W. Liu, “A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation,” Comput. Mater. Contin., vol. 88, no. 2, pp. 25, 2026. https://doi.org/10.32604/cmc.2025.072458

BibTex EndNote RIS

Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Abstract

Keywords

Cite This Article

918

395

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link