Open Access

ARTICLE

A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation

Xiaolei Ding, Wenjian Liu*

Faculty of Data Science, City University of Macau, Macau, China

* Corresponding Author: Wenjian Liu.

Computers, Materials & Continua 2026, 88(2), 25 https://doi.org/10.32604/cmc.2025.072458

Abstract

Backdoor attacks pose a critical threat to the reliability and trustworthiness of machine learning models, as they allow adversaries to manipulate model behavior through the injection of malicious patterns during training. Existing defenses, such as data filtering, fine-tuning, and model pruning, often lack provable guarantees or require retraining from scratch, resulting in significant computational costs. In this work, we propose GTMU (Game-Theoretic Machine Unlearning), a novel backdoor removal framework that formulates the unlearning process as a repeated game between the defender and a virtual attacker. The defender aims to strategically remove poisoned contributions while preserving benign knowledge, whereas the virtual attacker attempts to maintain the backdoor’s effectiveness. We introduce a Stackelberg game formulation to determine optimal unlearning policies and integrate a Nash equilibrium-based update rule to balance model utility and security. Our method leverages influence function approximations to estimate per-sample contribution and employs a regret-minimization strategy to adaptively select unlearning candidates. Experimental evaluations on image classification benchmarks under various backdoor settings demonstrate that GTMU consistently achieves over 95% clean accuracy while reducing backdoor success rates to below 2%, outperforming state-of-the-art backdoor defense methods in both efficiency and robustness. The proposed approach offers a theoretically grounded and computationally efficient solution for secure model deployment in adversarial environments.

Keywords

Machine learning; backdoor defense; game theory

Cite This Article

APA Style
Ding, X., Liu, W. (2026). A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation. Computers, Materials & Continua, 88(2), 25. https://doi.org/10.32604/cmc.2025.072458
Vancouver Style
Ding X, Liu W. A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation. Comput Mater Contin. 2026;88(2):25. https://doi.org/10.32604/cmc.2025.072458
IEEE Style
X. Ding and W. Liu, “A Game-Theoretic Framework for Strategic Machine Unlearning in Backdoor Mitigation,” Comput. Mater. Contin., vol. 88, no. 2, pp. 25, 2026. https://doi.org/10.32604/cmc.2025.072458



Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.