Open Access

ARTICLE

Strategy Selection for Moving Target Defense in Incomplete Information Game

Huan Zhang1, Kangfeng Zheng1, *, Xiujuan Wang2, Shoushan Luo1, Bin Wu1

1 School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, 100088, China.
2 College of Computer Sciences, Beijing University of Technology, Beijing, 100124, China.

* Corresponding Author: Kangfeng Zheng.

Computers, Materials & Continua 2020, 62(2), 763-786. https://doi.org/10.32604/cmc.2020.06553

Abstract

As a core component of the network, web applications have become one of the preferred targets for attackers, because their static configurations simplify the exploitation of vulnerabilities. Although moving target defense (MTD) has been proposed to increase the difficulty of attacks, no single MTD approach can cope with all types of attacks, and resource limitations make it impossible to deploy every approach simultaneously. The selection of an optimal MTD-based defense strategy has therefore become a focus of research. The confrontation between two players in the security domain is generally modeled as a stochastic game in which the reward matrices are known to both players. In a real security confrontation, however, the scenario is an incomplete information game: each player can only observe the actions performed by the opponent, and the observed actions are not completely accurate. To describe the attacker’s reward function accurately enough to reach a Nash equilibrium, this work simulated and updated the attacker’s strategy selection distribution by observing and analyzing the attacker’s strategy selection history. The attacker’s possible rewards in each confrontation were then corrected through an observation matrix. On this basis, a Nash-Q learning algorithm with reward quantification was proposed to select the optimal strategy. Moreover, the performances of the Minimax-Q learning algorithm and the Naive-Q learning algorithm were compared and analyzed in the MTD environment. Finally, the experimental results showed that the proposed strategy selection algorithm enables defenders to select a more reasonable defensive strategy and achieve the maximum possible reward.
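As an illustration of the kind of update the abstract describes, the following is a minimal sketch of a Nash-Q style learning step in which the attacker's observed action is first corrected by an observation matrix and then used to update the defender's Q-values. It assumes a two-player zero-sum stage game so that the stage equilibrium can be solved as a linear program with scipy.optimize.linprog; the state and action sets, the observation matrix, and all hyper-parameters are illustrative placeholders, not the authors' implementation.

```python
# Sketch of an observation-corrected Nash-Q style update for MTD strategy
# selection. Zero-sum stage game assumed; all structures are illustrative.
import numpy as np
from scipy.optimize import linprog

N_DEF, N_ATT = 3, 3          # defender / attacker action counts (assumed)
GAMMA, ALPHA = 0.9, 0.1      # discount factor and learning rate (assumed)

# Q[s] is the defender's estimated reward matrix in state s:
# rows = defender actions, columns = attacker actions.
Q = {s: np.zeros((N_DEF, N_ATT)) for s in range(2)}

# Observation matrix: obs_mat[i, j] = P(observe attacker action j | true action i).
obs_mat = np.full((N_ATT, N_ATT), 0.1) + 0.7 * np.eye(N_ATT)

# Smoothed counts of the attacker's (corrected) action choices per state,
# approximating the attacker's strategy-selection distribution.
att_counts = {s: np.ones(N_ATT) for s in range(2)}


def corrected_attacker_belief(s, observed_a):
    """Bayes-correct the observed attacker action using the observation matrix."""
    prior = att_counts[s] / att_counts[s].sum()
    likelihood = obs_mat[:, observed_a]            # P(observation | true action)
    post = likelihood * prior
    return post / post.sum()


def stage_game_value(q_matrix):
    """Defender's maximin value and mixed strategy of the zero-sum stage game."""
    n = q_matrix.shape[0]
    # maximize v  s.t.  sum_i x_i * q[i, j] >= v  for all j,  x on the simplex
    c = np.zeros(n + 1); c[-1] = -1.0
    A_ub = np.hstack([-q_matrix.T, np.ones((q_matrix.shape[1], 1))])
    b_ub = np.zeros(q_matrix.shape[1])
    A_eq = np.hstack([np.ones((1, n)), np.zeros((1, 1))])
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, None)] * n + [(None, None)])
    return res.x[-1], res.x[:n]


def nash_q_update(s, a_def, observed_a_att, reward, s_next):
    """One learning step: correct the observation, then update the Q-matrix."""
    belief = corrected_attacker_belief(s, observed_a_att)
    att_counts[s] += belief                        # update attacker distribution
    v_next, _ = stage_game_value(Q[s_next])
    # Spread the temporal-difference target over the likely true attacker actions.
    for a_att, p in enumerate(belief):
        target = reward + GAMMA * v_next
        Q[s][a_def, a_att] += ALPHA * p * (target - Q[s][a_def, a_att])


# Example step: in state 0 the defender played action 1, observed attacker
# action 2, received reward 0.4, and the environment moved to state 1.
nash_q_update(s=0, a_def=1, observed_a_att=2, reward=0.4, s_next=1)
```

The linear program in stage_game_value returns the defender's maximin value and mixed strategy for the estimated reward matrix; a full Nash-Q implementation for a general-sum game would replace it with a stage-game Nash equilibrium solver.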

Keywords


Cite This Article

H. Zhang, K. Zheng, X. Wang, S. Luo and B. Wu, "Strategy selection for moving target defense in incomplete information game," Computers, Materials & Continua, vol. 62, no. 2, pp. 763–786, 2020.

This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.