强化学习
计算机科学
欺骗
选择(遗传算法)
计算机安全
人工智能
面子(社会学概念)
深度学习
法学
政治学
社会科学
社会学
作者
Axel Charpentier,Christopher Neal,Nora Cuppens,Frédéric Cuppens,Reda Yaich
标识
DOI:10.1145/3600160.3600176
摘要
As computer networks face increasingly sophisticated attacks there is a need to create adaptive defensive systems that can select appropriate countermeasures to thwart attacks. The use of Deep Reinforcement Learning to train defensive agents is an avenue to study to meet this demand. In this paper we describe a simulated computer network environment wherein we conduct attacks and train defensive agents that employ Moving Target Defense and Deception strategies. We train an attacking agent, using Proximal Policy Optimization, to learn a policy to extract sensitive network data as quickly as possible from the environment. We then train a defending agent to prevent the attacker from reaching its objective. Our results demonstrate how the defender is able to learn a policy to inhibit the attacker.
科研通智能强力驱动
Strongly Powered by AbleSci AI