计算机科学
强化学习
电子游戏
钢筋
运动(物理)
博弈论
序贯博弈
人工智能
非合作博弈
电脑游戏
游戏设计
电子游戏设计
人机交互
作者
Ruilong Zhang,Qun Zong,Xiuyun Zhang,Liqian Dou,Bailing Tian
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2022-02-14
卷期号:PP
标识
DOI:10.1109/tnnls.2022.3146976
摘要
As one of the tiniest flying objects, unmanned aerial vehicles (UAVs) are often expanded as the ``swarm'' to execute missions. In this article, we investigate the multiquadcopter and target pursuit-evasion game in the obstacles environment. For high-quality simulation of the urban environment, we propose the pursuit-evasion scenario (PES) framework to create the environment with a physics engine, which enables quadcopter agents to take actions and interact with the environment. On this basis, we construct multiagent coronal bidirectionally coordinated with target prediction network (CBC-TP Net) with a vectorized extension of multiagent deep deterministic policy gradient (MADDPG) formulation to ensure the effectiveness of the damaged ``swarm'' system in pursuit-evasion mission. Unlike traditional reinforcement learning, we design a target prediction network (TP Net) innovatively in the common framework to imitate the way of human thinking: situation prediction is always before decision-making. The experiments of the pursuit-evasion game are conducted to verify the state-of-the-art performance of the proposed strategy, both in the normal and antidamaged situations.
科研通智能强力驱动
Strongly Powered by AbleSci AI