计算机科学
动作(物理)
过程(计算)
人工智能
适应(眼睛)
对抗制
模棱两可
动作选择
选择(遗传算法)
空战
机器学习
强化学习
运筹学
模拟
操作系统
光学
物理
工程类
生物
量子力学
神经科学
程序设计语言
感知
作者
Zhixiao Sun,Haiyin Piao,Zhen Yang,Yiyang Zhao,Guang Zhan,Deyun Zhou,Guanglei Meng,Hechang Chen,Xing Chen,Bohao Qu,Yuanjie Lu
标识
DOI:10.1016/j.engappai.2020.104112
摘要
Air-to-air confrontation has attracted wide attention from artificial intelligence scholars. However, in the complex air combat process, operational strategy selection depends heavily on aviation expert knowledge, which is usually expensive and difficult to obtain. Moreover, it is challenging to select optimal action sequences efficiently and accurately with existing methods, due to the high complexity of action selection when involving hybrid actions, e.g., discrete/continuous actions. In view of this, we propose a novel Multi-Agent Hierarchical Policy Gradient algorithm (MAHPG), which is capable of learning various strategies and transcending expert cognition by adversarial self-play learning. Besides, a hierarchical decision network is adopted to deal with the complicated and hybrid actions. It has a hierarchical decision-making ability similar to humankind, and thus, reduces the action ambiguity efficiently. Extensive experimental results demonstrate that the MAHPG outperforms the state-of-the-art air combat methods in terms of both defense and offense ability. Notably, it is discovered that the MAHPG has the ability of Air Combat Tactics Interplay Adaptation, and new operational strategies emerged that surpass the level of experts.
科研通智能强力驱动
Strongly Powered by AbleSci AI