强化学习
空战
计算机科学
人工智能
任务(项目管理)
领域(数学分析)
构造(python库)
深度学习
模拟
工程类
系统工程
数学分析
数学
程序设计语言
作者
Johan Källström,Fredrik Heintz
标识
DOI:10.1109/smc42975.2020.9283492
摘要
Simulation-based training has the potential to significantly improve training value in the air combat domain. However, synthetic opponents must be controlled by high-quality behavior models, in order to exhibit human-like behavior. Building such models by hand is recognized as a very challenging task. In this work, we study how multi-agent deep reinforcement learning can be used to construct behavior models for synthetic pilots in air combat simulation. We empirically evaluate a number of approaches in two air combat scenarios, and demonstrate that curriculum learning is a promising approach for handling the high-dimensional state space of the air combat domain, and that multi-objective learning can produce synthetic agents with diverse characteristics, which can stimulate human pilots in training.
科研通智能强力驱动
Strongly Powered by AbleSci AI