强化学习
冲突解决
趋同(经济学)
计算机科学
人工智能
航向(导航)
物理定律
马尔可夫决策过程
人工神经网络
机器学习
数学
物理
工程类
马尔可夫过程
航空航天工程
政治学
法学
经济
统计
量子力学
经济增长
作者
Peng Zhao,Yongming Liu
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2021-05-13
卷期号:23 (7): 8288-8301
被引量:29
标识
DOI:10.1109/tits.2021.3077572
摘要
A novel method for aircraft conflict resolution in air traffic management (ATM) using physics informed deep reinforcement learning (RL) is proposed. The motivation is to integrate prior physics understanding and model in the learning algorithm to facilitate the optimal policy searching and to present human-explainable results for display and decision-making. First, the information of intruders' quantity, speeds, heading angles, and positions are integrated into an image using the solution space diagram (SSD), which is used in the ATM for conflict detection and mitigation. The SSD serves as the prior physics knowledge from the ATM domain which is the input features for learning. A convolution neural network is used with the SSD images for the deep reinforcement learning. Next, an actor-critic network is constructed to learn conflict resolution policy. Several numerical examples are used to illustrate the proposed methodology. Both discrete and continuous RL are explored using the proposed concept of physics informed learning. A detailed comparison and discussion of the proposed algorithm and classical RL-based conflict resolution is given. The proposed approach is able to handle arbitrary number of intruders and also shows faster convergence behavior due to the encoded prior physics understanding. In addition, the learned optimal policy is also beneficial for proper display to support decision-making. Several major conclusions and future work are presented based on the current investigation.
科研通智能强力驱动
Strongly Powered by AbleSci AI