强化学习
计算机科学
语义推理机
水准点(测量)
过程(计算)
弹道
人工智能
损害赔偿
人机交互
编码器
操作系统
物理
政治学
大地测量学
法学
地理
天文
作者
Changxin Huang,Ronghui Zhang,Meizi Ouyang,Pengxu Wei,Junfan Lin,Jiang Su,Liang Lin
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2021-12-01
卷期号:32 (12): 5379-5391
被引量:14
标识
DOI:10.1109/tnnls.2021.3109284
摘要
Existing deep reinforcement learning (RL) are devoted to research applications on video games, e.g., The Open Racing Car Simulator (TORCS) and Atari games. However, it remains under-explored for vision-based autonomous urban driving navigation (VB-AUDN). VB-AUDN requires a sophisticated agent working safely in structured, changing, and unpredictable environments; otherwise, inappropriate operations may lead to irreversible or catastrophic damages. In this work, we propose a deductive RL (DeRL) to address this challenge. A deduction reasoner (DR) is introduced to endow the agent with ability to foresee the future and to promote policy learning. Specifically, DR first predicts future transitions through a parameterized environment model. Then, DR conducts self-assessment at the predicted trajectory to perceive the consequences of current policy resulting in a more reliable decision-making process. Additionally, a semantic encoder module (SEM) is designed to extract compact driving representation from the raw images, which is robust to the changes of the environment. Extensive experimental results demonstrate that DeRL outperforms the state-of-the-art model-free RL approaches on the public CAR Learning to Act (CARLA) benchmark and presents a superior performance on success rate and driving safety for goal-directed navigation.
科研通智能强力驱动
Strongly Powered by AbleSci AI