强化学习
计算机科学
过程(计算)
功能(生物学)
极限(数学)
人工智能
数学
进化生物学
生物
操作系统
数学分析
作者
Yanjun Huang,Yuxiao Gu,Yuan Kang,Shuo Yang,Tao Liu,Hong Chen
出处
期刊:IEEE transactions on intelligent vehicles
[Institute of Electrical and Electronics Engineers]
日期:2023-12-06
卷期号:9 (2): 3509-3519
标识
DOI:10.1109/tiv.2023.3336768
摘要
Mandatory lane-change scenarios are often challenging for autonomous vehicles in complex environments. In this paper, a human-knowledge-enhanced reinforcement learning (RL) method for lane-change decision making is proposed, where the human intelligence is integrated with RL algorithm in a multiple manner. First, this paper constructs a complex ramp-off scenario with congested traffic flow to help agents master lane-change skills. On the basis of the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm, the human prior experience is encoded into reward function and safety constraints offline, and the online guidance of experts is also introduced into the framework, which can limit the unsafe exploration during the training process and provide demonstration in complex scenarios. The experimental results indicate that our method can effectively improve the training efficiency and outperform typical RL method and expert drivers, without specific requirements on the expertise. The proposed method can enhance the learning ability of RL based driving strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI