强化学习
加速度
弹道
避碰
计算机科学
二次规划
控制理论(社会学)
高斯过程
碰撞
数学优化
人工智能
高斯分布
数学
控制(管理)
物理
经典力学
天文
计算机安全
量子力学
作者
Yifan Hu,Junjie Fu,Guanghui Wen
出处
期刊:IEEE transactions on intelligent vehicles
[Institute of Electrical and Electronics Engineers]
日期:2023-01-02
卷期号:8 (3): 2332-2344
被引量:24
标识
DOI:10.1109/tiv.2022.3233592
摘要
Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.
科研通智能强力驱动
Strongly Powered by AbleSci AI