强化学习
更安全的
计算机科学
机器人
动作选择
集合(抽象数据类型)
软件部署
人工智能
动作(物理)
自主机器人
一次性
机器学习
工程类
移动机器人
计算机安全
软件工程
机械工程
物理
量子力学
神经科学
感知
生物
程序设计语言
作者
Weiqiang Wang,Xu Zhou,Benlian Xu,Siwen Chen,Mingli Lu,Jun Li,Yuejiang Gu
出处
期刊:International Journal of Automation and Control
[Inderscience Enterprises Ltd.]
日期:2024-01-01
卷期号:18 (1): 30-52
标识
DOI:10.1504/ijaac.2024.135093
摘要
Unsafe explorations in the training phase hinder the practical deployment of reinforcement learning (RL) on autonomous robots. Some safe RL methods use safety constraints from prior or external knowledge to reduce or avoid unsafe explorations, but such knowledge is usually unavailable in practice, especially in unknown environments. In this work, we propose a few-shot reasoning-based safe reinforcement learning framework that includes a new few-shot learning method with dynamic support set to reason the safety of unexplored actions and hence guide safer action selection. Additionally, it endows robots with the capability of reverting to previous safe states and reflecting on failures to update the dynamic support set and further improve the accuracy of safety reasoning. Experimental results show that our new few-shot learning method is more accurate, and our proposed framework can significantly reduce the number of failures in the learning phase, especially for long-term autonomy.
科研通智能强力驱动
Strongly Powered by AbleSci AI