计算机科学
强化学习
任务(项目管理)
人工智能
机器人
控制(管理)
趋同(经济学)
机器人学习
机械手
机器学习
移动机器人
经济增长
经济
管理
标识
DOI:10.1016/j.engappai.2022.105753
摘要
Current methods of reinforcement learning from expert demonstrations require humans to give all possible demonstrations in the learning phase, which is very difficult for continuous or high-dimensional spaces. In this paper, we proposed biased exploration reinforcement learning to avoid the exploration of unnecessary states and actions of the expert demonstrations. We present a convergence analysis of the novel method. This method is applied to learn the control of a redundant robot manipulator with 7-degree-of-freedom. The experimental results demonstrate that the proposed method accelerates the learning phase. The obtained policy can successfully achieve the pretended task.
科研通智能强力驱动
Strongly Powered by AbleSci AI