发布文献求助

Reinforcement learning from expert demonstrations with application to redundant robot control

计算机科学强化学习任务（项目管理）人工智能机器人控制（管理）趋同（经济学）机器人学习机械手机器学习移动机器人管理经济经济增长

作者

Jorge Ramírez,Wen Yu

出处

期刊：Engineering Applications of Artificial Intelligence [Elsevier BV]
日期：2023-03-01 卷期号：119: 105753-105753 被引量：10

标识

DOI：10.1016/j.engappai.2022.105753

摘要

Current methods of reinforcement learning from expert demonstrations require humans to give all possible demonstrations in the learning phase, which is very difficult for continuous or high-dimensional spaces. In this paper, we proposed biased exploration reinforcement learning to avoid the exploration of unnecessary states and actions of the expert demonstrations. We present a convergence analysis of the novel method. This method is applied to learn the control of a redundant robot manipulator with 7-degree-of-freedom. The experimental results demonstrate that the proposed method accelerates the learning phase. The obtained policy can successfully achieve the pretended task.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: wyx发布了新的文献求助10

1秒前; why完成签到，获得积分10

1秒前; 吴凡发布了新的文献求助10

3秒前; 濠哥妈咪发布了新的文献求助10

3秒前; 8R60d8上传了应助文件

5秒前; Fox发布了新的文献求助10

6秒前; 浮游上传了应助文件

8秒前; wzq发布了新的文献求助30

9秒前; 桐桐上传了应助文件

9秒前; 吴凡完成签到，获得积分10

11秒前; Fox完成签到，获得积分10

11秒前; 科研通AI5的应助被研究僧-卓采纳，获得10

12秒前; 肉肉肉发布了新的文献求助10

14秒前; 烂漫映之完成签到，获得积分10

15秒前; 嘟嘟完成签到，获得积分10

16秒前; 小女完成签到，获得积分10

18秒前; 8R60d8上传了应助文件

18秒前; 林白生完成签到，获得积分10

19秒前; 汉堡包的应助被Luo采纳，获得10

21秒前; 科研通AI5上传了应助文件

22秒前; 研究僧-卓发布了新的文献求助10

27秒前; 星辰大海的应助被眼睛大墨镜采纳，获得10

29秒前; 8R60d8上传了应助文件

30秒前; Orange的应助被suchui采纳，获得10

30秒前; 没有密码关注了科研通微信公众号

33秒前; JamesPei的应助被zhuge采纳，获得10

33秒前; 肉肉肉完成签到，获得积分10

33秒前; Lucas上传了应助文件

33秒前; dsfsd完成签到，获得积分10

34秒前; vizi上传了应助文件

35秒前; jenningseastera上传了应助文件

36秒前; 涵涵涵hh发布了新的文献求助10

36秒前; 8R60d8上传了应助文件

37秒前; 桐桐上传了应助文件

40秒前; Orange上传了应助文件

42秒前; lvlvlvsh发布了新的文献求助10

44秒前; CipherSage上传了应助文件

44秒前; LL完成签到，获得积分10

45秒前; 思源上传了应助文件

48秒前; Huang完成签到，获得积分10

48秒前

高分求助中: (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; Manipulating the Mouse Embryo: A Laboratory Manual, Fourth Edition 1000; Determination of the boron concentration in diamond using optical spectroscopy 600; The Netter Collection of Medical Illustrations: Digestive System, Volume 9, Part III - Liver, Biliary Tract, and Pancreas （3rd Edition） 600; Founding Fathers The Shaping of America 500; A new house rat (Mammalia: Rodentia: Muridae) from the Andaman and Nicobar Islands 500; On the Validity of the Independent-Particle Model and the Sum-rule Approach to the Deeply Bound States in Nuclei 220

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 4545613; 求助须知：如何正确求助？哪些是违规求助？ 3977172; 关于积分的说明 12315867; 捐赠科研通 3645354; 什么是DOI，文献DOI怎么找？ 2007536; 邀请新用户注册赠送积分活动 1043147; 科研通“疑难数据库（出版商）”最低求助积分说明 931989

今日热心研友

紧张的惜梦

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通