Model-Based Reinforcement Learning Variable Impedance Control for Human-Robot Collaboration

机器人 强化学习 工程类 阻抗控制 人机交互 控制工程 灵活性(工程) 控制理论(社会学) 人工智能 计算机科学 模拟 控制器(灌溉) 控制(管理) 统计 农学 生物 数学
作者
Loris Roveda,Jeyhoon Maskani,Paolo Franceschi,Arash Abdi,Francesco Braghin,Lorenzo Molinari Tosatti,Nicola Pedrocchi
出处
期刊:Journal of Intelligent and Robotic Systems [Springer Nature]
卷期号:100 (2): 417-433 被引量:107
标识
DOI:10.1007/s10846-020-01183-3
摘要

Industry 4.0 is taking human-robot collaboration at the center of the production environment. Collaborative robots enhance productivity and flexibility while reducing human’s fatigue and the risk of injuries, exploiting advanced control methodologies. However, there is a lack of real-time model-based controllers accounting for the complex human-robot interaction dynamics. With this aim, this paper proposes a Model-Based Reinforcement Learning (MBRL) variable impedance controller to assist human operators in collaborative tasks. More in details, an ensemble of Artificial Neural Networks (ANNs) is used to learn a human-robot interaction dynamic model, capturing uncertainties. Such a learned model is kept updated during collaborative tasks execution. In addition, the learned model is used by a Model Predictive Controller (MPC) with Cross-Entropy Method (CEM). The aim of the MPC+CEM is to online optimize the stiffness and damping impedance control parameters minimizing the human effort (i.e, minimizing the human-robot interaction forces). The proposed approach has been validated through an experimental procedure. A lifting task has been considered as the reference validation application (weight of the manipulated part: 10 kg unknown to the robot controller). A KUKA LBR iiwa 14 R820 has been used as a test platform. Qualitative performance (i.e, questionnaire on perceived collaboration) have been evaluated. Achieved results have been compared with previous developed offline model-free optimized controllers and with the robot manual guidance controller. The proposed MBRL variable impedance controller shows improved human-robot collaboration. The proposed controller is capable to actively assist the human in the target task, compensating for the unknown part weight. The human-robot interaction dynamic model has been trained with a few initial experiments (30 initial experiments). In addition, the possibility to keep the learning of the human-robot interaction dynamics active allows accounting for the adaptation of human motor system.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
科研通AI6.1应助ZHX采纳,获得10
2秒前
王婷发布了新的文献求助10
3秒前
王w发布了新的文献求助30
4秒前
量子星尘发布了新的文献求助10
4秒前
刘莉莉发布了新的文献求助10
4秒前
金白发布了新的文献求助10
5秒前
雨天发布了新的文献求助10
5秒前
9秒前
Fannie发布了新的文献求助10
10秒前
识字岭的岭应助隋阳采纳,获得10
10秒前
万能图书馆应助xuxubaibai采纳,获得10
10秒前
小番茄发布了新的文献求助10
10秒前
10秒前
bkagyin应助罗杨采纳,获得10
11秒前
梵高晚风完成签到,获得积分10
11秒前
无名氏完成签到,获得积分10
12秒前
12秒前
烟花应助王富贵采纳,获得10
13秒前
13秒前
13秒前
littleknees发布了新的文献求助30
14秒前
快乐树发布了新的文献求助10
14秒前
科研通AI6.1应助dede采纳,获得20
14秒前
SciGPT应助开朗醉波采纳,获得10
15秒前
15秒前
15秒前
16秒前
16秒前
细腻初雪发布了新的文献求助10
17秒前
无极微光应助无心的鹤采纳,获得20
17秒前
像风一样自由完成签到 ,获得积分10
18秒前
烂漫岱周发布了新的文献求助10
18秒前
19秒前
一路向北发布了新的文献求助10
20秒前
ran123456完成签到,获得积分10
20秒前
倪莎完成签到,获得积分10
21秒前
缓慢的含海完成签到 ,获得积分10
21秒前
打工肥仔应助温瞳采纳,获得10
21秒前
21秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Handbook of pharmaceutical excipients, Ninth edition 5000
Aerospace Standards Index - 2026 ASIN2026 3000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Social Work and Social Welfare: An Invitation(7th Edition) 410
Medical Management of Pregnancy Complicated by Diabetes 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6055778
求助须知:如何正确求助?哪些是违规求助? 7885071
关于积分的说明 16288557
捐赠科研通 5201104
什么是DOI,文献DOI怎么找? 2782979
邀请新用户注册赠送积分活动 1765773
关于科研通互助平台的介绍 1646704