Scaling simulation-to-real transfer by learning a latent space of robot skills

计算机科学机器人人工智能机器学习任务（项目管理）稳健性（进化）集合（抽象数据类型）空格（标点符号）工程类生物化学基因操作系统化学程序设计语言系统工程

作者

Ryan Julian,Eric Heiden,Zhanpeng He,Hejia Zhang,Stefan Schaal,Joseph J. Lim,Gaurav S. Sukhatme,Karol Hausman

出处

期刊：The International Journal of Robotics Research [SAGE Publishing]
日期：2020-08-21 卷期号：39 (10-11): 1259-1278 被引量：2

链接

sagepub.comdoi.org

标识

DOI：10.1177/0278364920944474

摘要

We present a strategy for simulation-to-real transfer, which builds on recent advances in robot skill decomposition. Rather than focusing on minimizing the simulation–reality gap, we propose a method for increasing the sample efficiency and robustness of existing simulation-to-real approaches which exploits hierarchy and online adaptation. Instead of learning a unique policy for each desired robotic task, we learn a diverse set of skills and their variations, and embed those skill variations in a continuously parameterized space. We then interpolate, search, and plan in this space to find a transferable policy which solves more complex, high-level tasks by combining low-level skills and their variations. In this work, we first characterize the behavior of this learned skill space, by experimenting with several techniques for composing pre-learned latent skills. We then discuss an algorithm which allows our method to perform long-horizon tasks never seen in simulation, by intelligently sequencing short-horizon latent skills. Our algorithm adapts to unseen tasks online by repeatedly choosing new skills from the latent space, using live sensor data and simulation to predict which latent skill will perform best next in the real world. Importantly, our method learns to control a real robot in joint-space to achieve these high-level tasks with little or no on-robot time, despite the fact that the low-level policies may not be perfectly transferable from simulation to real, and that the low-level skills were not trained on any examples of high-level tasks. In addition to our results indicating a lower sample complexity for families of tasks, we believe that our method provides a promising template for combining learning-based methods with proven classical robotics algorithms such as model-predictive control.

求助该文献

最长约 10秒，即可获得该文献文件

Scaling simulation-to-real transfer by learning a latent space of robot skills

今日热心研友