姿势
计算机科学
人工智能
管道(软件)
计算机视觉
对象(语法)
三维姿态估计
RGB颜色模型
估计员
机器人
目标检测
可用性
集合(抽象数据类型)
机器人学
分割
人机交互
统计
程序设计语言
数学
作者
Timothy Patten,Kiru Park,Markus Leitner,Kevin Wolfram,Markus Vincze
标识
DOI:10.1109/iros51168.2021.9635884
摘要
Object models are highly useful for robots as they enable tasks such as detection, pose estimation and manipulation. However, models are not always easily available, especially in real-world domains of operation such as peoples’ homes. This work presents a pipeline to generate high-quality object reconstructions from human in-hand manipulation to alleviate the necessity of specialised or expensive hardware. Missing data, due to occlusion or unseen sides, is explicitly handled by incorporating shape completion. We demonstrate the usability of the reconstructions by applying a model-based as well as a CNN-based object pose estimator that is trained on synthetic images by employing state-of-the-art texture synthesis. Using our pipeline to cheaply generate object models and synthetic RGB images for training, we achieve competitive performance compared to baselines that require an elaborate set-up to construct models or large amounts of annotated data. Object grasping is also enabled by learning with the reconstructions in simulation, then executing with a real robot. These evaluations show that our reconstructions are comparable to those made under near-perfect conditions and enable 6D object pose estimation as well as real-world grasping.
科研通智能强力驱动
Strongly Powered by AbleSci AI