计算机科学
人工智能
任务(项目管理)
构造(python库)
领域(数学分析)
姿势
强化学习
计算机视觉
光学(聚焦)
机械臂
适应(眼睛)
领域(数学)
机器学习
人机交互
工程类
数学分析
物理
数学
系统工程
光学
纯数学
程序设计语言
作者
Yiming Zuo,Weichao Qiu,Lingxi Xie,Fangwei Zhong,Yizhou Wang,Alan Yuille
标识
DOI:10.1109/cvpr.2019.00434
摘要
Training a robotic arm to accomplish real-world tasks has been attracting increasing attention in both academia and industry. This work discusses the role of computer vision algorithms in this field. We focus on low-cost arms on which no sensors are equipped and thus all decisions are made upon visual recognition, e.g., real-time 3D pose estimation. This requires annotating a lot of training data, which is not only time-consuming but also laborious. In this paper, we present an alternative solution, which uses a 3D model to create a large number of synthetic data, trains a vision model in this virtual domain, and applies it to real-world images after domain adaptation. To this end, we design a semi-supervised approach, which fully leverages the geometric constraints among keypoints. We apply an iterative algorithm for optimization. Without any annotations on real images, our algorithm generalizes well and produces satisfying results on 3D pose estimation, which is evaluated on two real-world datasets. We also construct a vision-based control system for task accomplishment, for which we train a reinforcement learning agent in a virtual environment and apply it to the real-world. Moreover, our approach, with merely a 3D model being required, has the potential to generalize to other types of multi-rigid-body dynamic systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI