人工智能
计算机视觉
姿势
计算机科学
对象(语法)
水准点(测量)
RGB颜色模型
匹配(统计)
视频跟踪
机器人
跟踪(教育)
编码(集合论)
目标检测
模式识别(心理学)
数学
心理学
教育学
统计
大地测量学
集合(抽象数据类型)
程序设计语言
地理
作者
Chen Wang,Roberto Martín-Martín,Danfei Xu,Jun Lv,Cewu Lu,Li Fei-Fei,Silvio Savarese,Yuke Zhu
标识
DOI:10.1109/icra40945.2020.9196679
摘要
We present 6-PACK, a deep learning approach to category-level 6D object pose tracking on RGB-D data. Our method tracks in real time novel object instances of known object categories such as bowls, laptops, and mugs. 6-PACK learns to compactly represent an object by a handful of 3D keypoints, based on which the interframe motion of an object instance can be estimated through keypoint matching. These keypoints are learned end-to-end without manual supervision in order to be most effective for tracking. Our experiments show that our method substantially outperforms existing methods on the NOCS category-level 6D pose estimation benchmark and supports a physical robot to perform simple vision-based closed-loop manipulation tasks. Our code and video are available at https://sites.google.com/view/6packtracking.
科研通智能强力驱动
Strongly Powered by AbleSci AI