人工智能
计算机视觉
计算机科学
姿势
RGB颜色模型
运动(物理)
帧(网络)
对象(语法)
运动估计
任务(项目管理)
工程类
电信
系统工程
作者
Fengjun Mu,Rui Huang,Ao Luo,Xin Li,Jing Qiu,Hong Cheng
标识
DOI:10.1109/iros51168.2021.9636583
摘要
6D object pose estimation is an essential task in vision-based robotic grasping and manipulation. Prior works extract spatial features by fusing the RGB image and depth without considering the temporal motion information, limiting their performance in heavy occlusion robotic grasping scenarios. In this paper, we present an end-to-end model named TemporalFusion, which integrates the temporal motion information from RGB-D images for 6D object pose estimation. The core of proposed TemporalFusion model is to embed and fuse the temporal motion information from multi-frame RGB-D sequences, which could handle heavy occlusion in robotic grasping tasks. Furthermore, the proposed deep model can also obtain stable pose sequences, which is essential for real-time robotic grasping tasks. We evaluated the proposed method in the YCB-Video dataset, and experimental results show our model outperforms state-of-the-art approaches. Our code is available at https://github.com/mufengjun260/TemporalFusion21.
科研通智能强力驱动
Strongly Powered by AbleSci AI