计算机科学
杠杆(统计)
人工智能
启发式
对象(语法)
分割
边距(机器学习)
概化理论
领域(数学分析)
姿势
透视图(图形)
分类
机器学习
对抗制
构造(python库)
成对比较
语义学(计算机科学)
数学
操作系统
统计
数学分析
程序设计语言
作者
Haoran Geng,Helin Xu,Chengyang Zhao,Chao Xu,Yi Li,Siyuan Huang,He Wang
标识
DOI:10.1109/cvpr52729.2023.00684
摘要
For years, researchers have been devoted to generalizable object perception and manipulation, where cross-category generalizability is highly desired yet underexplored. In this work, we propose to learn such cross-category skills via Generalizable and Actionable Parts (GAParts). By identifying and defining 9 GAPart classes (lids, handles, etc.) in 27 object categories, we construct a large-scale part-centric interactive dataset, GAPartNet, where we provide rich, part-level annotations (semantics, poses) for 8,489 part instances on 1,166 objects. Based on GAPartNet, we investigate three cross-category tasks: part segmentation, part pose estimation, and partbased object manipulation. Given the significant domain gaps between seen and unseen object categories, we propose a robust 3D segmentation method from the perspective of domain generalization by integrating adversarial learning techniques. Our method outperforms all existing methods by a large margin, no matter on seen or unseen categories. Furthermore, with part segmentation and pose estimation results, we leverage the GAPart pose definition to design part-based manipulation heuristics that can generalize well to unseen object categories in both the simulator and the real world.
科研通智能强力驱动
Strongly Powered by AbleSci AI