计算机科学
判别式
人工智能
可扩展性
稳健性(进化)
对象(语法)
机器学习
突出
模式识别(心理学)
生物化学
化学
数据库
基因
作者
Yaqian Zhou,Hao-Chun Lu,Tong Hao,Xuanya Li,An-An Liu
标识
DOI:10.1016/j.inffus.2023.101967
摘要
Existing multi-view object classification algorithms usually rely on sufficient labeled multi-view objects, which substantially restricts their scalability to novel classes with few annotated training samples in real-world applications. Aiming to go beyond these limitations, we explore a novel yet challenging task, few-shot multi-view object classification (FS-MVOC), which expects the network to build its classification ability efficiently based on limited labeled multi-view objects. To this end, we design a dual augmentation network (DANet) to provide excellent performance for the under-explored FS-MVOC task. On the one hand, we employ an attention-guided multi-view representation augmentation (AMRA) strategy to help the model focus on salient features and suppress unnecessary ones on multiple views of multi-view objects, resulting in more discriminative multi-view representations. On the other hand, during the meta-training stage, we adopt the category prototype augmentation (CPA) strategy to improve the class-representativeness of each prototype and increase the inter-prototype difference by injecting Gaussian noise in the deep feature space. Extensive experiments on the benchmark datasets (Meta-ModelNet and Meta-ShapeNet) indicate the effectiveness and robustness of DANet.
科研通智能强力驱动
Strongly Powered by AbleSci AI