棱锥(几何)
计算机科学
弹丸
利用
人工智能
特征(语言学)
任务(项目管理)
图像(数学)
深度学习
模式识别(心理学)
机器学习
一次性
计算机视觉
计算机安全
工程类
化学
有机化学
机械工程
语言学
物理
哲学
系统工程
光学
作者
Hao Tang,Chengcheng Yuan,Zechao Li,Jinhui Tang
标识
DOI:10.1016/j.patcog.2022.108792
摘要
Few-shot fine-grained recognition (FS-FGR) aims to distinguish several highly similar objects from different sub-categories with limited supervision. However, traditional few-shot learning solutions typically exploit image-level features and are committed to capturing global silhouettes while accidentally ignore to exploring local details, resulting in an inevitable problem of inconspicuous but distinguishable information loss. Thus, how to effectively address the fine-grained recognition issue given limited samples still remains a major challenging. In this article, we tend to propose an effective bidirectional pyramid architecture to enhance internal representations of features to cater to fine-grained image recognition task in the few-shot learning scenario. Specifically, we deploy a multi-scale feature pyramid and a multi-level attention pyramid on the backbone network, and progressively aggregated features from different granular spaces via both of them. We then further present an attention-guided refinement strategy in collaboration with a multi-level attention pyramid to reduce the uncertainty brought by backgrounds conditioned by limited samples. In addition, the proposed method is trained with the meta-learning framework in an end-to-end fashion without any extra supervision. Extensive experimental results on four challenging and widely-used fine-grained benchmarks show that the proposed method performs favorably against state-of-the-arts, especially in the one-shot scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI