计算机科学
人工智能
分类器(UML)
帕斯卡(单位)
模式识别(心理学)
机器学习
目标检测
通知
政治学
程序设计语言
法学
作者
Di Qi,Jilin Hu,Jianbing Shen
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2023-06-02
卷期号:35 (4): 5435-5446
标识
DOI:10.1109/tnnls.2022.3204597
摘要
Few-shot object detection (FSOD), which detects novel objects with only a few training instances, has recently attracted more attention. Previous works focus on making the most use of label information of objects. Still, they fail to consider the structural and semantic information of the image itself and solve the misclassification between data-abundant base classes and data-scarce novel classes efficiently. In this article, we propose FSOD with Self-Supervising and Cooperative Classifier ( $\text {F}\text {S}^{3}\text {C}$ ) approach to deal with those concerns. Specifically, we analyze the underlying performance degradation of novel classes in FSOD and discover that false-positive samples are the main reason. By looking into these false-positive samples, we further notice that misclassifying novel classes as base classes are the main cause. Thus, we introduce double RoI heads into the existing Fast-RCNN to learn more specific features for novel classes. We also consider using self-supervised learning (SSL) to learn more structural and semantic information. Finally, we propose a cooperative classifier (CC) with the base–novel regularization to maximize the interclass variance between base and novel classes. In the experiment, $\text {F}\text {S}^{3}\text {C}$ outperforms all the latest baselines in most cases on PASCAL VOC and COCO.
科研通智能强力驱动
Strongly Powered by AbleSci AI