判别式
计算机科学
嵌入
人工智能
领域(数学分析)
钥匙(锁)
模式识别(心理学)
代表(政治)
解耦(概率)
对象(语法)
目标检测
计算机视觉
机器学习
数据挖掘
数学
数学分析
计算机安全
控制工程
政治
政治学
法学
工程类
作者
Maozhen Liu,Xiaoguang Di,Wenzhuang Wang
标识
DOI:10.1016/j.knosys.2024.111772
摘要
Most existing few-shot object detection (FSD) methods implicitly assume that the target domain data with few samples conform to the same statistical distribution as the source domain. However, this assumption is impractical, especially when dealing with unconstrained scenarios. Also, the decline of fine-grained hidden samples caused by scene switching, significant morphological changes, etc. has brought great challenges to object detection. To solve the above few-shot cross-domain detection (FS-CDD) problems, in this work, we propose a novel and flexible Human-like Discrimination Network (HDNet), which is composed of four modules. Firstly, multi-level key generation (MKG) fully mining multi-level comprehensive abstract representation. The precious knowledge that contains diverse low and high levels is obtained by three parallel branches. After decoupling the hidden space, the rich patches from each pair of target and source domains are closely matched in the Embedded Space Implicit Association (ESIA) using degree description. In order to enhance the discriminative capability of the model for a few samples, the instances are additionally encoded and corrected at the classification end using the Prediction head with instance embedding(PHIE). Finally, the Adaptive Reweighted Module (ARM) is redesigned to determine coefficient of multiple loss functions, which avoids the improper operation of setting coefficients according to experience in the past. Extensive experiments demonstrate that despite the dual challenges of limited samples and cross-domain scenarios, the proposed HDNet exhibits remarkable performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI