帕斯卡(单位)
计算机科学
特征(语言学)
人工智能
模式识别(心理学)
棱锥(几何)
卷积神经网络
目标检测
特征提取
探测器
对象(语法)
数学
电信
哲学
语言学
几何学
程序设计语言
作者
Wenjie Lin,Jun Chu,Lu Leng,Jun Miao,Lingfeng Wang
标识
DOI:10.1016/j.patcog.2023.109878
摘要
In this paper, an enhanced disentanglement module is proposed to address feature misalignment caused by inherently irreconcilable conflicts between classification and regression tasks in Convolutional Neural Network-based object detectors. The proposed method disentangles features in the feature pyramid network (FPN) at the neck of the architecture. In addition, a response alignment strategy is proposed to reduce inconsistent responses and suppress inferior predictions. Extensive experiments are performed on the MS COCO and PASCAL VOC datasets with different backbones, confirming that the proposed method improves performance significantly. The proposed method exhibits two main advantages over existing solutions—features are disentangled at the neck instead of at the head, enabling comprehensive resolution of feature misalignment, and independent outputs of the two tasks after feature disentanglement are avoided, thereby preventing response inconsistencies.
科研通智能强力驱动
Strongly Powered by AbleSci AI