棱锥(几何)
目标检测
计算机科学
特征(语言学)
人工智能
对象(语法)
特征提取
模式识别(心理学)
过程(计算)
计算机视觉
探测器
传感器融合
融合
数学
几何学
操作系统
哲学
电信
语言学
作者
Shifeng Dong,Rujing Wang,Jianming Du,Lin Jiao
标识
DOI:10.1117/1.jei.32.1.013045
摘要
Scale variation is one of the challenges of object detection. Most state-of-the-art object detectors depend on feature pyramid networks (FPN) for multiscale learning to deal with this problem, in which feature fusion is an essential operation. However, feature fusion does not sufficiently address the difficulty of the detection task. This paper presents an enhancement-fusion feature pyramid network (EFPN) to obtain reliable object representations for object detectors. Specifically, it contains a feature enhancement module (FEM) and a bottom-up path module (BPM). The FEM is used to eliminate the negative impact of the uneven distribution of object scales on the model performance. Then, a BPM is proposed to address the fusion inconsistency in the FPN. Additionally, an attention module (Ac) is added to eliminate the information loss in the bottom-up aggregation process. EFPN is evaluated by combining it with state-of-the-art detection methods. Extensive experimental results on two datasets MS-COCO and VOC2007 demonstrate the effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI