计算机科学
特征(语言学)
稳健性(进化)
判别式
人工智能
模式识别(心理学)
语义特征
串联(数学)
背景(考古学)
特征学习
光学(聚焦)
目标检测
滤波器(信号处理)
计算机视觉
数学
古生物学
语言学
化学
哲学
物理
光学
组合数学
基因
生物化学
生物
作者
Min Wang,Wenzhong Yang,Liejun Wang,Danny Chen,Fuyuan Wei,HaiLaTi KeZiErBieKe,Yuanyuan Liao
标识
DOI:10.1016/j.jvcir.2023.103752
摘要
Due to their inherent characteristics, small objects have weaker feature representation after multiple down-sampling and are even annihilated in the background. FPN’s simple feature concatenation does not fully utilize multi-scale information and introduces irrelevant context into the information transfer, further reducing the detection performance of the small object. To address the above issues, we propose the simple but effective FE-YOLOv5. (1) We designed the feature enhancement module (FEM) to capture more discriminative features of the small object. Global attention and high-level global contextual information are used to guide shallow, high-resolution features. Global attention interacts with cross-dimensional feature interaction and reduces information loss. High-level context complements more detailed semantic information by modeling global relationships through non-local networks. (2) We design the spatially aware module (SAM) to filter spatial information and enhance the robustness of features. Deformable convolution performs sparse sampling and adaptive spatial learning to better focus on foreground objects. According to the experimental results, our proposed FE-YOLOv5 outperforms the other architectures in the VisDrone2019 dataset and Tsinghua-Tencent100K dataset. Compared to YOLOv5, the APS was improved by 2.8% and 2.9%, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI