计算机科学
棱锥(几何)
目标检测
对象(语法)
人工智能
联营
特征(语言学)
频道(广播)
特征提取
比例(比率)
计算机视觉
模式识别(心理学)
图层(电子)
数据挖掘
遥感
地理
数学
计算机网络
语言学
哲学
化学
几何学
地图学
有机化学
作者
Bingbing Wang,Fengxiang Zhang,Kaipeng Li,Kuijie Shi,Lei Wang,Gang Liu
出处
期刊:Intelligent Data Analysis
[IOS Press]
日期:2023-10-27
卷期号:27 (6): 1725-1739
被引量:1
摘要
Small object detection has a broad application prospect in image processing of unmanned aerial vehicles, autopilot and remote sensing. However, some difficulties exactly exist in small object detection, such as aggregation, occlusion and insufficient feature extraction, resulting in a great challenge for small object detection. In this paper, we propose an improved algorithm for small object detection to address these issues. By using the spatial pyramid to extract multi-scale spatial features and by applying the multi-scale channel attention to capture the global and local semantic features, the spatial pooling pyramid and multi-scale channel attention module (SPP-MSCAM) is constructed. More importantly, the fusion of the shallower layer with higher resolution and a deeper layer with more semantic information is introduced to the neck structure for improving the sensitivity of small object features. A large number of experiments on the VisDrone2019 dataset and the NWPU VHR-10 dataset show that the proposed method significantly improves the Precision, mAP and mAP50 compared to the YOLOv5 method. Meanwhile, it still preserves a considerable real-time performance. Undoubtedly, the improved network proposed in this paper can effectively alleviate the difficulties of aggregation, occlusion and insufficient feature extraction in small object detection, which would be helpful for its potential applications in the future.
科研通智能强力驱动
Strongly Powered by AbleSci AI