MFFSODNet: Multiscale Feature Fusion Small Object Detection Network for UAV Aerial Images

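Object detection; Artificial intelligence; Aerial imagery; Computer vision; Scale (ratio); Feature (linguistics); Computer science; Feature extraction; Object (grammar); Remote sensing; Image fusion; Pattern recognition (psychology); Sensor fusion; Fusion; Image (mathematics); Cartography; Geography; Linguistics; Philosophy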

Authors

Lingjie Jiang, Baoxi Yuan, Jiawei Du, Boyu Chen, Hanfei Xie, Juan Tian, Ziqi Yuan

Source

Journal: IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers]
Date: 2024-01-01 Volume/Issue: 73: 1-14 Citations: 112

Identifier

DOI: 10.1109/tim.2024.3381272

Abstract

Unmanned aerial vehicle (UAV) aerial image object detection is a valuable and challenging research field. Although deep learning-based object detection networks have achieved breakthroughs in natural scenes, UAV images often contain a high proportion of small objects, densely distributed objects, and large variations in object scale, all of which make accurate detection difficult. To address these issues, we propose an innovative multi-scale feature fusion small object detection network (MFFSODNet). First, because small objects make up a high proportion of UAV images, an additional tiny object prediction head is introduced in place of the large object prediction head. This change provides good detection accuracy on small objects and significantly reduces the number of parameters. Second, to enhance the network's ability to extract fine-grained information from small objects, a multi-scale feature extraction module (MSFEM) is designed; it extracts rich and valuable multi-scale feature information by applying convolutions of different scales on multiple branches. Third, to fuse the fine-grained information in shallow feature maps with the semantic information in deep feature maps, a new bidirectional dense feature pyramid network (BDFPN) is proposed. By expanding the scale of the feature pyramid network and introducing skip connections, BDFPN achieves efficient multi-scale information fusion. Extensive experiments on the VisDrone and UAVDT benchmark datasets demonstrate that MFFSODNet outperforms state-of-the-art object detection methods, and experiments on photovoltaic array defect (PVD) datasets further validate its effectiveness and generalization.
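
The first design choice in the abstract, replacing the large object prediction head with a tiny object head, can be illustrated with a rough sketch of prediction grid sizes. The strides (4, 8, 16, 32), the level names P2 to P5, the 640-pixel input, and the 12-pixel object are all assumptions borrowed from common YOLO-style detectors; the abstract does not state them, so this is not the authors' configuration.

```python
# Hypothetical strides: assumed YOLO-style conventions, not taken from the paper.
BASELINE_STRIDES = {"P3": 8, "P4": 16, "P5": 32}   # small / medium / large heads
TINY_HEAD_STRIDES = {"P2": 4, "P3": 8, "P4": 16}   # tiny head added, large head dropped


def grid_size(image_size, stride):
    """Side length of the prediction grid for a square input image."""
    return image_size // stride


def cells_covered(object_size, stride):
    """Approximate number of grid cells a square object spans along one side."""
    return object_size / stride


def summarize(strides, image_size, object_size):
    """Map each head name to (grid side length, cells spanned by the object)."""
    return {
        name: (grid_size(image_size, s), cells_covered(object_size, s))
        for name, s in strides.items()
    }


if __name__ == "__main__":
    # Assumed 640x640 input and a 12x12-pixel object, chosen only for illustration.
    for label, strides in (("baseline", BASELINE_STRIDES), ("tiny-head", TINY_HEAD_STRIDES)):
        print(label, summarize(strides, image_size=640, object_size=12))
```

Under these assumed numbers, a 12-pixel object spans less than half a cell on the stride-32 grid but three cells on the stride-4 grid, which is one intuition for why a higher-resolution head may suit small objects better.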
