目标检测
人工智能
航空影像
计算机视觉
比例(比率)
特征(语言学)
计算机科学
航空影像
特征提取
对象(语法)
遥感
图像融合
模式识别(心理学)
传感器融合
融合
图像(数学)
地图学
地理
语言学
哲学
作者
Lingjie Jiang,Baoxi Yuan,Jiawei Du,Boyu Chen,Hanfei Xie,Juan Tian,Ziqi Yuan
标识
DOI:10.1109/tim.2024.3381272
摘要
Unmanned aerial vehicle (UAV) aerial image object detection is a valuable and challenging research field. Despite the breakthrough of deep learning-based object detection networks in natural scenes, UAV images often exhibit characteristics such as a high proportion of small objects, dense distribution, and significant variations in object scales, posing great challenges for accurate detection. To address these issues, we propose an innovative multi-scale feature fusion small object detection network (MFFSODNet). First, concerning the high proportion of small objects in UAV images, an additional tiny object prediction head is introduced instead of the large object prediction head. This approach provides a good detection accuracy of small objects and significantly reduces the parameters. Secondly, to enhance the feature extraction capability of the network for fine-grained information from small objects, a multi-scale feature extraction module (MSFEM) is designed, which could extract rich and valuable multi-scale feature information through convolution operation of different scales on multiple branches. Thirdly, to fuse the fine-grained information from shallow feature maps and the semantic information from deep feature maps, a new bidirectional dense feature pyramid network (BDFPN) is proposed. By expanding the feature pyramid network scale and introducing skip connections, BDFPN achieves efficient multi-scale information fusion. Extensive experiments on the VisDrone and UAVDT benchmark datasets demonstrate that MFFSODNet outperforms state-of-the-art object detection methods and further validate the effectiveness and generalization of MFFSODNet on photovoltaic array defect datasets (PVD).
科研通智能强力驱动
Strongly Powered by AbleSci AI