特征(语言学)
计算机科学
增采样
遥感
比例(比率)
探测器
棱锥(几何)
人工智能
骨干网
对象(语法)
目标检测
卷积(计算机科学)
GSM演进的增强数据速率
计算机视觉
图像(数学)
模式识别(心理学)
地质学
人工神经网络
地理
计算机网络
数学
地图学
电信
哲学
语言学
几何学
作者
Jinping Liu,Kunyi Zheng,Xianyi Liu,Pengfei Xu,Ying Zhou
标识
DOI:10.1016/j.imavis.2024.104898
摘要
Object detection in remote sensing images (RSIs) plays a crucial role in aerial and satellite image analysis. Existing methods lack the capability to effectively detect small and multi-scale objects in RSIs. Consequently, achieving an optimal trade-off between speed and accuracy remains unattainable. Extensive investigation reveals that state-of-the-art detectors have largely overlooked two critical aspects: Spatial artifacts from convolution operations and gradient confusion caused by neighboring levels in the Feature Pyramid Network. To address the first problem, we propose adopting a non-reorganized patch-embedding layer in the downsampling stage and a dual-path learning network (DPLNet) as the backbone, which can effectively mitigate the adverse effects of the edge pixel feature bias in feature maps. Additionally, using DPLNet as the backbone network can minimize costs while learning the intrinsic feature information of objects in RSI. For the second aspect, we propose a neighbor-erasing module with only one gradient flow (OGF-NEM). This module utilizes deep features to erase large objects to highlight small objects in shallow features and changes the backpropagation path to prevent the backflow of unreasonable gradients and the erosion of information from neighbor scales. Thus, a novel detector, called SDSDet, is proposed, which achieves excellent performance for small, dense, and multi-scale objects in RSIs. We have conducted exhaustive experiments on DOTA and MS COCO datasets. Specifically, the SDSDet achieves 42.8% AP on DOTA and 33.3% AP on MS COCO, together with nearly 4.87 M model size and 95 FPS.
科研通智能强力驱动
Strongly Powered by AbleSci AI