计算机科学
目标检测
探测器
人工智能
高斯分布
最小边界框
计算机视觉
先验概率
推论
对象(语法)
采样(信号处理)
超参数
模式识别(心理学)
算法
图像(数学)
电信
贝叶斯概率
物理
量子力学
作者
Fei Liu,Renwen Chen,Junyi Zhang,Kailing Xing,Hao Liu,Jinchang Qin
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2022-01-01
卷期号:60: 1-15
被引量:22
标识
DOI:10.1109/tgrs.2022.3215472
摘要
Existing anchor-based rotated object detection methods have achieved some amazing results, but these methods require some manual preset anchors, which not only introduce additional hyperparameters but also introduce extra computational burdens. Due to the above drawbacks, anchor-free methods have been rapidly developed in recent years. However, the existing high-performance anchor-free rotated object detection methods are relatively complex and the inference speed is also slow. And Yolo series models not only maintain high-efficiency inference but also keep competitive performance detection performance in the general object detection tasks. Hence, we propose an anchor-free rotated detector based on the YOLOX method for object detection in aerial images. Our methods consist of two improvements: a Refined Rotated Module (RRM) and a new assigner method which is called the Gaussian distribution Sampling Optimal Transport Assignment method (GSOTA). The RRM can align features and get more useful priors for final detector heads. The GSOTA uses Gaussian Distribution to model the oriented bounding box (OBB) firstly, and a Gaussian Center Sampling method (GCS) with maximum classification center mean (MCCM) is proposed to simplify the label Assignment Optimal Transport problem, finally using an improved dynamic top-k strategy to get an approximate solution. Extensive experiments demonstrate that our models can achieve competitive performance in several challenging aerial object detection datasets while keeping the best efficiency. Our R2YOLOX-X model achieves 79.33%, 97.4%, 97.7%, and 92.5% mAP on the DOTA, HRSC2016, UCAS-AOD, and FGSD2021, respectively, while R2YOLOX-S can reach the fastest 58.2 FPS when inferencing on aerial datasets and R2YOLOX-L gets the best speed-accuracy trade-off.
科研通智能强力驱动
Strongly Powered by AbleSci AI