R2YOLOX: A Lightweight Refined Anchor-Free Rotated Detector for Object Detection in Aerial Images

计算机科学目标检测探测器人工智能高斯分布最小边界框计算机视觉先验概率推论对象（语法）采样（信号处理）超参数模式识别（心理学）算法图像（数学）电信贝叶斯概率物理量子力学

作者

Fei Liu,Renwen Chen,Junyi Zhang,Kailing Xing,Hao Liu,Jinchang Qin

出处

期刊：IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
日期：2022-01-01 卷期号：60: 1-15 被引量：22

标识

DOI：10.1109/tgrs.2022.3215472

摘要

Existing anchor-based rotated object detection methods have achieved some amazing results, but these methods require some manual preset anchors, which not only introduce additional hyperparameters but also introduce extra computational burdens. Due to the above drawbacks, anchor-free methods have been rapidly developed in recent years. However, the existing high-performance anchor-free rotated object detection methods are relatively complex and the inference speed is also slow. And Yolo series models not only maintain high-efficiency inference but also keep competitive performance detection performance in the general object detection tasks. Hence, we propose an anchor-free rotated detector based on the YOLOX method for object detection in aerial images. Our methods consist of two improvements: a Refined Rotated Module (RRM) and a new assigner method which is called the Gaussian distribution Sampling Optimal Transport Assignment method (GSOTA). The RRM can align features and get more useful priors for final detector heads. The GSOTA uses Gaussian Distribution to model the oriented bounding box (OBB) firstly, and a Gaussian Center Sampling method (GCS) with maximum classification center mean (MCCM) is proposed to simplify the label Assignment Optimal Transport problem, finally using an improved dynamic top-k strategy to get an approximate solution. Extensive experiments demonstrate that our models can achieve competitive performance in several challenging aerial object detection datasets while keeping the best efficiency. Our R2YOLOX-X model achieves 79.33%, 97.4%, 97.7%, and 92.5% mAP on the DOTA, HRSC2016, UCAS-AOD, and FGSD2021, respectively, while R2YOLOX-S can reach the fastest 58.2 FPS when inferencing on aerial datasets and R2YOLOX-L gets the best speed-accuracy trade-off.

求助该文献

最长约 10秒，即可获得该文献文件

R2YOLOX: A Lightweight Refined Anchor-Free Rotated Detector for Object Detection in Aerial Images

今日热心研友