解码方法
计算机科学
编码(内存)
旋转(数学)
代表(政治)
相(物质)
算法
人工智能
探测器
目标检测
对象(语法)
方向(向量空间)
计算机视觉
数学
模式识别(心理学)
物理
几何学
电信
量子力学
政治
政治学
法学
作者
Fei Liu,Renwen Chen,Junyi Zhang,Shanshan Ding,Hao Liu,Shaofei Ma,Kailing Xing
出处
期刊:IEEE Geoscience and Remote Sensing Letters
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:21: 1-5
标识
DOI:10.1109/lgrs.2024.3397852
摘要
The classically oriented object detection method often suffers boundary discontinuity and square-like problems, hindering the model's ability to predict orientation accurately. Therefore, in this letter, we introduce a novel angle representation scheme named Coupled Dual-Frequency Phase-Shifting Coder (CDFP) which draws inspiration from optical measurement technology to address the aforementioned issues. Specifically, in the angle encoding stage, we represent the ground truth rotation angle as a combination of two 5-step phase-shifting at different frequencies and use this representation to supervise the learning of the model's angle branch. In the angle decoding stage, alongside utilizing the corresponding dual-frequency phase-shifting decoding and unwrapping method, we impose additional constraints on the decoding angle range for predicted square-like objects. Extensive experiments on three challenging aerial image datasets using different detectors prove the effectiveness of our approach. Specifically, our RetinaNet-CDFP achieves an average improvement of 2.16% AP50 and 6.83% AP75 on DOTA, and when combined with RTMDet, our RTMDet-R-m-CDFP achieves state-of-the-art detection performance on DIOR-R and DOTA, with 70.11% and 78.77% AP50, respectively. Our codes will be released at https://github.com/liufeinuaa/aisodet.git.
科研通智能强力驱动
Strongly Powered by AbleSci AI