计算机科学
人工智能
对象(语法)
傅里叶级数
推论
傅里叶变换
系列(地层学)
代表(政治)
目标检测
模式识别(心理学)
计算机视觉
过程(计算)
特征提取
特征(语言学)
视觉对象识别的认知神经科学
算法
数学
法学
数学分析
古生物学
哲学
操作系统
政治
生物
语言学
政治学
作者
Jin Liu,Zhongyuan Lu,Yingying Cen,Hui Hu,Zhenfeng Shao,Yong Hong,Ming Jiang,Miaozhong Xu
标识
DOI:10.1109/tpami.2025.3526990
摘要
Traditional object detection models often lose the detailed outline information of the object. To address this problem, we propose the Fourier Series Object Detection (FSD). It encodes the object's outline closed curve into two one-dimensional periodic Fourier series. The Fourier Series Model (FSM) is constructed to regress the Fourier series for each object in the image. Thus, during inference, the detailed outline information of each object can be retrieved. We introduce Rolling Optimization Matching for Fourier loss to ensure that the model's learning process is not affected by the sequence of the starting points of the labeled contour points, speeding up the training process. The FSM demonstrates improved feature extraction and descriptive capabilities for non-rectangular or elongated object regions. The model achieves AP50=73.3% on the DOTA 1.5 dataset, which surpasses the state-of-the-art (SOTA) method by 6.44% at 66.86%. On the UCAS dataset, the model achieves AP50=97.25%, also surpassing the performance indicators of the SOTA methods. Furthermore, we introduce the object's Fourier power spectrum to describe outline features and the Fourier vector to indicate its direction. This enhances the scene semantic representation of the object detection model and paves a new pathway for the evolution of object detection methodologies.
科研通智能强力驱动
Strongly Powered by AbleSci AI