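Computer science
Feature (linguistics)
Artificial intelligence
Pyramid (geometry)
Redundancy (engineering)
Object detection
Backbone network
Scale (ratio)
Computer vision
Context (archaeology)
Feature extraction
Pattern recognition (psychology)
Mathematics
Geography
Computer network
Philosophy
Linguistics
Geometry
Cartography
Archaeology
Operating system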
Authors
Zhaodong Chen,Hongbing Ji,Yongquan Zhang,Zhigang Zhu,Yifan Li
Source
Journal: IEEE Transactions on Circuits and Systems for Video Technology
[Institute of Electrical and Electronics Engineers]
Date: 2024-01-01
Volume (issue): 34 (1): 475-489
Citations: 9
Identifiers
DOI:10.1109/tcsvt.2023.3286896
Abstract
Object detection has developed rapidly in recent years with the help of deep learning technologies. However, object detection in drone-view images remains challenging for two main reasons: (1) it is difficult to detect small-scale objects that lack detailed information; (2) the diversity of drone camera angles brings dramatic differences in object scale. Although the feature pyramid network (FPN) alleviates the problem caused by scale differences to some extent, it also retains some worthless features, which wastes resources and slows down detection. In this work, we propose a novel High-Resolution Feature Pyramid Network (HR-FPN) to improve the detection accuracy of small-scale objects and avoid feature redundancy. The key components of HR-FPN are a high-resolution feature alignment module (HRFA), a high-resolution feature fusion module (HRFF), and a multi-scale decoupled head (MSDH). HRFA feeds multi-scale features from the backbone into parallel resampling channels to obtain high-resolution features at the same scale. HRFF establishes a bottom-up path to distribute context-rich low-level semantic information to all layers, which are then aggregated into a classification feature and a localization feature. MSDH copes with the scale differences of objects by predicting the categories and locations of objects at different scales separately. Moreover, we train the model with a scale-weighted loss to focus more on small-scale objects. Extensive experiments and comprehensive evaluations demonstrate the effectiveness and advancement of our approach.
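The abstract does not give the form of the scale-weighted loss, so the following is only a minimal sketch of the general idea: objects with smaller boxes receive larger weights when per-object losses are averaged. The weighting rule `2 - relative_area` and the names `scale_weight` and `scale_weighted_loss` are assumptions made for illustration, not the formula or API used in the paper.

```python
def scale_weight(box_w, box_h, img_w, img_h):
    """Hypothetical weight in [1, 2]: the smaller the box, the closer to 2."""
    rel_area = (box_w * box_h) / (img_w * img_h)
    rel_area = min(max(rel_area, 0.0), 1.0)  # clamp to [0, 1]
    return 2.0 - rel_area


def scale_weighted_loss(per_object_losses, boxes, img_w, img_h):
    """Weighted mean of per-object losses.

    boxes holds one (width, height) pair in pixels per object.
    This is an illustrative assumption, not the paper's loss.
    """
    if len(per_object_losses) != len(boxes):
        raise ValueError("need one box per loss value")
    if not boxes:
        return 0.0
    weights = [scale_weight(w, h, img_w, img_h) for w, h in boxes]
    total = sum(wt * loss for wt, loss in zip(weights, per_object_losses))
    return total / sum(weights)


if __name__ == "__main__":
    # A small object with a high loss and a large object with zero loss.
    losses = [2.0, 0.0]
    boxes = [(10, 10), (100, 100)]
    print(scale_weighted_loss(losses, boxes, img_w=100, img_h=100))
```

Under this assumed rule, the error on the small object pulls the weighted mean above the plain average of the two losses, which is the behavior the abstract describes as focusing more on small-scale objects.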