闭塞
计算机视觉
人工智能
计算机科学
任务(项目管理)
特征(语言学)
过程(计算)
模式识别(心理学)
工程类
医学
语言学
操作系统
哲学
心脏病学
系统工程
作者
Xuexue Li,Wenhui Diao,Yiqi Mao,Peng Gao,Xiuhua Mao,Xinming Li,Xian Sun
出处
期刊:Isprs Journal of Photogrammetry and Remote Sensing
日期:2023-05-01
卷期号:199: 242-257
被引量:3
标识
DOI:10.1016/j.isprsjprs.2023.04.009
摘要
Occlusion between objects is one of the overlooked challenges for object detection in UAV images. Due to the variable altitude and angle of UAVs, occlusion in UAV images happens more frequently than that in natural scenes. Compared to occlusion in natural scene images, occlusion in UAV images happens with feature confusion problem and local aggregation characteristic. And we found that extracting or localizing occlusion between objects is beneficial for the detector to address this challenge. According to this finding, the occlusion localization task is introduced, which together with the object detection task constitutes our occlusion-guided multi-task network (OGMN). The OGMN contains the localization of occlusion and two occlusion-guided multi-task interactions. In detail, an occlusion estimation module (OEM) is proposed to precisely localize occlusion. Then the OGMN utilizes the occlusion localization results to implement occlusion-guided detection with two multi-task interactions. One interaction for the guide is between two task decoders to address the feature confusion problem, and an occlusion decoupling head (ODH) is proposed to replace the general detection head. Another interaction for guide is designed in the detection process according to local aggregation characteristic, and a two-phase progressive refinement process (TPP) is proposed to optimize the detection process. Extensive experiments demonstrate the effectiveness of our OGMN on the Visdrone and UAVDT datasets. In particular, our OGMN achieves 35.0% mAP on the Visdrone dataset and outperforms the baseline by 5.3%. And our OGMN provides a new insight for accurate occlusion localization and achieves competitive detection performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI