帧(网络)
缓冲器(光纤)
人工智能
计算机科学
计算机视觉
离群值
滤波器(信号处理)
图像(数学)
模式识别(心理学)
电信
作者
Yang Wang,Hongguang Li,Xinjun Li,Zhipeng Wang,Baochang ZHANG
标识
DOI:10.1016/j.cja.2024.02.014
摘要
With rapid development of UAV technology, research on UAV image analysis has gained attention. As the existing techniques of UAV target localization often rely on additional equipment, a method of UAV target localization based on depth estimation has been proposed. However, the unique perspective of UAVs poses challenges such as the significant field of view variations and the presence of dynamic objects in the scene. As a result, the existing methods of depth estimation and scale recovery cannot be directly applied to UAV perspectives. Additionally, there is a scarcity of depth estimation datasets tailored for UAV perspectives, which makes supervised algorithms impractical. To address these issues, an outlier filter is introduced to enhance the applicability of depth estimation networks to target localization. A frame buffer method is proposed to achieve more accurate scale recovery, so as to handle complex scene textures in UAV images. The proposed method demonstrates a 14.29% improvement over the baseline. Compared with the average recovery results from UAV perspectives, the difference is only 0.88%, approaching the performance of scale recovery using ground truth labels. Furthermore, to overcome the limited availability of traditional UAV depth datasets, a method for generating depth labels from video sequences is proposed. Compared to state-of-the-art methods, the proposed approach achieves higher accuracy in depth estimation and stands for the first attempt at target localization using image sequences. Proposed algorithm and dataset are available at https://github.com/uav-tan/uav-object-localization.
科研通智能强力驱动
Strongly Powered by AbleSci AI