人工智能
计算机科学
探测器
计算机视觉
目标检测
特征(语言学)
卷积(计算机科学)
模式识别(心理学)
水准点(测量)
深度学习
低空
高度(三角形)
对象(语法)
比例(比率)
人工神经网络
地理
地图学
数学
电信
语言学
哲学
几何学
作者
Payal Mittal,Akashdeep Sharma,Raman Singh,Vishal Dhull
标识
DOI:10.1016/j.eswa.2022.117106
摘要
The low-altitude aerial objects are hard to detect by existing deep learning-based object detectors because of the scale variance, small size, and occlusion-related problems. Deep learning-based detectors do not consider contextual information about the scale information of small-sized objects in low-altitude aerial images. This paper proposes a new system using the concept of receptive fields and fusion of feature maps to improve the efficiency of deep object detectors in low-altitude aerial images. A Dilated ResNet Module (DRM) is proposed, motivated from the trident networks, which works on dilated convolutions to study the contextual data for specifically small-sized objects. Applicability of this component builds the model strong towards scale variations in low-altitude aerial objects. Then, Feature Fusion Module (FFM) is created to offer semantic intelligence for better detection of low-altitude aerial objects. We have chosen vastly deployed faster RCNN as the base detector for the proposal of our technique. The dilated convolution-based RCNN using feature fusion (DCRFF) system is implemented on a benchmark low-altitude UAV based-object detection dataset, VisDrone, which contains multiple object categories of pedestrians, vehicles in crowded scenes. The experiments exhibit the enactment of the given detector on chosen low-altitude aerial object dataset. The proposed system of DCRFF achieves 35.04% mAP on the challenging VisDrone dataset, indicating an average improvement of 2% when compared.
科研通智能强力驱动
Strongly Powered by AbleSci AI