计算机科学
目标检测
行人检测
特征提取
人工智能
行人
最小边界框
趋同(经济学)
跳跃式监视
卷积(计算机科学)
计算
深度学习
特征(语言学)
人工神经网络
计算机视觉
模式识别(心理学)
图像(数学)
算法
哲学
经济
工程类
经济增长
语言学
运输工程
作者
Xiaxia Zhang,Ning Li,Ruixin Zhang
出处
期刊:International Conference on Consumer Electronics
日期:2021-01-15
被引量:20
标识
DOI:10.1109/iccece51280.2021.9342416
摘要
Recently, most object detection under videos have increasingly relied on the Unmanned Aerial Vehicle (UAV) platforms because of UAVs' timeliness, pertinence, and high flexibility in data acquisition. Convolution neural networks, especially for YOLO v3, have proved to be effective in intelligent pedestrian detection. However, two problems need to be solved in pedestrian detection of UAV images. One is more small pedestrian objects in UAV images; the other is the complex structure of Darknet53 in YOLO v3, which requires massive computation. To solve these problems, an improved lightweight network MobileNetv3 based on YOLO v3 is proposed. First, the improved MobileNetv3 takes place of the Darknet53 for feature extraction to reduce algorithm complexity and model simplify. Second, complete IoU loss by incorporating the overlap area, central point distance and aspect ratio in bounding box regression, is introduced into YOLO v3 to lead to faster convergence and better performance. Moreover, a new attention module SESAM is constructed by channel attention and spatial attention in MobileNetv3. It can effectively judge long-distance and small-volume objects. The experimental results have shown that the proposed model improves the performance of pedestrian detection of UAV images.
科研通智能强力驱动
Strongly Powered by AbleSci AI