计算机科学
人工智能
可解释性
无人机
分类器(UML)
对象(语法)
目标检测
机器学习
变压器
模式识别(心理学)
工程类
遗传学
生物
电气工程
电压
作者
Xingkui Zhu,Shuchang Lyu,Xu Wang,Qi Zhao
出处
期刊:International Conference on Computer Vision
日期:2021-10-01
被引量:1188
标识
DOI:10.1109/iccvw54120.2021.00312
摘要
Object detection on drone-captured scenarios is a recent popular task. As drones always navigate in different altitudes, the object scale varies violently, which burdens the optimization of networks. Moreover, high-speed and low-altitude flight bring in the motion blur on the densely packed objects, which leads to great challenge of object distinction. To solve the two issues mentioned above, we propose TPH-YOLOv5. Based on YOLOv5, we add one more prediction head to detect different-scale objects. Then we replace the original prediction heads with Transformer Prediction Heads (TPH) to explore the prediction potential with self-attention mechanism. We also integrate convolutional block attention model (CBAM) to find attention region on scenarios with dense objects. To achieve more improvement of our proposed TPH-YOLOv5, we provide bags of useful strategies such as data augmentation, multi-scale testing, multi-model integration and utilizing extra classifier. Extensive experiments on dataset VisDrone2021 show that TPH-YOLOv5 have good performance with impressive interpretability on drone-captured scenarios. On DET-test-challenge dataset, the AP result of TPH-YOLOv5 are 39.18%, which is better than previous SOTA method (DPNetV3) by 1.81%. On VisDrone Challenge 2021, TPH-YOLOv5 wins 5 th place and achieves well-matched results with 1 st place model (AP 39.43%). Compared to baseline model (YOLOv5), TPH-YOLOv5 improves about 7%, which is encouraging and competitive.
科研通智能强力驱动
Strongly Powered by AbleSci AI