Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention

计算机科学人工智能特征（语言学）目标检测比例（比率）光学（聚焦）对象（语法）相似性（几何）模式识别（心理学）计算机视觉频道（广播）代表（政治）探测器低分辨率图像（数学）高分辨率遥感电信哲学语言学计算机网络物理量子力学政治法学政治学光学地质学

作者

Gang Song,Hongwei Du,Xinyue Zhang,Fangxun Bao,Yunfeng Zhang

出处

期刊：Engineering Applications of Artificial Intelligence [Elsevier BV]
日期：2023-11-09 卷期号：128: 107455-107455 被引量：6

标识

DOI：10.1016/j.engappai.2023.107455

摘要

Small object detection in unmanned aerial vehicle images is always challenging due to the low resolution and the limited amount of information that they contain. Many feature enhancement effects have been introduced to improve the detection of small objects, but the extracted effective information is still insufficient, and redundant information interference is an issue. In this paper, we propose a new multi-scale hybrid attention based detector (MHA-YOLOv5), which integrates the similarity relationships between objects into you only look once version 5 (YOLOv5) for small object detection. Specifically, a novel multi-scale hybrid attention (MHA) structure is proposed to enhance the feature representation of small objects. This structure contains three modules: multi-scale attention (MsA), foreground enhancement module (FEM) and depthwise separable channel attention (DSCA). The MsA module is designed to build connections between large objects with abundant details and small objects with insufficient features on multiple scale features and capture the similarity relationships between objects. To reduce the interference of redundant information, the FEM is used to focus on the foreground features of multiple scale features, and the DSCA module is utilized to effectively extract multidimensional channel information. Sufficient experiments on the challenging VisDrone2019-DET, UAVDT and CARPK datasets demonstrate the effectiveness and superiority of the proposed approach. Specifically, compared with the performance of YOLOv5, MHA-YOLOv5 demonstrates a 2.82% mean average precision (mAP) improvement on the VisDrone2019-DET dataset, a 2.25% mAP improvement on the UAVDT dataset, and a 3.07% mAP improvement on the CARPK dataset.

求助该文献

最长约 10秒，即可获得该文献文件

Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention

今日热心研友