Survey of small object detection

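Object detection; Computer science; Artificial intelligence; Convolutional neural network; Viola–Jones object detection framework; Object-class detection; Object (grammar); Pattern recognition (psychology); Computer vision; Feature extraction; Histogram of oriented gradients; Feature (linguistics); Histogram; Face detection; Image (mathematics); Linguistics; Philosophy; Facial recognition system
Authors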
Xiaoying Pan,Ning Jia,Yuanzhen Mu,Xuanrong Gao
Source
Journal: Journal of Image and Graphics [University of Portsmouth]
Volume (issue): 28 (9): 2587-2615 Citations: 9
Identifier
DOI:10.11834/jig.220455
Abstract

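With the rapid development of computer vision and artificial intelligence, object detection has attracted increasingly broad attention. Small objects occupy few pixels, carry little semantic information, are easily disturbed by complex scenes, and tend to cluster and occlude one another, so small object detection has long been one of the major difficulties in object detection. Vision-based small object detection is becoming increasingly important in many areas of daily life. To further promote the development of small object detection, improve its accuracy and speed, and optimize its algorithm models, this paper reviews the state of research and results at home and abroad with respect to the problems in small object detection. First, the difficulties of small object detection are analyzed from the perspectives of the visual features of small objects, object distribution, and the detection environment. Small object detection methods are then systematically summarized in terms of data augmentation, super-resolution, multiscale feature fusion, contextual semantic information, anchor mechanisms, attention mechanisms, and specific detection scenarios, and relatively mature single-stage small object detection methods are organized by framework structure, loss function, and prediction and matching mechanisms. Second, the evaluation metrics for small object detection and the datasets that can be used for it are introduced in detail, and the detection results and visualized detection outputs of several classic small object detection methods on MSCOCO (Microsoft common objects in context), VisDrone2021 (vision meets drones 2021), Tsinghua-Tencent100K, and other datasets are compared and analyzed. Finally, the challenges facing small object detection in the future are analyzed and discussed together with the corresponding research directions, including how to address the difficulty of localizing small objects, the effect of network downsampling on small objects, and the unsuitability of intersection-over-union threshold settings for small objects.

In recent years, object detection has attracted increasing attention because of the rapid development of computer vision and artificial intelligence technology. Early traditional object detection methods, such as histogram of oriented gradient (HOG) and deformable parts model (DPM), usually adopt three steps: region selection, manual feature extraction, and classification and regression. However, manual feature extraction has great limitations for small object detection. Object detection algorithms based on convolutional neural networks can be divided into two-stage and one-stage detection algorithms. Two-stage detection algorithms, such as faster region with convolutional neural network (Faster RCNN) and cascade region with convolutional neural network (Cascade RCNN), select candidate regions through a region proposal network. Then, they classify and regress these regions to obtain the detection results. However, their accuracy in small object detection remains low. One-stage detection algorithms, such as single shot MultiBox detector (SSD) and you only look once (YOLO), directly locate the object and output its category, thereby improving the speed of object detection to a certain extent. However, small object detection has always been a huge challenge in the field of object detection because of the small proportion of small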
object pixels, little semantic information, and the ease with which small objects are disturbed by complex scenes. In particular, the challenges in small object detection are as follows. First, small objects have few features. Given their small scale and small coverage area in images, extracting useful semantic feature information during network training is difficult. Second, small object detection is susceptible to interference. Most small objects have low resolution, blurred appearance, and little visual information, so feature extraction is difficult and easily disturbed. As a result, the detection model cannot easily locate and identify small objects accurately, and many false detections and missed detections occur. Third, small object datasets are scarce. At present, most mainstream object detection datasets, such as PASCAL VOC and MS-COCO, are aimed at normal-scale objects; the proportion of small-scale objects in them is insufficient, and their distribution is uneven. Meanwhile, the datasets mentioned in this study that can be used for small object detection are all aimed at specific scenes or tasks. These datasets, which include the DOTA remote sensing object detection dataset and the face detection dataset and benchmark, are not universal for small object detection. Fourth, small objects tend to cluster and occlude one another. A serious occlusion problem occurs when small objects cluster. After many downsampling and pooling operations, much feature information is lost, which makes detection difficult. At present, visual small object detection is increasingly important in all fields of life. Aiming at the problems in small object detection, this study reviews the research status and achievements of small object detection at home and abroad to further promote its development, improve its speed and accuracy, and optimize its algorithm models. The methods of small object detection are
analyzed and summarized from the aspects of data augmentation, super-resolution, multiscale feature fusion, contextual semantic information, anchor box mechanism, attention mechanism, and specific detection scenarios. Data augmentation was proposed to address the scarcity of general small object datasets, the small number of small objects in public datasets, and the uneven distribution of small objects in images. The earliest data augmentation strategies increase the number of training instances and improve detection performance by deforming, rotating, scaling, cropping, and translating object instances. Other effective data augmentation methods then emerged, including oversampling the images that contain small objects, scaling and rotating the small objects, and copying them to arbitrary new positions. Data augmentation helps improve the robustness of a model to a certain extent, alleviates the problems of indistinct visual features and limited object information, and achieves good results in final detection performance. However, an improperly designed data augmentation strategy may introduce new noise in practical applications and impair feature extraction, which also poses challenges for algorithm design. Small object detection methods based on multiscale fusion need to make full use of the detailed information in the image because small-scale objects carry little feature information. In existing convolutional neural network (CNN) models for general object detection, multiscale detection helps the model obtain accurate localization information and discriminative feature information by using low-level feature layers, which is conducive to the detection and recognition of small-scale objects. First, the feature pyramid network (FPN), which has strong semantic features at all scales, was introduced. Then, an FPN-based path
aggregation network (PANet) not only achieved good results in instance segmentation but also improved the detection of small objects. In feature fusion, the residual feature augmentation method extracts ratio-invariant context information to reduce the information loss of the highest-level pyramid feature map. At present, many methods are based on multiscale feature fusion, which uses the high-resolution low-level features and the semantically strong high-level features of the network to improve the detection accuracy of small objects. In small object detection, the feature representation of the target is weak, so the network structure must be deepened to learn more feature information. Introducing an attention mechanism can often make the network model pay more attention to the channels and regions related to the task. In object detection networks, the shallow feature maps lack the contextual semantic information of small objects. By incorporating attention mechanisms into the SSD model, irrelevant information in feature fusion is suppressed, which improves the detection accuracy of small objects. In general, the attention mechanism can allocate resources reasonably, quickly find the region of interest, and ignore distracting information. However, an improperly designed attention mechanism increases the computational cost of the network and impairs the model's extraction of object features. Finally, future research directions for small object detection are discussed. Visual small object detection is becoming increasingly important in all fields of life, and it will develop in further directions in the future.
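
The abstract lists the unsuitability of intersection-over-union (IoU) thresholds for small objects among the open challenges. The following is a minimal sketch of why that can happen, not code from the paper: it compares the IoU of a square ground-truth box with a copy of itself shifted diagonally by a few pixels. The function names `iou` and `shifted_iou`, the box sizes, and the 4-pixel shift are all illustrative assumptions.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped at zero when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def shifted_iou(size, shift):
    """IoU between a size x size box and the same box moved by `shift` pixels in x and y.

    Hypothetical helper for illustration only.
    """
    ground_truth = (0, 0, size, size)
    prediction = (shift, shift, size + shift, size + shift)
    return iou(ground_truth, prediction)


if __name__ == "__main__":
    # Same 4-pixel localization error, three different object sizes (assumed values).
    for size in (16, 64, 256):
        print(f"{size:>3} px box, 4 px shift: IoU = {shifted_iou(size, 4):.3f}")
```

Under these assumptions, the same 4-pixel error gives an IoU of about 0.39 for the 16-pixel box, about 0.78 for the 64-pixel box, and about 0.94 for the 256-pixel box. With a commonly used matching threshold of 0.5, only the smallest box would be counted as a miss, which is one way to read the abstract's concern about fixed IoU thresholds.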