清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Survey of small object detection

目标检测 计算机科学 人工智能 卷积神经网络 Viola–Jones对象检测框架 对象类检测 对象(语法) 模式识别(心理学) 计算机视觉 特征提取 定向梯度直方图 特征(语言学) 直方图 人脸检测 图像(数学) 语言学 哲学 面部识别系统
作者
Xiaoying Pan,Ning Jia,Yuanzhen Mu,Xuanrong Gao
出处
期刊:Journal of Image and Graphics 卷期号:28 (9): 2587-2615 被引量:7
标识
DOI:10.11834/jig.220455
摘要

随着计算机视觉和人工智能技术的快速发展,目标检测受到了更加广泛的关注。由于小目标像素占比小、语义信息少、易受复杂场景干扰以及易聚集遮挡等问题,导致小目标检测一直是目标检测领域中的一大难点。目前,视觉的小目标检测在生活的各个领域中日益重要。为了进一步促进小目标检测的发展,提高小目标检测的精度与速度,优化其算法模型,本文针对小目标检测中存在的问题,梳理了国内外研究现状及成果。首先,分别从小目标可视化特征、目标分布情况和检测环境等角度对小目标检测的难点进行了分析,同时从数据增强、超分辨率、多尺度特征融合、上下文语义信息、锚框机制、注意力机制以及特定的检测场景等方面系统总结了小目标检测方法,并整理了在框架结构、损失函数、预测和匹配机制等方面发展的较为成熟的单阶段小目标检测方法。其次,本文对小目标检测的评价指标以及可用于小目标检测的各类数据集进行了详细介绍,并针对部分经典的小目标检测方法在 MSCOCO(Microsoft common objects in context)、VisDrone2021(vision meets drones2021)和 Tsinghua-Tencent100K 等数据集上的检测结果及其可视化检测效果进行了对比与分析。最后,对未来小目标检测面临的挑战,包括如何解决小目标定位困难、网络模型下采样对小目标的影响、交并比阈值的设置对小目标不合理等问题和其对应的研究方向进行了分析与展望。;In recent years, object detection has attracted increasing attention because of the rapid development of computer vision and artificial intelligence technology.Early traditional object detection methods, such as histogram of oriented gradient(HOG)and deformable parts model(DPM)usually adopt three steps:region selection, manual feature extraction, and classification regression.However, manual feature extraction has great limitations for small object detection.The object detection algorithm based on the convolutional neural network can be divided into two-stage and one-stage detection algorithms.Two-stage detection algorithms, such as faster region with convolutional neural network(Faster RCNN)and cascade region with convolutional neural network(Cascade RCNN), select candidate regions through the region proposal network.Then, they classify and regress these regions to obtain the detection results.However, the problem of low accuracy still exists in small object detection.One-stage detection algorithms, such as single shot MultiBox detector(SSD)and you only look once(YOLO), can directly locate the object and output the category detection information of the object, thereby improving the speed of object detection to a certain extent.However, small object detection has always been a huge challenge in the field of object detection because of the small proportion of small object pixels, little semantic information, and small objects that are easily disturbed by complex scenes.In particular, the challenges in object detection are as follows:First, the characteristics of small objects are few.Given the small scale of small objects and the small coverage area in data images, extracting favorable semantic feature information in network training is difficult.Second, small object detection is susceptible to interference.Most of the small objects have low resolution, blurred images, and little visual information.Thus, they are easily disturbed during difficult feature extraction.Thus, the detection model cannot easily locate and identify small objects accurately.Moreover, many false detections and missed detections exist.Third, a shortage of small object datasets exists.At present, most of the mainstream object datasets, such as PASCAL VOC and MS-COCO, are aimed at normal-scale objects.In particular, the proportion of small-scale objects is insufficient, and the distribution is uneven.However, some datasets mentioned in this study that can be used for small object detection are all aimed at specific scenes or tasks.These datasets include DOTA remote sensing object detection dataset, face detection dataset and benchmark, which are not universal for small object detection.Fourth, small objects are easy to gather and block.A serious occlusion problem occurs when small objects gather.After many downsampling and pooling operations, quite a lot of feature information is lost, resulting in some detection difficulties.At present, visual small object detection is increasingly important in all fields of life.Aiming at the problems in small object detection, this study combs the research status and achievements of small object detection at home and abroad to promote the development of small object detection further, improve the speed and accuracy of small object detection, and optimize its algorithm model.The methods of small object detection are analyzed and summarized from the aspects of data enhancement, super resolution, multiscale feature fusion, contextual semantic information, anchor frame mechanism, attention, and specific detection scenarios.Data enhancement is the method proposed for solving the problems of a few general small object datasets, a small number of small objects in public datasets, and uneven distribution of small objects in images.The earliest data enhancement strategy is to increase the number of object training and improve the performance of object detection by deforming, rotating, scaling, cutting, and translating object instances.Then, other effective data augmentation methods emerged, which included oversampling the images containing small objects in the experiment, scaling and rotating the small objects, and copying the objects to any new position in order to augment the data.Data enhancement helps improve the robustness of a model to a certain extent.Moreover, it solves the problems of unobvious visual features of small objects and less object information.It also achieves good results in the final detection performance.However, the improper design of data enhancement strategy in practical applications may lead to new noise, impairing the performance of feature extraction.This scenario also brings some challenges to the design of the algorithm.The small object detection method based on multiscale fusion needs to make full use of the detailed information in the image because the characteristic information of small-scale objects is little.In the existing convolutional neural network(CNN)model of general object detection, multiscale detection can help the model to obtain accurate positioning information and discriminating feature information by using a low-level feature layer.This scenario is conducive to the detection and recognition of small-scale objects.First, a feature pyramid network(FPN)with strong semantic features at all scales is introduced.Then, an fpn-based path aggregation network(PANet), which not only achieved good results in case segmentation but also improved the detection of small objects.In feature fusion, the residual feature enhancement method extracts the context information with a constant ratio to reduce the information loss of the highest pyramid feature map.At present, many methods are based on multiscale feature fusion, which uses the low-level highresolution and high-level strong feature semantic information of the network to improve the accuracy of small objects.In small object detection, the target's feature expression ability is weak.Thus, the network structure must be deepened to learn considerable feature information.Introducing an attention mechanism can often make the network model pay considerable attention to the channels and areas related to the task.In the object detection network, the shallow feature map lacks the contextual semantic information of small objects.By incorporating attention mechanisms into the SSD model, irrelevant information in feature fusion is suppressed, leading to an improvement in the detection accuracy of small objects.In general, the attention mechanism can reasonably allocate the used resources, quickly find the region of interest, and ignore disturbing information.However, the improper design in use increases the cost of network calculation and affects the extraction of object features by the model.Finally, the future research direction of small object detection is prospected.Visual small object detection is becoming increasingly important in all fields of life, and it will develop in other directions in the future.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Akim应助科研通管家采纳,获得10
1秒前
czj完成签到 ,获得积分10
28秒前
vassallo完成签到 ,获得积分10
38秒前
雪山飞龙完成签到,获得积分10
1分钟前
Jiangshan完成签到 ,获得积分10
1分钟前
旧城旧巷等旧人完成签到 ,获得积分10
1分钟前
王磊完成签到 ,获得积分10
1分钟前
万能图书馆应助baobeikk采纳,获得10
1分钟前
新奇完成签到 ,获得积分10
1分钟前
1分钟前
baobeikk发布了新的文献求助10
1分钟前
蝈蝈应助科研通管家采纳,获得10
2分钟前
蝴蝶完成签到 ,获得积分10
2分钟前
田田完成签到 ,获得积分10
2分钟前
Strive发布了新的文献求助10
2分钟前
ldjldj_2004完成签到 ,获得积分10
2分钟前
Strive完成签到,获得积分10
2分钟前
虚幻元风完成签到 ,获得积分10
2分钟前
轩辕剑身完成签到,获得积分0
2分钟前
DJ_Tokyo完成签到,获得积分10
3分钟前
研友_Ljqal8完成签到,获得积分10
3分钟前
冷傲的帽子完成签到 ,获得积分10
3分钟前
fengfenghao完成签到,获得积分10
3分钟前
小西完成签到 ,获得积分10
3分钟前
科研通AI2S应助科研通管家采纳,获得10
4分钟前
科研通AI2S应助科研通管家采纳,获得10
4分钟前
蝈蝈应助科研通管家采纳,获得10
4分钟前
英喆完成签到 ,获得积分10
4分钟前
无辜的行云完成签到 ,获得积分0
5分钟前
就叫希望吧完成签到 ,获得积分10
5分钟前
5分钟前
5分钟前
若眠完成签到 ,获得积分10
5分钟前
蝈蝈应助科研通管家采纳,获得10
6分钟前
文献搬运工完成签到 ,获得积分10
6分钟前
淡淡醉波wuliao完成签到 ,获得积分10
6分钟前
hongt05完成签到 ,获得积分10
6分钟前
稳重傲晴完成签到 ,获得积分10
6分钟前
oaoalaa完成签到 ,获得积分10
6分钟前
小瞎子_Zora完成签到 ,获得积分10
6分钟前
高分求助中
Histotechnology: A Self-Instructional Text 5th Edition 2000
Rock-Forming Minerals, Volume 3C, Sheet Silicates: Clay Minerals 2000
The late Devonian Standard Conodont Zonation 2000
Nickel superalloy market size, share, growth, trends, and forecast 2023-2030 2000
The Lali Section: An Excellent Reference Section for Upper - Devonian in South China 1500
PraxisRatgeber: Mantiden: Faszinierende Lauerjäger 800
The Healthy Socialist Life in Maoist China 600
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 3275201
求助须知:如何正确求助?哪些是违规求助? 2914192
关于积分的说明 8371642
捐赠科研通 2585067
什么是DOI,文献DOI怎么找? 1407396
科研通“疑难数据库(出版商)”最低求助积分说明 656880
邀请新用户注册赠送积分活动 637385