计算机科学
管道(软件)
分割
对象(语法)
人工智能
目标检测
发电机(电路理论)
修补
图像(数学)
像素
生成语法
计算机视觉
模式识别(心理学)
数据挖掘
功率(物理)
程序设计语言
物理
量子力学
作者
Brais Bosquet,Daniel Cores,Lorenzo Seidenari,V.M. Brea,Manuel Mucientes,Alberto Del Bimbo
标识
DOI:10.1016/j.patcog.2022.108998
摘要
Object detection accuracy on small objects, i.e., objects under 32 × 32 pixels, lags behind that of large ones. To address this issue, innovative architectures have been designed and new datasets have been released. Still, the number of small objects in many datasets does not suffice for training. The advent of the generative adversarial networks (GANs) opens up a new data augmentation possibility for training architectures without the costly task of annotating huge datasets for small objects. In this paper, we propose a full pipeline for data augmentation for small object detection which combines a GAN-based object generator with techniques of object segmentation, image inpainting, and image blending to achieve high-quality synthetic data. The main component of our pipeline is DS-GAN, a novel GAN-based architecture that generates realistic small objects from larger ones. Experimental results show that our overall data augmentation method improves the performance of state-of-the-art models up to 11.9% APs@.5 on UAVDT and by 4.7% APs@.5 on iSAID, both for the small objects subset and for a scenario where the number of training instances is limited.
科研通智能强力驱动
Strongly Powered by AbleSci AI