计算机科学
人工智能
过程(计算)
卷积神经网络
机器学习
深度学习
可用性
合成数据
数据建模
图像(数学)
数据挖掘
模式识别(心理学)
数据库
人机交互
操作系统
作者
Md. Yearat Hossain,Md. Mahbub Hasan Rakib,Ifran Rahman Nijhum
标识
DOI:10.1145/3594409.3594430
摘要
Even though sophisticated deep learning methods are getting better and better day by day, still they rely on a large number of datasets. But it is not always possible to acquire large datasets for all kinds of problems. Though diffusion models are now popular for their creative applications, it is already proven that they can generate better realistic-looking synthetic images compared to Generative Adversarial Networks (GAN). GANs are a popular option for image synthesis that helps the data sampling process for datasets that have low amounts of data or imbalanced data. In our work, we have experimented with a pre-trained text-to-image generation diffusion model for generating datasets for two different classes of problems. These problems are two common problems that can get benefitted from deep learning-based solutions but the lack of datasets hampers the process. We used the diffusion model to generate synthetic images and used those images as the training and validation data for the problems we tried to solve. Then we tested the models with manually collected real-world data and demonstrated the performance of such a method comparatively. From our experiments, we found that the diffusion model can generate realistic images and is up to 50 times faster in data generation compared to the manual human process. Also, in our testing, we found that the Convolutional Neural Networks trained with these synthetic data can achieve up to 80% and 89% accuracy scores.
科研通智能强力驱动
Strongly Powered by AbleSci AI