计算机科学
人工智能
生成语法
对抗制
生成对抗网络
模式识别(心理学)
鉴别器
图像(数学)
生成模型
机器学习
深度学习
人工神经网络
作者
Chris Tensmeyer,Mike Brodie,Daniel Saunders,Tony Martinez
出处
期刊:International Conference on Document Analysis and Recognition
日期:2019-09-01
卷期号:: 172-177
被引量:5
标识
DOI:10.1109/icdar.2019.00036
摘要
One of the limitations for using Deep Learning models to solve binarization tasks is that there is a lack of large quantities of labeled data available to train such models. Efforts to create synthetic data for binarization mostly rely on heuristic image processing techniques and generally lack realism. In this work, we propose a method to produce realistic synthetic data using an adversarially trained image translation model. We extend the popular CycleGAN model to be conditioned on the ground truth binarization mask as it translates images from the domain of synthetic images to the domain of real images. For evaluation, we train deep networks on synthetic datasets produced in different ways and measure their performance on the DIBCO datasets. Compared to not pretraining, we reduce error by 13% on average, and compared to pretraining on unrealistic data, we reduce error by 6%. Visually, we show that DGT-CycleGAN model produces more realistic synthetic data than other models.
科研通智能强力驱动
Strongly Powered by AbleSci AI