鉴别器
计算机科学
人工智能
阈值
预处理器
图像(数学)
相似性(几何)
编码(集合论)
模式识别(心理学)
生成对抗网络
历史文献
发电机(电路理论)
计算机视觉
电信
探测器
功率(物理)
物理
集合(抽象数据类型)
量子力学
程序设计语言
作者
Rajonya De,Anuran Chakraborty,Ram Sarkar
出处
期刊:IEEE Signal Processing Letters
[Institute of Electrical and Electronics Engineers]
日期:2020-01-01
卷期号:27: 1090-1094
被引量:23
标识
DOI:10.1109/lsp.2020.3003828
摘要
For document image analysis, image binarization is an important preprocessing step. Also, binarization can help in improving the readability of old and historical manuscripts. Such documents are generally degraded due to various reasons such as bleed-through, faded ink, or stains. Achieving good binarization performance on these documents is a challenging task. In this letter, a deep learning based model for document image binarization has been proposed, comprising a Dual Discriminator Generative Adversarial Network (DD-GAN) which uses Focal Loss as generator loss. The DD-GAN consists of two discriminator networks - one looks for the global similarity i.e. on the whole image, and another one explores the image in small patches i.e. local similarity. At the final stage, simple thresholding is performed on the generated images. The method has been tested on five recent DIBCO datasets. It has been found that the method is robust and it provides results comparable with state-of-the-art methods. The code for this letter is available at https://github.com/anuran-Chakraborty/BinarizationDualDiscriminatorGAN.
科研通智能强力驱动
Strongly Powered by AbleSci AI