计算机科学
人工智能
模式识别(心理学)
图像(数学)
分割
图像分割
像素
计算机视觉
作者
Seokjun Kang,Brian Kenji Iwana,Seiichi Uchida
出处
期刊:International Conference on Document Analysis and Recognition
日期:2019-09-01
卷期号:: 675-680
被引量:6
标识
DOI:10.1109/icdar.2019.00113
摘要
In recent years, U-Net has achieved good results in various image processing tasks. However, conventional U-Nets need to be re-trained for individual tasks with enough amount of images with ground-truth. This requirement makes U-Net not applicable to tasks with small amounts of data. In this paper, we propose to use U-Nets, each of which is pre-trained to perform an existing image processing task, such as dilation, erosion, and histogram equalization. Then, to accomplish a specific image processing task, such as binarization of historical document images, the modular U-Nets are cascaded with inter-module skip connections and fine-tuned to the target task. We verified the proposed model using the Document Image Binarization Competition (DIBCO) 2017 dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI