计算机科学
人工智能
模式识别(心理学)
指纹(计算)
作者
Xujun Peng,Chao Wang,Huaigu Cao
出处
期刊:International Conference on Document Analysis and Recognition
日期:2019-09-01
卷期号:: 45-50
被引量:11
标识
DOI:10.1109/icdar.2019.00017
摘要
Document binarization which separates text from background is a critical pre-processing step for many high level document analysis tasks. Conventional document binarization approaches tend to use hand-craft features and empirical rules to simulate the degradation process of document image and accomplish the binarization task. In this paper, we propose a deep learning framework where the probability of text areas is inferred through a multi-resolutional attention model, which is consequently fed into a convolutional conditional random field (ConvCRF) to obtain the final binarized document image. In the proposed approach, the features of degraded document image are learned by neural networks and the relations between text areas and backgrounds are inferred by ConvCRF, which avoids the dependence of domain knowledge from researchers and has more generalization capabilities. The experimental results on public datasets show that the proposed method has superior binarization performance than the existing state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI