透视失真
单应性
计算机科学
人工智能
卷积神经网络
透视图(图形)
失真(音乐)
管道(软件)
模式识别(心理学)
计算机视觉
比例(比率)
图像(数学)
数学
放大器
计算机网络
统计
投射试验
带宽(计算)
射影空间
程序设计语言
物理
量子力学
作者
Syed Ammar Abbas,Sibt ul Hussain
出处
期刊:Cornell University - arXiv
日期:2017-01-01
被引量:6
标识
DOI:10.48550/arxiv.1709.03524
摘要
Removing perspective distortion from hand held camera captured document images is one of the primitive tasks in document analysis, but unfortunately, no such method exists that can reliably remove the perspective distortion from document images automatically. In this paper, we propose a convolutional neural network based method for recovering homography from hand-held camera captured documents. Our proposed method works independent of document's underlying content and is trained end-to-end in a fully automatic way. Specifically, this paper makes following three contributions: Firstly, we introduce a large scale synthetic dataset for recovering homography from documents images captured under different geometric and photometric transformations; secondly, we show that a generic convolutional neural network based architecture can be successfully used for regressing the corners positions of documents captured under wild settings; thirdly, we show that L1 loss can be reliably used for corners regression. Our proposed method gives state-of-the-art performance on the tested datasets, and has potential to become an integral part of document analysis pipeline.
科研通智能强力驱动
Strongly Powered by AbleSci AI