MNIST数据库
卷积神经网络
计算机科学
集合(抽象数据类型)
简单(哲学)
人工神经网络
建筑
人工智能
机器学习
情报检索
艺术
哲学
认识论
视觉艺术
程序设计语言
作者
Patrice Simard,Donald C. Steinkraus,John Platt
标识
DOI:10.1109/icdar.2003.1227801
摘要
Neural networks are a powerful technology forclassification of visual inputs arising from documents.However, there is a confusing plethora of different neuralnetwork methods that are used in the literature and inindustry. This paper describes a set of concrete bestpractices that document analysis researchers can use toget good results with neural networks. The mostimportant practice is getting a training set as large aspossible: we expand the training set by adding a newform of distorted data. The next most important practiceis that convolutional neural networks are better suited forvisual document tasks than fully connected networks. Wepropose that a simple do-it-yourself implementation ofconvolution with a flexible architecture is suitable formany visual document problems. This simpleconvolutional neural network does not require complexmethods, such as momentum, weight decay, structure-dependentlearning rates, averaging layers, tangent prop,or even finely-tuning the architecture. The end result is avery simple yet general architecture which can yieldstate-of-the-art performance for document analysis. Weillustrate our claims on the MNIST set of English digitimages.
科研通智能强力驱动
Strongly Powered by AbleSci AI