Neural Computation
December 2010, Vol. 22, No. 12, Pages 3207-3220
(doi: 10.1162/NECO_a_00052)
Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
Article PDF (555.86 KB)
Abstract
Good old online backpropagation for plain multilayer perceptrons yields a very low 0.35% error rate on the MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images to avoid overfitting, and graphics cards to greatly speed up learning.