Multi-column Deep Neural Networks for Image Classification

About

Traditional methods of computer vision and machine learning cannot match human performance on tasks such as the recognition of handwritten digits or traffic signs. Our biologically plausible deep artificial neural network architectures can. Small (often minimal) receptive fields of convolutional winner-take-all neurons yield large network depth, resulting in roughly as many sparsely connected neural layers as found in mammals between retina and visual cortex. Only winner neurons are trained. Several deep neural columns become experts on inputs preprocessed in different ways; their predictions are averaged. Graphics cards allow for fast training. On the very competitive MNIST handwriting benchmark, our method is the first to achieve near-human performance. On a traffic sign recognition benchmark it outperforms humans by a factor of two. We also improve the state-of-the-art on a plethora of common image classification benchmarks.

Dan Cire\c{s}an, Ueli Meier, Juergen Schmidhuber• 2012

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR-10 (test)	--	3381
Image Classification	MNIST (test)	Accuracy99.77	894
Digit Classification	MNIST (test)	Error Rate0.23	94
Image Classification	CIFAR10	--	70
Image Classification	MNIST	Error Rate23	40
Image Classification	MNIST (test)	Error Rate0.23	31
Letter Recognition	NIST SD 19 letters* (26) (test)	Average Error Rate1.83	9
Object Classification	NORB (test)	Error Rate2.7	4
Digit Recognition	NIST SD 19 digits (10) (test)	Average Error Rate0.77	3
Image Classification	HWDB online 1.0	Error Rate5.61	2

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord