Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Striving for Simplicity: The All Convolutional Net

About

Most modern convolutional neural networks (CNNs) used for object recognition are built using the same principles: Alternating convolution and max-pooling layers followed by a small number of fully connected layers. We re-evaluate the state of the art for object recognition from small images with convolutional networks, questioning the necessity of different components in the pipeline. We find that max-pooling can simply be replaced by a convolutional layer with increased stride without loss in accuracy on several image recognition benchmarks. Following this finding -- and building on other recent work for finding simple network structures -- we propose a new architecture that consists solely of convolutional layers and yields competitive or state of the art performance on several object recognition datasets (CIFAR-10, CIFAR-100, ImageNet). To analyze the network we introduce a new variant of the "deconvolution approach" for visualizing features learned by CNNs, which can be applied to a broader range of network structures than existing approaches.

Jost Tobias Springenberg, Alexey Dosovitskiy, Thomas Brox, Martin Riedmiller• 2014

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)--
3518
Image ClassificationCIFAR-10 (test)--
3381
Image ClassificationImageNet (val)--
1206
Image ClassificationCIFAR-10 (test)
Accuracy92.8
906
Image ClassificationCIFAR-100--
622
Image ClassificationCIFAR-10
Accuracy92.75
471
ClassificationCIFAR-100 (test)
Accuracy66.29
129
ExplainabilityImageNet (val)
Insertion37.7
104
Image ClassificationCIFAR-10 4,000 labels (test)
Test Error Rate23.33
57
Image ClassificationCIFAR-100 2009 (test)
Accuracy66.29
53
Showing 10 of 39 rows

Other info

Follow for update