Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Data-dependent Initializations of Convolutional Neural Networks

About

Convolutional Neural Networks spread through computer vision like a wildfire, impacting almost all visual tasks imaginable. Despite this, few researchers dare to train their models from scratch. Most work builds on one of a handful of ImageNet pre-trained models, and fine-tunes or adapts these for specific tasks. This is in large part due to the difficulty of properly initializing these networks from scratch. A small miscalibration of the initial weights leads to vanishing or exploding gradients, as well as poor convergence properties. In this work we present a fast and simple data-dependent initialization procedure, that sets the weights of a network such that all units in the network train at roughly the same rate, avoiding vanishing or exploding gradients. Our initialization matches the current state-of-the-art unsupervised or self-supervised pre-training methods on standard computer vision tasks, such as image classification and object detection, while being roughly three orders of magnitude faster. When combined with pre-training methods, our initialization significantly outperforms prior work, narrowing the gap between supervised and unsupervised pre-training.

Philipp Kr\"ahenb\"uhl, Carl Doersch, Jeff Donahue, Trevor Darrell• 2015

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU32.6
2040
Image ClassificationImageNet-1k (val)
Top-1 Accuracy24.5
1453
Semantic segmentationPASCAL VOC 2012 (test)
mIoU32.6
1342
Object DetectionPASCAL VOC 2007 (test)
mAP45.6
821
ClassificationPASCAL VOC 2007 (test)
mAP (%)56.6
217
Image ClassificationImageNet 2012 (val)
Top-1 Accuracy24.5
202
Scene ClassificationPlaces-205 (val)
Top-1 Acc27.1
97
Image ClassificationPlaces
Top-1 Accuracy27.1
72
Image ClassificationImageNet 1000 classes 16
Top-1 Acc24.5
70
Perceptual SimilarityBAPPS (val)
2AFC (Overall)66.6
39
Showing 10 of 11 rows

Other info

Follow for update