Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Learning by Predicting Noise

About

Convolutional neural networks provide visual features that perform remarkably well in many computer vision applications. However, training these networks requires significant amounts of supervision. This paper introduces a generic framework to train deep networks, end-to-end, with no supervision. We propose to fix a set of target representations, called Noise As Targets (NAT), and to constrain the deep features to align to them. This domain agnostic approach avoids the standard unsupervised learning issues of trivial solutions and collapsing of features. Thanks to a stochastic batch reassignment strategy and a separable square loss function, it scales to millions of images. The proposed approach produces representations that perform on par with state-of-the-art unsupervised methods on ImageNet and Pascal VOC.

Piotr Bojanowski, Armand Joulin• 2017

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy59.7
1453
Semantic segmentationPASCAL VOC 2012 (test)
mIoU48
1342
Object DetectionPASCAL VOC 2007 (test)
mAP56.8
821
ClassificationPASCAL VOC 2007 (test)
mAP (%)79.9
217
Semantic segmentationPascal VOC
mIoU0.371
172
Image ClassificationSTL-10--
109
Scene ClassificationPlaces-205 (val)
Top-1 Acc39.4
97
Object DetectionPascal VOC
mAP49.4
88
ClassificationPascal VOC
mAP65.3
27
Showing 9 of 9 rows

Other info

Follow for update