Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Revisiting Self-Supervised Visual Representation Learning

About

Unsupervised visual representation learning remains a largely unsolved problem in computer vision research. Among a big body of recently proposed approaches for unsupervised learning of visual representations, a class of self-supervised techniques achieves superior performance on many challenging benchmarks. A large number of the pretext tasks for self-supervised learning have been studied, but other important aspects, such as the choice of convolutional neural networks (CNN), has not received equal attention. Therefore, we revisit numerous previously proposed self-supervised models, conduct a thorough large scale study and, as a result, uncover multiple crucial insights. We challenge a number of common practices in selfsupervised visual representation learning and observe that standard recipes for CNN design do not always translate to self-supervised representation learning. As part of our study, we drastically boost the performance of previously proposed techniques and outperform previously published state-of-the-art results by a large margin.

Alexander Kolesnikov, Xiaohua Zhai, Lucas Beyer• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy54
1453
Image ClassificationImageNet (val)
Top-1 Acc76
1206
Image ClassificationImageNet 1k (test)
Top-1 Accuracy55.4
798
Image ClassificationImageNet (val)
Top-1 Accuracy53.7
354
Image ClassificationImageNet (10% labels)--
98
Image ClassificationPlaces--
72
Image ClassificationPlaces205 (val)
Top-1 Accuracy61.6
68
Image ClassificationPlaces 205-way (test)
Top-1 Accuracy45.9
38
Image ClassificationILSVRC 12
Top-1 Acc18.2
31
Image ClassificationImageNet 1% labels ILSVRC-2012 (val)
Top-5 Acc45.11
30
Showing 10 of 13 rows

Other info

Code

Follow for update