Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Billion-scale semi-supervised learning for image classification

About

This paper presents a study of semi-supervised learning with large convolutional networks. We propose a pipeline, based on a teacher/student paradigm, that leverages a large collection of unlabelled images (up to 1 billion). Our main goal is to improve the performance for a given target architecture, like ResNet-50 or ResNext. We provide an extensive analysis of the success factors of our approach, which leads us to formulate some recommendations to produce high-accuracy models for image classification with semi-supervised learning. As a result, our approach brings important gains to standard architectures for image, video and fine-grained classification. For instance, by leveraging one billion unlabelled images, our learned vanilla ResNet-50 achieves 81.2% top-1 accuracy on the ImageNet benchmark.

I. Zeki Yalniz, Herv\'e J\'egou, Kan Chen, Manohar Paluri, Dhruv Mahajan• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1k (val)
Top-1 Accuracy84.8
1453
Image ClassificationImageNet (val)
Top-1 Acc81.2
1206
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy84.8
536
Image ClassificationImageNet
Top-1 Accuracy84.8
429
Image ClassificationImageNet ILSVRC-2012 (val)
Top-1 Accuracy84.8
405
Video RecognitionKinetics (val)
Top-1 Accuracy76.7
36
Instance SegmentationHabitat Gibson Generalization 1.0 (val)
Mask AP5031.89
10
Object DetectionHabitat Gibson Specialization 1.0 (val)
Box AP5034.11
10
Instance SegmentationHabitat Gibson Specialization 1.0 (val)
Mask AP5031.23
10
Object DetectionHabitat Gibson Generalization 1.0 (val)
AP50 (Box)33.41
10
Showing 10 of 10 rows

Other info

Follow for update