Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples

About

This paper proposes a novel method of learning by predicting view assignments with support samples (PAWS). The method trains a model to minimize a consistency loss, which ensures that different views of the same unlabeled instance are assigned similar pseudo-labels. The pseudo-labels are generated non-parametrically, by comparing the representations of the image views to those of a set of randomly sampled labeled images. The distance between the view representations and labeled representations is used to provide a weighting over class labels, which we interpret as a soft pseudo-label. By non-parametrically incorporating labeled samples in this way, PAWS extends the distance-metric loss used in self-supervised methods such as BYOL and SwAV to the semi-supervised setting. Despite the simplicity of the approach, PAWS outperforms other semi-supervised methods across architectures, setting a new state-of-the-art for a ResNet-50 on ImageNet trained with either 10% or 1% of the labels, reaching 75.5% and 66.5% top-1 respectively. PAWS requires 4x to 12x less training than the previous best methods.

Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Armand Joulin, Nicolas Ballas, Michael Rabbat• 2021

Related benchmarks

Task	Dataset	Result
Image Classification	ImageNet 1% labeled	--	118
Image Classification	ImageNet (10% labels)	Top-1 Acc75.5	98
Image Classification	ImageNet 1k (10% labels)	Top-1 Acc75.5	92
Image Classification	CIFAR-10 4000 labels	--	68
Image Classification	CIFAR-10 4,000 labels (test)	--	62
Image Classification	ImageNet 1k (1%)	Top-1 Acc66.5	49
Classification	AID (test)	Top-1 Accuracy68	41
Image Classification	ImageNet 1.0 (10% labeled)	Accuracy79	33
Semi-supervised Image Classification	ImageNet (10% labeled)	Top-1 Accuracy79	32
Image-to-Text Retrieval	RSICD (val)	R@527.4	31

Showing 10 of 52 rows

Other info

Code

Follow for update

@wizwand_team Discord