Equivariant Contrastive Learning

About

In state-of-the-art self-supervised learning (SSL) pre-training produces semantically good representations by encouraging them to be invariant under meaningful transformations prescribed from human knowledge. In fact, the property of invariance is a trivial instance of a broader class called equivariance, which can be intuitively understood as the property that representations transform according to the way the inputs transform. Here, we show that rather than using only invariance, pre-training that encourages non-trivial equivariance to some transformations, while maintaining invariance to other transformations, can be used to improve the semantic quality of representations. Specifically, we extend popular SSL methods to a more general framework which we name Equivariant Self-Supervised Learning (E-SSL). In E-SSL, a simple additional pre-training objective encourages equivariance by predicting the transformations applied to the input. We demonstrate E-SSL's effectiveness empirically on several popular computer vision benchmarks, e.g. improving SimCLR to 72.5% linear probe accuracy on ImageNet. Furthermore, we demonstrate usefulness of E-SSL for applications beyond computer vision; in particular, we show its utility on regression problems in photonics science. Our code, datasets and pre-trained models are available at https://github.com/rdangovs/essl to aid further research in E-SSL.

Rumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin Solja\v{c}i\'c• 2021

Related benchmarks

Task	Dataset	Result
Image Classification	ImageNet V2	Top-1 Acc58.33	767
Image Classification	ImageNet-R	Top-1 Acc19.86	622
Image Classification	ImageNet-Sketch	Top-1 Accuracy19.23	491
Image Classification	ImageNet-C (test)	Defocus Blur Acc38.44	139
Image Classification	ImageNet-1K 1.0 (val)	Top-1 Acc75	45
Multi-Label Classification	CheXpert	AUROC84.88	22
Image Classification	ImageNet-C	Accuracy (Gaussian Noise)68.7	18
Segmentation	BraTS 2023	Dice84.6	11
Image Classification	ImageNet-P	Accuracy (Gaussian Noise)89.6	7

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord