Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Representation Learning via Invariant Causal Mechanisms

About

Self-supervised learning has emerged as a strategy to reduce the reliance on costly supervised signal by pretraining representations only using unlabeled data. These methods combine heuristic proxy classification tasks with data augmentations and have achieved significant success, but our theoretical understanding of this success remains limited. In this paper we analyze self-supervised representation learning using a causal framework. We show how data augmentations can be more effectively utilized through explicit invariance constraints on the proxy classifiers employed during pretraining. Based on this, we propose a novel self-supervised objective, Representation Learning via Invariant Causal Mechanisms (ReLIC), that enforces invariant prediction of proxy targets across augmentations through an invariance regularizer which yields improved generalization guarantees. Further, using causality we generalize contrastive learning, a particular kind of self-supervised method, and provide an alternative theoretical explanation for the success of these methods. Empirically, ReLIC significantly outperforms competing methods in terms of robustness and out-of-distribution generalization on ImageNet, while also significantly outperforming these methods on Atari achieving above human-level performance on $51$ out of $57$ games.

Jovana Mitrovic, Brian McWilliams, Jacob Walker, Lars Buesing, Charles Blundell• 2020

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet 1k (test)
Top-1 Accuracy74.8
798
Image ClassificationImageNet (test)
Top-1 Acc74.8
235
Image ClassificationImageNet-Sketch (test)
Top-1 Acc0.091
132
Image ClassificationImageNet-C (test)
mCE (Mean Corruption Error)70.8
110
Image ClassificationImageNet-R (test)--
105
Image ClassificationImageNet matched frequency V2 (test)
Top-1 Accuracy63.1
62
Image ClassificationImageNetV2 Threshold 0.7 (test)
Top-1 Acc72.3
8
Image ClassificationImageNetV2 Top Images Ti (test)
Top-1 Acc77.7
8
Image ClassificationImageNet-C Average of 15 corruptions (test)
Top-1 Acc44.5
5
Image ClassificationObjectNet latest (test)
Top-1 Acc23.8
5
Showing 10 of 10 rows

Other info

Follow for update