Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning

About

Recent self-supervised methods for image representation learning are based on maximizing the agreement between embedding vectors from different views of the same image. A trivial solution is obtained when the encoder outputs constant vectors. This collapse problem is often avoided through implicit biases in the learning architecture, that often lack a clear justification or interpretation. In this paper, we introduce VICReg (Variance-Invariance-Covariance Regularization), a method that explicitly avoids the collapse problem with a simple regularization term on the variance of the embeddings along each dimension individually. VICReg combines the variance term with a decorrelation mechanism based on redundancy reduction and covariance regularization, and achieves results on par with the state of the art on several downstream tasks. In addition, we show that incorporating our new variance term into other methods helps stabilize the training and leads to performance improvements.

Adrien Bardes, Jean Ponce, Yann LeCun• 2021

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU36.9
2731
Object DetectionCOCO 2017 (val)
AP39.4
2454
Image ClassificationImageNet-1k (val)
Top-1 Accuracy77.3
1453
Image ClassificationImageNet (val)--
1206
Instance SegmentationCOCO 2017 (val)--
1144
Semantic segmentationADE20K
mIoU32.7
936
Image ClassificationImageNet 1k (test)
Top-1 Accuracy71.5
798
Object DetectionCOCO (val)--
613
Image ClassificationFood-101--
494
Instance SegmentationCOCO (val)
APmk36.5
472
Showing 10 of 80 rows
...

Other info

Follow for update