Optimal Representations for Covariate Shift

About

Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are guaranteed to be robust to any distribution shift that preserves the Bayes predictor, e.g., covariate shifts. Our objective has two components. First, a representation must remain discriminative for the task, i.e., some predictor must be able to simultaneously minimize the source and target risk. Second, the representation's marginal support needs to be the same across source and target. We make this practical by designing self-supervised objectives that only use unlabelled data and augmentations to train robust representations. Our objectives give insights into the robustness of CLIP, and further improve CLIP's representations to achieve SOTA results on DomainBed.

Yangjun Ruan, Yann Dubois, Chris J. Maddison• 2021

Related benchmarks

Task	Dataset	Result
Domain Generalization	PACS (test)	Average Accuracy64.8	225
Domain Generalization	DomainBed v1.0 (test)	Average Accuracy82.3	71
Image Classification	OfficeHome DomainBed suite (test)	Accuracy61	45
Classification	Cele+Asv leave-one-modality-out (val)	Accuracy (Vid Modality)61.3	39
Classification	Fakeavcele leave-one-modality-out (cross-val)	Accuracy (Vid)61.94	39
Classification	LAV-DF leave-one-modality-out cross-validation	Accuracy (Video)59.53	39
Domain Generalization	DomainNet DomainBed (test)	Clipart Accuracy46.1	37
Image Classification	DomainBed	PACS Accuracy81.9	33
Domain Generalization	PACS DomainBed (test)	Accuracy97.2	29
Domain Generalization	OfficeHome DomainBed (test)	Accuracy86.3	29

Showing 10 of 20 rows

Other info

Code

Follow for update

@wizwand_team Discord