Optimal Representations for Covariate Shift
About
Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are guaranteed to be robust to any distribution shift that preserves the Bayes predictor, e.g., covariate shifts. Our objective has two components. First, a representation must remain discriminative for the task, i.e., some predictor must be able to simultaneously minimize the source and target risk. Second, the representation's marginal support needs to be the same across source and target. We make this practical by designing self-supervised objectives that only use unlabelled data and augmentations to train robust representations. Our objectives give insights into the robustness of CLIP, and further improve CLIP's representations to achieve SOTA results on DomainBed.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Domain Generalization | PACS (test) | Average Accuracy64.8 | 225 | |
| Domain Generalization | DomainBed v1.0 (test) | Average Accuracy82.3 | 71 | |
| Image Classification | OfficeHome DomainBed suite (test) | Accuracy61 | 45 | |
| Domain Generalization | DomainNet DomainBed (test) | Clipart Accuracy46.1 | 37 | |
| Image Classification | DomainBed | PACS Accuracy81.9 | 33 | |
| Domain Generalization | PACS DomainBed (test) | Accuracy97.2 | 29 | |
| Domain Generalization | OfficeHome DomainBed (test) | Accuracy86.3 | 29 | |
| Domain Generalization | PACS, VLCS, OfficeHome, and DomainNet (test) | PACS Accuracy90 | 28 | |
| Domain Generalization | VLCS DomainBed (test) | Average OOD Accuracy76.1 | 27 | |
| Domain Generalization | TerraIncognita DomainBed (test) | Accuracy76.5 | 26 |