Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Probable Domain Generalization via Quantile Risk Minimization

About

Domain generalization (DG) seeks predictors which perform well on unseen test distributions by leveraging data drawn from multiple related training distributions or domains. To achieve this, DG is commonly formulated as an average- or worst-case problem over the set of possible domains. However, predictors that perform well on average lack robustness while predictors that perform well in the worst case tend to be overly-conservative. To address this, we propose a new probabilistic framework for DG where the goal is to learn predictors that perform well with high probability. Our key idea is that distribution shifts seen during training should inform us of probable shifts at test time, which we realize by explicitly relating training and test domains as draws from the same underlying meta-distribution. To achieve probable DG, we propose a new optimization problem called Quantile Risk Minimization (QRM). By minimizing the $\alpha$-quantile of predictor's risk distribution over domains, QRM seeks predictors that perform well with probability $\alpha$. To solve QRM in practice, we propose the Empirical QRM (EQRM) algorithm and provide: (i) a generalization bound for EQRM; and (ii) the conditions under which EQRM recovers the causal predictor as $\alpha \to 1$. In our experiments, we introduce a more holistic quantile-focused evaluation protocol for DG and demonstrate that EQRM outperforms state-of-the-art baselines on datasets from WILDS and DomainBed.

Cian Eastwood, Alexander Robey, Shashank Singh, Julius von K\"ugelgen, Hamed Hassani, George J. Pappas, Bernhard Sch\"olkopf• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationPACS
Overall Average Accuracy66.7
230
Domain GeneralizationPACS
Accuracy (Art)86.5
221
Out-of-Distribution Detection and GeneralizationCIFAR-10 ID LSUN-C semantic OOD & CIFAR-10-C covariate OOD
OOD Accuracy75.71
74
Domain GeneralizationOffice-Home
Average Accuracy67.5
63
Image ClassificationCMNIST (test)
Test Accuracy71.4
55
Out-of-Distribution Detection and GeneralizationCIFAR-10 ID SVHN semantic OOD CIFAR-10-C covariate OOD
OOD Accuracy75.71
38
Generalized OOD DetectionCIFAR-10 with Places365 (semantic OOD) and CIFAR-10-C (covariate OOD) (test)
OOD Accuracy75.71
38
Out-of-Distribution Detection and GeneralizationCIFAR-10 ID Textures semantic OOD CIFAR-10-C covariate OOD
OOD Accuracy75.71
38
Image ClassificationDomainBed v1.0 (test)
Average Accuracy64.1
36
Domain GeneralizationVLCS
Accuracy (L)63.7
27
Showing 10 of 22 rows

Other info

Code

Follow for update