Estimating Generalization under Distribution Shifts via Domain-Invariant Representations
About
When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under distribution shift, without supervision. To do so, we use a set of domain-invariant predictors as a proxy for the unknown, true target labels. Since the error of the resulting risk estimate depends on the target risk of the proxy model, we study generalization of domain-invariant representations and show that the complexity of the latent representation has a significant influence on the target risk. Empirically, our approach (1) enables self-tuning of domain adaptation models, and (2) accurately estimates the target error of given models under distribution shift. Other applications include model selection, deciding early stopping and error detection.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| OOD accuracy prediction | iWildCam-WILDS | MAE9.4 | 16 | |
| Accuracy Estimation | Amazon Review | Absolute Estimation Error0.021 | 10 | |
| Error detection | Digits | F1 Score84.4 | 10 | |
| Accuracy Estimation | Digits | Absolute Estimation Error0.085 | 10 | |
| Accuracy Estimation | Office-31 | Absolute Estimation Error0.033 | 10 | |
| Accuracy Estimation | CIFAR10-C | Absolute Estimation Error5.2 | 10 | |
| Error detection | CIFAR10-C | F1 Score0.85 | 10 | |
| Error detection | Amazon Review | F1 Score0.434 | 10 | |
| Error detection | Office-31 | F1 Score62.9 | 10 | |
| Error detection | iWILDCam | F1 Score77.3 | 10 |