Estimating Generalization under Distribution Shifts via Domain-Invariant Representations

About

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under distribution shift, without supervision. To do so, we use a set of domain-invariant predictors as a proxy for the unknown, true target labels. Since the error of the resulting risk estimate depends on the target risk of the proxy model, we study generalization of domain-invariant representations and show that the complexity of the latent representation has a significant influence on the target risk. Empirically, our approach (1) enables self-tuning of domain adaptation models, and (2) accurately estimates the target error of given models under distribution shift. Other applications include model selection, deciding early stopping and error detection.

Ching-Yao Chuang, Antonio Torralba, Stefanie Jegelka• 2020

Related benchmarks

Task	Dataset	Result
OOD accuracy prediction	iWildCam-WILDS	MAE9.4	16
Accuracy Estimation	Amazon Review	Absolute Estimation Error0.021	10
Error detection	Digits	F1 Score84.4	10
Accuracy Estimation	Digits	Absolute Estimation Error0.085	10
Accuracy Estimation	Office-31	Absolute Estimation Error0.033	10
Accuracy Estimation	CIFAR10-C	Absolute Estimation Error5.2	10
Error detection	CIFAR10-C	F1 Score0.85	10
Error detection	Amazon Review	F1 Score0.434	10
Error detection	Office-31	F1 Score62.9	10
Error detection	iWILDCam	F1 Score77.3	10

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord