DORO: Distributional and Outlier Robust Optimization

About

Many machine learning tasks involve subpopulation shift, where the testing data distribution is a subpopulation of the training distribution. For such settings, a line of recent work has proposed a variant of empirical risk minimization (ERM) known as distributionally robust optimization (DRO). In this work, we apply DRO to real, large-scale tasks with subpopulation shift, and observe that DRO performs relatively poorly and is, moreover, severely unstable. We identify one direct cause of this phenomenon: the sensitivity of DRO to outliers in the datasets. To resolve this issue, we propose the DORO framework, for Distributional and Outlier Robust Optimization. At the core of this approach is a refined risk function that prevents DRO from overfitting to potential outliers. We instantiate DORO for the Cressie-Read family of Rényi divergence, and delve into two specific instances of this family: CVaR and $\chi^2$-DRO. We theoretically prove the effectiveness of the proposed method, and empirically show that DORO improves the performance and stability of DRO in experiments on large modern datasets, thereby positively addressing the open question raised by Hashimoto et al. (2018).

Runtian Zhai, Chen Dan, J. Zico Kolter, Pradeep Ravikumar • 2021
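
The abstract names CVaR as one instance of DORO but does not spell out the refined risk. The sketch below is one plausible reading, not the authors' implementation: it assumes the refined risk discards an assumed fraction `eps` of the highest-loss samples (treated as potential outliers) before averaging the worst `alpha` fraction of what remains. The function names `cvar` and `cvar_doro`, the parameters `alpha` and `eps`, and the rounding choices are all illustrative assumptions.

```python
import numpy as np


def cvar(losses, alpha):
    """Empirical CVaR: mean of the worst alpha-fraction of per-sample losses."""
    ordered = np.sort(np.asarray(losses, dtype=float))[::-1]  # largest first
    k = max(1, int(np.ceil(alpha * len(ordered))))  # assumed rounding: ceil
    return float(ordered[:k].mean())


def cvar_doro(losses, alpha, eps):
    """Assumed outlier-robust variant: drop the eps-fraction of largest
    losses, then take the CVaR of the remaining samples."""
    ordered = np.sort(np.asarray(losses, dtype=float))[::-1]  # largest first
    n_drop = int(np.floor(eps * len(ordered)))  # assumed rounding: floor
    return cvar(ordered[n_drop:], alpha)


if __name__ == "__main__":
    # Six ordinary losses plus two extreme values standing in for outliers.
    batch = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 50.0, 100.0]
    print("CVaR:     ", cvar(batch, alpha=0.5))
    print("CVaR-DORO:", cvar_doro(batch, alpha=0.5, eps=0.25))
```

On this toy batch the plain CVaR is dominated by the two extreme losses, while the trimmed variant is driven by the hardest ordinary samples. That is the behaviour the abstract attributes to the refined risk. With `eps=0` the two functions coincide.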

Related benchmarks

Task	Dataset	Result
Image Classification	Waterbirds	WG Accuracy 77	283
Classification	CelebA	Avg Accuracy 89.6	197
Classification	CelebA (test)	Average Accuracy 89.6	92
Classification	Camelyon17	Accuracy 68	58
Classification	CivilComments (test)	Average Accuracy 90.1	51
Comment Classification	Civil Comments	Accuracy 90.1	30
Object Classification	Waterbirds (test)	Worst-Group Accuracy 77	22
Classification	CIFAR100	Avg Expected Loss 27.4	14
Classification	CIFAR-10 Imbalanced	Average Expected Loss 13.3	14
Classification	CIFAR10	Avg Loss 11.3	14

Showing 10 of 12 rows
