Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations

About

Neural network classifiers can largely rely on simple spurious features, such as backgrounds, to make predictions. However, even in these cases, we show that they still often learn core features associated with the desired attributes of the data, contrary to recent findings. Inspired by this insight, we demonstrate that simple last layer retraining can match or outperform state-of-the-art approaches on spurious correlation benchmarks, but with profoundly lower complexity and computational expenses. Moreover, we show that last layer retraining on large ImageNet-trained models can also significantly reduce reliance on background and texture information, improving robustness to covariate shift, after only minutes of training on a single GPU.

Polina Kirichenko, Pavel Izmailov, Andrew Gordon Wilson• 2022

Related benchmarks

Task	Dataset	Result
Sentiment Classification	SST2 (test)	Accuracy91.5	233
Image Classification	Waterbirds	Average Accuracy96.1	209
Image Classification	Waterbirds (test)	Worst-Group Accuracy92.9	127
Classification	CelebA (test)	--	92
Image Classification	CelebA (test)	--	82
Natural Language Inference	MultiNLI (test)	--	81
Fine grained classification	Stanford Cars	Accuracy82.74	74
Image Classification	CelebA	WG Score89.6	62
Classification	CivilComments (test)	--	51
Group Robustness	CivilComments-WILDS (test)	WG Accuracy48.2	40

Showing 10 of 78 rows

...

Other info

Follow for update

@wizwand_team Discord