Training Debiased Subnetworks with Contrastive Weight Pruning

About

Neural networks are often biased toward spuriously correlated features, which provide misleading statistical evidence that does not generalize. This raises an interesting question: "Does an optimal unbiased functional subnetwork exist in a severely biased network? If so, how can such a subnetwork be extracted?" While empirical evidence for the existence of such unbiased subnetworks has accumulated, these observations rely mainly on guidance from ground-truth unbiased samples. It therefore remains unexplored how to discover the optimal subnetworks from biased training datasets in practice. To address this, we first present a theoretical insight that points to potential limitations of existing algorithms in exploring unbiased subnetworks in the presence of strong spurious correlations. We then elucidate the importance of bias-conflicting samples for structure learning. Motivated by these observations, we propose a Debiased Contrastive Weight Pruning (DCWP) algorithm, which probes unbiased subnetworks without expensive group annotations. Experimental results demonstrate that our approach significantly outperforms state-of-the-art debiasing methods despite a considerable reduction in the number of parameters.

Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye • 2022
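
To make the notion of a "subnetwork" in the abstract concrete, here is a minimal sketch of how a pruned subnetwork can be represented as a binary mask over a network's weights. It uses plain magnitude pruning as a stand-in criterion; this is not the DCWP algorithm, whose pruning scores are learned and are not specified on this page. The function names `magnitude_mask` and `apply_mask` and the `keep_ratio` parameter are hypothetical, chosen only for illustration.

```python
def magnitude_mask(weights, keep_ratio):
    """Return a 0/1 mask that keeps the largest-magnitude weights.

    Assumption: magnitude is used here only as a simple placeholder
    criterion; DCWP itself learns which weights to keep.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    n_keep = max(1, round(keep_ratio * len(weights)))
    # Indices sorted by descending absolute value; ties broken by position.
    order = sorted(range(len(weights)), key=lambda i: (-abs(weights[i]), i))
    kept = set(order[:n_keep])
    return [1 if i in kept else 0 for i in range(len(weights))]


def apply_mask(weights, mask):
    """Zero out pruned weights; the surviving weights form the subnetwork."""
    return [w * m for w, m in zip(weights, mask)]


# Toy flat weight vector (illustrative values, not from the paper).
weights = [0.8, -0.05, 0.3, -1.2, 0.01, 0.4]
mask = magnitude_mask(weights, keep_ratio=0.5)
pruned = apply_mask(weights, mask)
```

With `keep_ratio=0.5`, half of the six toy weights survive. The "reduction in the number of parameters" mentioned in the abstract corresponds to the fraction of zeros in such a mask.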

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Blond Hair classification | CelebA (test) | Average Group Accuracy | 95.89 | 30 |
| Image Classification | Colored MNIST unbiased (test) | Accuracy | 98.02 | 28 |
| Image Classification | CIFAR10-C unbiased (test) | Accuracy | 56.17 | 28 |
| Image Classification | BFFHQ bias-conflicting (test) | Accuracy | 60.35 | 17 |
| Image Classification | CMNIST 0.5% bias ratio unbiased (test) | Accuracy | 85.16 | 17 |
| Image Classification | Bar (test) | Accuracy (1.0% Bias) | 69.63 | 17 |
| Image Classification | BFFHQ 0.5% bias ratio unbiased (test) | Accuracy (Minority) | 57.33 | 11 |
| Image Classification | CIFAR10C 0.5% bias ratio unbiased (test) | Accuracy | 31.27 | 11 |
| Image Classification | CIFAR10C 5% bias ratio unbiased (test) | Accuracy | 52.86 | 11 |
| Classification | BFFHQ (test) | Accuracy @ Thresh 0.5 | 0.6408 | 11 |
Showing 10 of 15 rows
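
Several of the metrics above are group-wise rather than overall accuracies. As a rough sketch of how an "Average Group Accuracy" is commonly computed, the snippet below takes the unweighted mean of per-group accuracies. The exact group definition used for the CelebA result is not stated on this page, so the grouping here (a hypothetical `groups` label per sample, such as a combination of class and bias attribute) is an assumption, as are the function names.

```python
from collections import defaultdict


def group_accuracies(labels, preds, groups):
    """Accuracy computed separately for each group identifier."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for y, p, g in zip(labels, preds, groups):
        total[g] += 1
        correct[g] += int(y == p)
    return {g: correct[g] / total[g] for g in total}


def average_group_accuracy(labels, preds, groups):
    """Unweighted mean of per-group accuracies (assumed definition)."""
    per_group = group_accuracies(labels, preds, groups)
    return sum(per_group.values()) / len(per_group)


# Toy data: a large "majority" group and a small "minority" group.
labels = [1, 1, 1, 1, 0, 0]
preds = [1, 1, 1, 1, 0, 1]
groups = ["majority"] * 4 + ["minority"] * 2

per_group = group_accuracies(labels, preds, groups)
avg_group_acc = average_group_accuracy(labels, preds, groups)
overall_acc = sum(int(y == p) for y, p in zip(labels, preds)) / len(labels)
```

In this toy case the overall accuracy is 5/6 while the average group accuracy is 0.75, because the minority group counts as much as the majority group. This is why group-wise metrics are preferred when evaluating debiasing methods.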
