Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EnD: Entangling and Disentangling deep representations for bias correction

About

Artificial neural networks perform state-of-the-art in an ever-growing number of tasks, and nowadays they are used to solve an incredibly large variety of tasks. There are problems, like the presence of biases in the training data, which question the generalization capability of these models. In this work we propose EnD, a regularization strategy whose aim is to prevent deep models from learning unwanted biases. In particular, we insert an "information bottleneck" at a certain point of the deep neural network, where we disentangle the information about the bias, still letting the useful information for the training task forward-propagating in the rest of the model. One big advantage of EnD is that we do not require additional training complexity (like decoders or extra layers in the model), since it is a regularizer directly applied on the trained model. Our experiments show that EnD effectively improves the generalization on unbiased test sets, and it can be effectively applied on real-case scenarios, like removing hidden biases in the COVID-19 detection from radiographic images.

Enzo Tartaglione, Carlo Alberto Barbano, Marco Grangetto• 2021

Related benchmarks

TaskDatasetResultRank
Image ClassificationColored MNIST unbiased (test)
Accuracy98
28
Image ClassificationCIFAR10-C unbiased (test)
Accuracy43.67
28
Digit ClassificationBiased MNIST unbiased (test)
Accuracy96.02
24
Skin lesion classificationFitzpatrick-17k (test)
Accuracy0.868
17
Image ClassificationBFFHQ bias-conflicting (test)
Accuracy59.8
17
ClassificationBFFHQ (test)
Accuracy @ Thresh 0.50.5596
11
ClassificationWaterbirds 2.0 severity (test)
Accuracy61.26
10
ClassificationWaterbirds 5.0 severity (test)
Accuracy64.11
10
ClassificationWaterbirds severity 0.5 (test)
Accuracy58.47
10
Image ClassificationWaterbirds (test)
Avg Acc (0.5% Bias)58.47
10
Showing 10 of 24 rows

Other info

Follow for update