Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Enhancing Multiple Reliability Measures via Nuisance-extended Information Bottleneck

About

In practical scenarios where training data is limited, many predictive signals in the data can be rather from some biases in data acquisition (i.e., less generalizable), so that one cannot prevent a model from co-adapting on such (so-called) "shortcut" signals: this makes the model fragile in various distribution shifts. To bypass such failure modes, we consider an adversarial threat model under a mutual information constraint to cover a wider class of perturbations in training. This motivates us to extend the standard information bottleneck to additionally model the nuisance information. We propose an autoencoder-based training to implement the objective, as well as practical encoder designs to facilitate the proposed hybrid discriminative-generative training concerning both convolutional- and Transformer-based architectures. Our experimental results show that the proposed scheme improves robustness of learned representations (remarkably without using any domain-specific knowledge), with respect to multiple challenging reliability measures. For example, our model could advance the state-of-the-art on a recent challenging OBJECTS benchmark in novelty detection by $78.4\% \rightarrow 87.2\%$ in AUROC, while simultaneously enjoying improved corruption, background and (certified) adversarial robustness. Code is available at https://github.com/jh-jeong/nuisance_ib.

Jongheon Jeong, Sihyun Yu, Hankook Lee, Jinwoo Shin• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-1K--
524
Image ClassificationImageNet-R--
474
Image GenerationCIFAR-10 (test)
FID12.6
471
Image ClassificationImageNet-Sketch--
360
Image ClassificationCINIC-10 (test)--
177
Image ClassificationImageNet-C
mCE57.5
103
Out-of-Distribution DetectionCIFAR-10 vs SVHN (test)
AUROC0.98
101
Out-of-Distribution DetectionCIFAR-10 vs CIFAR-100 (test)
AUROC86
93
Out-of-Distribution DetectionCIFAR-10 in-distribution LSUN out-of-distribution (test)
AUROC99
73
Image ClassificationCIFAR-10-C (test)--
61
Showing 10 of 25 rows

Other info

Code

Follow for update