Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One

About

We propose to reinterpret a standard discriminative classifier of p(y|x) as an energy based model for the joint distribution p(x,y). In this setting, the standard class probabilities can be easily computed as well as unnormalized values of p(x) and p(x|y). Within this framework, standard discriminative architectures may beused and the model can also be trained on unlabeled data. We demonstrate that energy based training of the joint distribution improves calibration, robustness, andout-of-distribution detection while also enabling our models to generate samplesrivaling the quality of recent GAN approaches. We improve upon recently proposed techniques for scaling up the training of energy based models and presentan approach which adds little overhead compared to standard classification training. Our approach is the first to achieve performance rivaling the state-of-the-artin both generative and discriminative learning within one hybrid model.

Will Grathwohl, Kuan-Chieh Wang, J\"orn-Henrik Jacobsen, David Duvenaud, Mohammad Norouzi, Kevin Swersky• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR10 (test)
Accuracy92.9
585
Image GenerationCIFAR-10 (test)
FID38.4
471
Unconditional Image GenerationCIFAR-10
FID38.4
171
Out-of-Distribution DetectionCIFAR-10 vs SVHN (test)
AUROC0.89
101
Out-of-Distribution DetectionCIFAR-10 vs CIFAR-100 (test)
AUROC87
93
Conditional Image GenerationCIFAR10 (test)
Fréchet Inception Distance38.4
66
Out-of-Distribution DetectionCIFAR100 (test)
AUROC87
57
Out-of-Distribution DetectionCIFAR-10 (ID) vs Celeb-A (OOD)
AUROC79
55
Out-of-Distribution DetectionSVHN (test)
AUROC0.89
48
Out-of-Distribution DetectionCelebA (test)
AUROC79
36
Showing 10 of 19 rows

Other info

Follow for update