Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Probabilistic Models from Generator Latent Spaces with Hat EBM

About

This work proposes a method for using any generator network as the foundation of an Energy-Based Model (EBM). Our formulation posits that observed images are the sum of unobserved latent variables passed through the generator network and a residual random variable that spans the gap between the generator output and the image manifold. One can then define an EBM that includes the generator as part of its forward pass, which we call the Hat EBM. The model can be trained without inferring the latent variables of the observed data or calculating the generator Jacobian determinant. This enables explicit probabilistic modeling of the output distribution of any type of generator network. Experiments show strong performance of the proposed method on (1) unconditional ImageNet synthesis at 128x128 resolution, (2) refining the output of existing generators, and (3) learning EBMs that incorporate non-probabilistic generators. Code and pretrained models to reproduce our results are available at https://github.com/point0bar1/hat-ebm.

Mitch Hill, Erik Nijkamp, Jonathan Mitchell, Bo Pang, Song-Chun Zhu• 2022

Related benchmarks

TaskDatasetResultRank
Unconditional Image GenerationCIFAR-10
FID19.3
171
Out-of-Distribution DetectionSVHN (test)
AUROC0.92
48
Out-of-Distribution DetectionCelebA (test)
AUROC94
36
Out-of-Distribution DetectionCIFAR-100 (test)
Average AUROC87
27
Unconditional image synthesisCIFAR-10 32x32 (test)
FID19.3
12
Image GenerationCelebA 64x64 Unconditional (test)
FID11.57
11
Unconditional image synthesisImageNet 128x128 (test)
FID29.37
6
Showing 7 of 7 rows

Other info

Code

Follow for update