Learning Probabilistic Models from Generator Latent Spaces with Hat EBM

About

This work proposes a method for using any generator network as the foundation of an Energy-Based Model (EBM). Our formulation posits that observed images are the sum of unobserved latent variables passed through the generator network and a residual random variable that spans the gap between the generator output and the image manifold. One can then define an EBM that includes the generator as part of its forward pass, which we call the Hat EBM. The model can be trained without inferring the latent variables of the observed data or calculating the generator Jacobian determinant. This enables explicit probabilistic modeling of the output distribution of any type of generator network. Experiments show strong performance of the proposed method on (1) unconditional ImageNet synthesis at 128x128 resolution, (2) refining the output of existing generators, and (3) learning EBMs that incorporate non-probabilistic generators. Code and pretrained models to reproduce our results are available at https://github.com/point0bar1/hat-ebm.

Mitch Hill, Erik Nijkamp, Jonathan Mitchell, Bo Pang, Song-Chun Zhu• 2022

Related benchmarks

Task	Dataset	Result
Unconditional Image Generation	CIFAR-10	FID19.3	280
Out-of-Distribution Detection	SVHN (test)	AUROC0.92	72
Out-of-Distribution Detection	CelebA (test)	AUROC94	36
Out-of-Distribution Detection	CIFAR-100 (test)	Average AUROC87	27
Unconditional image synthesis	CIFAR-10 32x32 (test)	FID19.3	12
Image Generation	CelebA 64x64 Unconditional (test)	FID11.57	11
Unconditional image synthesis	ImageNet 128x128 (test)	FID29.37	6

Showing 7 of 7 rows

Other info

Code

Follow for update

@wizwand_team Discord