Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Residual Flows for Invertible Generative Modeling

About

Flow-based generative models parameterize probability distributions through an invertible transformation and can be trained by maximum likelihood. Invertible residual networks provide a flexible family of transformations where only Lipschitz conditions rather than strict architectural constraints are needed for enforcing invertibility. However, prior work trained invertible residual networks for density estimation by relying on biased log-density estimates whose bias increased with the network's expressiveness. We give a tractable unbiased estimate of the log density using a "Russian roulette" estimator, and reduce the memory required during training by using an alternative infinite series for the gradient. Furthermore, we improve invertible residual blocks by proposing the use of activation functions that avoid derivative saturation and generalizing the Lipschitz condition to induced mixed norms. The resulting approach, called Residual Flows, achieves state-of-the-art performance on density estimation amongst flow-based models, and outperforms networks that use coupling blocks at joint generative and discriminative modeling.

Ricky T. Q. Chen, Jens Behrmann, David Duvenaud, J\"orn-Henrik Jacobsen• 2019

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10 (test)
FID46.3
471
Unconditional Image GenerationCIFAR-10 (test)
FID46.37
216
Density EstimationCIFAR-10 (test)
Bits/dim3.28
134
Density EstimationImageNet 32x32 (test)
Bits per Sub-pixel4.01
66
Density EstimationImageNet 64x64 (test)
Bits Per Sub-Pixel3.757
62
Density EstimationMNIST (test)
NLL (bits/dim)0.97
56
Generative ModelingCIFAR-10
BPD3.28
46
Image GenerationMNIST--
44
Density EstimationCIFAR-10
bpd3.28
40
Unconditional Image GenerationCIFAR10
BPD3.28
33
Showing 10 of 27 rows

Other info

Follow for update