MaCow: Masked Convolutional Generative Flow
About
Flow-based generative models, conceptually attractive due to tractability of both the exact log-likelihood computation and latent-variable inference, and efficiency of both training and sampling, has led to a number of impressive empirical successes and spawned many advanced variants and theoretical investigations. Despite their computational efficiency, the density estimation performance of flow-based generative models significantly falls behind those of state-of-the-art autoregressive models. In this work, we introduce masked convolutional generative flow (MaCow), a simple yet effective architecture of generative flow using masked convolution. By restricting the local connectivity in a small kernel, MaCow enjoys the properties of fast and stable training, and efficient sampling, while achieving significant improvements over Glow for density estimation on standard image benchmarks, considerably narrowing the gap to autoregressive models.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Density Estimation | CIFAR-10 (test) | Bits/dim3.16 | 134 | |
| Density Estimation | ImageNet 64x64 (test) | Bits Per Sub-Pixel3.69 | 62 | |
| Generative Modeling | CIFAR-10 | BPD3.16 | 46 | |
| Density Estimation | CIFAR-10 | bpd3.16 | 40 | |
| Unconditional Image Generation | CIFAR10 | BPD3.16 | 33 | |
| Unconditional Image Generation | ImageNet 64 | BPD3.69 | 22 | |
| Density Estimation | ImageNet 64 | Bits-per-dimension3.69 | 16 | |
| Generative Modeling | ImageNet 64x64 downsampled | Bits Per Dimension3.69 | 13 | |
| Density Estimation | ImageNet 32 x 32 | NLL (bits/dim)3.69 | 12 | |
| Density Estimation | CelebAHQ 256 x 256 5-bits | NLL (bits/dim)0.67 | 8 |