Towards Conceptual Compression
About
We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised learning. Furthermore, the possibility of restricting ourselves to storing only global information about an image allows us to achieve high quality 'conceptual compression'.
Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra• 2016
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Density Estimation | ImageNet 32x32 (test) | Bits per Sub-pixel4.1 | 66 | |
| Generative Modeling | CIFAR-10 (test) | NLL (bits/dim)3.58 | 62 | |
| Generative Modeling | CIFAR-10 | BPD3.58 | 46 | |
| Generative Modeling | ImageNet 32x32 downsampled | Bits Per Dimension4.4 | 24 | |
| Density Estimation | OMNIGLOT dynamically binarized (test) | NLL91 | 16 | |
| Density Estimation | ImageNet 32x32 (train) | Bits/dim4.35 | 3 | |
| Density Estimation | ImageNet 64x64 (train) | Bits/dim4.04 | 3 |
Showing 7 of 7 rows