Towards Conceptual Compression

About

We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised learning. Furthermore, the possibility of restricting ourselves to storing only global information about an image allows us to achieve high quality 'conceptual compression'.

Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra• 2016

Related benchmarks

Task	Dataset	Result
Density Estimation	ImageNet 32x32 (test)	Bits per Sub-pixel4.1	69
Generative Modeling	CIFAR-10	BPD3.58	68
Generative Modeling	CIFAR-10 (test)	NLL (bits/dim)3.58	62
Generative Modeling	ImageNet 32x32 downsampled	Bits Per Dimension4.4	24
Density Estimation	OMNIGLOT dynamically binarized (test)	NLL91	16
Likelihood-based generative modeling	Omniglot	NLL (nats)91	10
Density Estimation	ImageNet 32x32 (train)	Bits/dim4.35	3
Density Estimation	ImageNet 64x64 (train)	Bits/dim4.04	3

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord