Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Pixel Recurrent Neural Networks

About

Modeling the distribution of natural images is a landmark problem in unsupervised learning. This task requires an image model that is at once expressive, tractable and scalable. We present a deep neural network that sequentially predicts the pixels in an image along the two spatial dimensions. Our method models the discrete probability of the raw pixel values and encodes the complete set of dependencies in the image. Architectural novelties include fast two-dimensional recurrent layers and an effective use of residual connections in deep recurrent networks. We achieve log-likelihood scores on natural images that are considerably better than the previous state of the art. Our main results also provide benchmarks on the diverse ImageNet dataset. Samples generated from the model appear crisp, varied and globally coherent.

Aaron van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu• 2016

Related benchmarks

TaskDatasetResultRank
Image GenerationCIFAR-10 (test)
FID65.93
471
Unconditional Image GenerationCIFAR-10 unconditional
FID65.93
159
Image GenerationCIFAR10 32x32 (test)
FID65.9
154
Density EstimationCIFAR-10 (test)
Bits/dim3
134
Unconditional GenerationCIFAR-10 (test)
FID65.9
102
Density EstimationImageNet 32x32 (test)
Bits per Sub-pixel3.63
66
Anomaly DetectionMNIST (test)
AUC61.8
65
Generative ModelingCIFAR-10 (test)
NLL (bits/dim)3
62
Density EstimationImageNet 64x64 (test)
Bits Per Sub-Pixel3.57
62
Generative ModelingCIFAR-10
BPD3
46
Showing 10 of 46 rows

Other info

Follow for update