
The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image

About

What can neural networks learn about the visual world when provided with only a single image as input? While any single image obviously cannot contain the multitudes of all existing objects, scenes, and lighting conditions, within the space of all 256^(3x224x224) possible 224x224 images it might still provide a strong prior for natural images. To analyze this 'augmented image prior' hypothesis, we develop a simple framework for training neural networks from scratch, using a single image and augmentations, with knowledge distillation from a supervised pretrained teacher. With this, we find the answer to the above question to be: 'surprisingly, a lot'. In quantitative terms, we find accuracies of 94%/74% on CIFAR-10/100, 69% on ImageNet, and, by extending this method to video and audio, 51% on Kinetics-400 and 84% on SpeechCommands. In extensive analyses spanning 13 datasets, we disentangle the effects of augmentations, choice of source data, and network architectures, and also provide qualitative evaluations that include lucid 'panda neurons' in networks that have never even seen one.

Yuki M. Asano, Aaqib Saeed • 2021
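
Concretely, the framework trains a randomly initialized student to match a frozen, supervised teacher on batches that consist of nothing but heavy augmentations of one source image. Below is a minimal PyTorch sketch of that setup; the specific augmentation choices, batch size, and temperature here are illustrative assumptions, not the authors' exact recipe.

```python
# Minimal sketch of single-image knowledge distillation (hyperparameters are assumed).
import torch
import torch.nn.functional as F
import torchvision.transforms as T

# Heavy augmentations turn one source image (a PIL image here) into an
# effectively unbounded stream of training crops.
augment = T.Compose([
    T.RandomResizedCrop(224, scale=(0.08, 1.0)),  # crop/scale range: illustrative
    T.RandomHorizontalFlip(),
    T.ColorJitter(0.4, 0.4, 0.4, 0.1),            # jitter strengths: illustrative
    T.ToTensor(),
])

def distill_step(student, teacher, source_image, batch_size=128,
                 temperature=4.0, device="cuda"):
    """One step: build a batch of augmented views of the single image and
    match the student's soft predictions to the frozen teacher's."""
    batch = torch.stack(
        [augment(source_image) for _ in range(batch_size)]
    ).to(device)
    with torch.no_grad():
        teacher_logits = teacher(batch)  # soft targets, e.g. over 1000 ImageNet classes
    student_logits = student(batch)
    # Standard distillation objective: KL divergence between
    # temperature-softened class distributions, scaled by temperature^2.
    return F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2
```

Training then simply repeats this step with an optimizer over the student's parameters; no labels, and no images beyond the single source image, are involved.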

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | ImageNet-1K (val) | Top-1 Accuracy | 66.2 | 1952 |
| Image Classification | CIFAR-100 (val) | Accuracy | 74.56 | 776 |
| Knowledge Distillation | CIFAR-100 (test) | -- | -- | 29 |
| Knowledge Distillation | CIFAR-10 (test) | Accuracy | 93.69 | 11 |
| Image Classification | ImageNet-100 (test) | Accuracy | 84.4 | 3 |
| Image Classification | Flowers-102 (test) | Accuracy | 81.5 | 3 |
