Learning a Universal Template for Few-shot Dataset Generalization
About
Few-shot dataset generalization is a challenging variant of the well-studied few-shot classification problem where a diverse training set of several datasets is given, for the purpose of training an adaptable model that can then learn classes from new datasets using only a few examples. To this end, we propose to utilize the diverse training set to construct a universal template: a partial model that can define a wide array of dataset-specialized models, by plugging in appropriate components. For each new few-shot classification problem, our approach therefore only requires inferring a small number of parameters to insert into the universal template. We design a separate network that produces an initialization of those parameters for each given task, and we then fine-tune its proposed initialization via a few steps of gradient descent. Our approach is more parameter-efficient, scalable and adaptable compared to previous methods, and achieves the state-of-the-art on the challenging Meta-Dataset benchmark.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot classification | Meta-Dataset (test) | Omniglot61.6 | 48 | |
| Few-shot classification | Meta-Dataset | Avg Seen Accuracy76.2 | 45 | |
| Few-shot classification | Meta-Dataset 1.0 (test) | ILSVRC Accuracy51.8 | 42 | |
| Few-shot Image Classification | Aircraft (test) | Mean Accuracy82.8 | 28 | |
| Few-shot classification | MNIST Meta-Dataset (test) | Mean Accuracy96.2 | 10 | |
| Few-shot classification | ImageNet Meta-Dataset (test) | Mean Accuracy58.6 | 10 | |
| Few-shot classification | CIFAR-10 Meta-Dataset (test) | Mean Accuracy75.4 | 10 | |
| Few-shot classification | Traffic Sign Meta-Dataset (test) | Mean Accuracy63 | 10 | |
| Few-shot classification | MSCOCO Meta-Dataset (test) | Mean Accuracy0.528 | 10 | |
| Few-shot classification | CIFAR100 (test) | Accuracy62 | 10 |