Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference
About
Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated meta-learning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for more realistic and practical settings of few-shot image classification. To this end, we explore few-shot learning from the perspective of neural network architecture, as well as a three stage pipeline of network updates under different data supplies, where unsupervised external data is considered for pre-training, base categories are used to simulate few-shot tasks for meta-training, and the scarcely labelled data of an novel task is taken for fine-tuning. We investigate questions such as: (1) How pre-training on external data benefits FSL? (2) How state-of-the-art transformer architectures can be exploited? and (3) How fine-tuning mitigates domain shift? Ultimately, we show that a simple transformer-based pipeline yields surprisingly good performance on standard benchmarks such as Mini-ImageNet, CIFAR-FS, CDFSL and Meta-Dataset. Our code and demo are available at https://hushell.github.io/pmf.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot classification | Mini-ImageNet | 1-shot Acc95.3 | 175 | |
| Few-shot classification | CUB (test) | Accuracy97 | 145 | |
| Few-shot Image Classification | miniImageNet (test) | -- | 111 | |
| Few-shot classification | MiniImagenet | 5-way 5-shot Accuracy98.4 | 98 | |
| Few-shot classification | Mini-Imagenet 5-way 5-shot | Accuracy98 | 87 | |
| Few-shot Image Classification | tieredImageNet (test) | Accuracy97.3 | 86 | |
| Few-shot classification | CIFAR-FS | Accuracy (5-way 1-shot)84.3 | 58 | |
| Few-shot classification | ChestX (test) | Accuracy32.1 | 46 | |
| Few-shot classification | Meta-Dataset | Avg Seen Accuracy85 | 45 | |
| Few-shot classification | Meta-Dataset 1.0 (test) | ILSVRC Accuracy77.02 | 42 |