A Universal Representation Transformer Layer for Few-Shot Image Classification
About
Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources. This problem has seen growing interest and has inspired the development of benchmarks such as Meta-Dataset. A key challenge in this multi-domain setting is to effectively integrate the feature representations from the diverse set of training domains. Here, we propose a Universal Representation Transformer (URT) layer, that meta-learns to leverage universal features for few-shot classification by dynamically re-weighting and composing the most appropriate domain-specific representations. In experiments, we show that URT sets a new state-of-the-art result on Meta-Dataset. Specifically, it achieves top-performance on the highest number of data sources compared to competing methods. We analyze variants of URT and present a visualization of the attention score heatmaps that sheds light on how the model performs cross-domain generalization. Our code is available at https://github.com/liulu112601/URT.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot classification | Meta-Dataset (test) | Omniglot94.4 | 48 | |
| Few-shot classification | Meta-Dataset | Avg Seen Accuracy77.4 | 45 | |
| Few-shot classification | Meta-Dataset 1.0 (test) | ILSVRC Accuracy56.8 | 42 | |
| Few-shot Image Classification | Meta-Dataset (test) | Omniglot Accuracy96 | 40 | |
| Few-shot Image Classification | Aircraft (test) | Mean Accuracy85.8 | 28 | |
| Few-shot classification | Quick Draw Meta-Dataset (test) | Mean Accuracy82.4 | 10 | |
| Few-shot classification | Omniglot Meta-Dataset (test) | Mean Accuracy94.2 | 10 | |
| Few-shot classification | Textures Meta-Dataset (test) | Mean Accuracy71.6 | 10 | |
| Few-shot classification | ImageNet Meta-Dataset (test) | Mean Accuracy56.8 | 10 | |
| Few-shot classification | Birds Meta-Dataset (test) | Mean Accuracy76.2 | 10 |