Learning to Self-Train for Semi-Supervised Few-Shot Classification
About
Few-shot classification (FSC) is challenging due to the scarcity of labeled training data (e.g. only one labeled data point per class). Meta-learning has shown to achieve promising results by learning to initialize a classification model for FSC. In this paper we propose a novel semi-supervised meta-learning method called learning to self-train (LST) that leverages unlabeled data and specifically meta-learns how to cherry-pick and label such unsupervised data to further improve performance. To this end, we train the LST model through a large number of semi-supervised few-shot tasks. On each task, we train a few-shot model to predict pseudo labels for unlabeled data, and then iterate the self-training steps on labeled and pseudo-labeled data with each step followed by fine-tuning. We additionally learn a soft weighting network (SWN) to optimize the self-training weights of pseudo labels so that better ones can contribute more to gradient descent optimization. We evaluate our LST method on two ImageNet benchmarks for semi-supervised few-shot classification and achieve large improvements over the state-of-the-art method. Code is at https://github.com/xinzheli1217/learning-to-self-train.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot classification | tieredImageNet (test) | Accuracy85.2 | 282 | |
| Few-shot Image Classification | Mini-Imagenet (test) | Accuracy78.7 | 235 | |
| Few-shot classification | Mini-ImageNet | 1-shot Acc70.1 | 175 | |
| 5-way Few-shot Classification | MiniImagenet | Accuracy (5-shot)78.7 | 150 | |
| 5-way Few-shot Classification | Mini-Imagenet (test) | -- | 141 | |
| 5-way Image Classification | tieredImageNet 5-way (test) | 1-shot Acc77.7 | 117 | |
| Few-shot classification | Mini-Imagenet 5-way 5-shot | Accuracy78.7 | 87 | |
| 5-way Few-shot Classification | tieredImageNet | Accuracy (1-shot)77.7 | 49 | |
| 5-way Classification | miniImageNet 5-way (test) | Accuracy (1-shot)70.1 | 47 | |
| Few-shot Image Classification | MiniImageNet 5-way 1-shot | Accuracy70.1 | 28 |