Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
About
Few-shot learning is challenging for learning algorithms that learn each task in isolation and from scratch. In contrast, meta-learning learns from many related tasks a meta-learner that can learn a new task more accurately and faster with fewer examples, where the choice of meta-learners is crucial. In this paper, we develop Meta-SGD, an SGD-like, easily trainable meta-learner that can initialize and adapt any differentiable learner in just one step, on both supervised learning and reinforcement learning. Compared to the popular meta-learner LSTM, Meta-SGD is conceptually simpler, easier to implement, and can be learned more efficiently. Compared to the latest meta-learner MAML, Meta-SGD has a much higher capacity by learning to learn not just the learner initialization, but also the learner update direction and learning rate, all in a single meta-learning process. Meta-SGD shows highly competitive performance for few-shot learning on regression, classification, and reinforcement learning.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | Aircraft | Accuracy66.84 | 302 | |
| Few-shot classification | tieredImageNet (test) | Accuracy69.04 | 282 | |
| Few-shot Image Classification | Mini-Imagenet (test) | Accuracy68.84 | 235 | |
| 5-way Classification | miniImageNet (test) | Accuracy64.03 | 231 | |
| Few-shot classification | Mini-ImageNet | 1-shot Acc50.47 | 175 | |
| Few-shot classification | miniImageNet standard (test) | 5-way 1-shot Acc54.24 | 138 | |
| 5-way Image Classification | tieredImageNet 5-way (test) | 1-shot Acc62.95 | 117 | |
| Few-shot classification | MiniImagenet | 5-way 5-shot Accuracy64.03 | 98 | |
| 5-way Few-shot Classification | miniImageNet standard (test) | Accuracy64.03 | 91 | |
| Few-shot classification | Mini-Imagenet 5-way 5-shot | Accuracy64.03 | 87 |