Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

About

We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.

Chelsea Finn, Pieter Abbeel, Sergey Levine• 2017

Related benchmarks

TaskDatasetResultRank
Node ClassificationCora
Accuracy57.39
583
Node ClassificationCiteseer
Accuracy54.36
503
ClassificationCars
Accuracy48.82
492
Image ClassificationAircraft
Accuracy66.18
340
Image ClassificationCUB
Accuracy64.17
331
Node ClassificationCora-ML
Accuracy71.27
326
Object CountingFSC-147 (test)
MAE24.9
322
Node ClassificationOgbn-arxiv
Accuracy36.19
304
Few-shot classificationtieredImageNet (test)
Accuracy72.41
282
Object CountingFSC-147 (val)
MAE25.54
246
Showing 10 of 427 rows
...

Other info

Follow for update