Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Meta-Curvature

About

We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner optimization such that the transformed gradients achieve better generalization performance to a new task. For training large scale neural networks, we decompose the curvature matrix into smaller matrices in a novel scheme where we capture the dependencies of the model's parameters with a series of tensor products. We demonstrate the effects of our proposed method on several few-shot learning tasks and datasets. Without any task specific techniques and architectures, the proposed method achieves substantial improvement upon previous MAML variants and outperforms the recent state-of-the-art methods. Furthermore, we observe faster convergence rates of the meta-training process. Finally, we present an analysis that explains better generalization performance with the meta-trained curvature.

Eunbyung Park, Junier B. Oliva• 2019

Related benchmarks

TaskDatasetResultRank
Few-shot classificationtieredImageNet (test)
Accuracy82.61
282
Few-shot Image ClassificationMini-Imagenet (test)
Accuracy80.21
235
Few-shot Image ClassificationminiImageNet (test)--
111
5-way Few-shot ClassificationminiImageNet standard (test)
Accuracy68.47
91
Few-shot Image ClassificationFC100 (test)
Accuracy49.12
69
5-way 5-shot ClassificationOmniglot (test)
Accuracy99.89
49
Few-shot classificationOmniglot 20-way 1-shot (test)
Accuracy99.12
43
Few-shot classificationOmniglot 20-way 5-shot (test)
Accuracy99.65
43
5-way 1-shot ClassificationOmniglot (test)--
34
Few-shot Image ClassificationCIFAR FS (test)
Worst Accuracy16.95
18
Showing 10 of 12 rows

Other info

Code

Follow for update