Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference

About

Lack of performance when it comes to continual learning over non-stationary distributions of data remains a major challenge in scaling neural network learning to more human realistic settings. In this work we propose a new conceptualization of the continual learning problem in terms of a temporally symmetric trade-off between transfer and interference that can be optimized by enforcing gradient alignment across examples. We then propose a new algorithm, Meta-Experience Replay (MER), that directly exploits this view by combining experience replay with optimization based meta-learning. This method learns parameters that make interference based on future gradients less likely and transfer based on future gradients more likely. We conduct experiments across continual lifelong supervised learning benchmarks and non-stationary reinforcement learning environments demonstrating that our approach consistently outperforms recently proposed baselines for continual learning. Our experiments show that the gap between the performance of MER and baseline algorithms grows both as the environment gets more non-stationary and as the fraction of the total experiences stored gets smaller.

Matthew Riemer, Ignacio Cases, Robert Ajemian, Miao Liu, Irina Rish, Yuhai Tu, Gerald Tesauro• 2018

Related benchmarks

Task	Dataset	Result
Continual Learning	Sequential MNIST	Avg Acc99.93	149
Medical Image Segmentation	GLAS	Dice87.61	106
Medical Image Segmentation	LA	Dice87.37	97
Continual Learning	CIFAR100 (test)	Mean Accuracy40.47	69
Continual Learning	Camelyon-TCGA	AACC49.4	64
Class-incremental learning	CIFAR-100 20 tasks	Accuracy5.4	58
Classification	German	--	58
Class-incremental learning	CIFAR-10 Seq	Final Average Accuracy (FAA)57.74	53
Image Classification	CIFAR-10 Seq	Final Average Accuracy93.61	52
Image Classification	Seq-CIFAR-100	Accuracy73.98	52

Showing 10 of 148 rows

...

Other info

Follow for update

@wizwand_team Discord