Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Overcoming catastrophic forgetting in neural networks

About

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST hand written digit dataset and by learning several Atari 2600 games sequentially.

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A. Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath, Dharshan Kumaran, Raia Hadsell• 2016

Related benchmarks

TaskDatasetResultRank
Object Hallucination EvaluationPOPE--
2019
Mathematical ReasoningMATH
Accuracy3.68
882
Multimodal UnderstandingMMBench
Accuracy50.26
847
Language UnderstandingMMLU
Accuracy44.8
844
Time Series ForecastingETTh2
MSE102.2
796
Science Question AnsweringScienceQA--
791
ReasoningBBH
Accuracy25.02
726
Physical Commonsense ReasoningPIQA
Accuracy51.85
696
Question AnsweringARC Challenge
Accuracy (ARC)59.3
598
Time Series ForecastingWeather
MSE147.7
497
Showing 10 of 898 rows
...

Other info

Follow for update