Overcoming catastrophic forgetting in neural networks

About

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST hand written digit dataset and by learning several Atari 2600 games sequentially.

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A. Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath, Dharshan Kumaran, Raia Hadsell• 2016

Related benchmarks

Task	Dataset	Result
Object Hallucination Evaluation	POPE	--	2019
Mathematical Reasoning	MATH	Accuracy3.68	882
Multimodal Understanding	MMBench	Accuracy50.26	847
Language Understanding	MMLU	Accuracy44.8	844
Time Series Forecasting	ETTh2	MSE102.2	796
Science Question Answering	ScienceQA	--	791
Reasoning	BBH	Accuracy25.02	726
Physical Commonsense Reasoning	PIQA	Accuracy51.85	696
Question Answering	ARC Challenge	Accuracy (ARC)59.3	598
Time Series Forecasting	Weather	MSE147.7	497

Showing 10 of 898 rows

...

Other info

Follow for update

@wizwand_team Discord