
Experience Replay for Continual Learning

About

Continual learning is the problem of learning new tasks or knowledge while protecting old knowledge and ideally generalizing from old experience to learn new tasks faster. Neural networks trained by stochastic gradient descent often degrade on old tasks when trained successively on new tasks with different data distributions. This phenomenon, referred to as catastrophic forgetting, is considered a major hurdle to learning with non-stationary data or sequences of new tasks, and prevents networks from continually accumulating knowledge and skills. We examine this issue in the context of reinforcement learning, in a setting where an agent is exposed to tasks in a sequence. Unlike most other work, we do not provide an explicit indication to the model of task boundaries, which is the most general circumstance for a learning agent exposed to continuous experience. While various methods to counteract catastrophic forgetting have recently been proposed, we explore a straightforward, general, and seemingly overlooked solution - that of using experience replay buffers for all past events - with a mixture of on- and off-policy learning, leveraging behavioral cloning. We show that this strategy can still learn new tasks quickly yet can substantially reduce catastrophic forgetting in both Atari and DMLab domains, even matching the performance of methods that require task identities. When buffer storage is constrained, we confirm that a simple mechanism for randomly discarding data allows a limited-size buffer to perform almost as well as an unbounded one.
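The "simple mechanism for randomly discarding data" described above can be realized with reservoir sampling, which keeps a uniform random sample of everything ever inserted into a fixed-capacity buffer. The sketch below is a hypothetical illustration of that idea, not the paper's actual implementation; the class name and interface are invented for this example.

```python
import random

class ReservoirReplayBuffer:
    """Fixed-capacity buffer holding a uniform random sample of all items
    ever offered to it (reservoir sampling). Illustrative sketch of the
    random-discard mechanism described in the abstract; the paper's exact
    implementation may differ."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.buffer = []
        self.num_seen = 0  # total items ever offered to the buffer

    def add(self, item):
        self.num_seen += 1
        if len(self.buffer) < self.capacity:
            self.buffer.append(item)
        else:
            # Keep the new item with probability capacity / num_seen by
            # overwriting a uniformly chosen slot; otherwise discard it.
            # This leaves every item seen so far equally likely to be stored.
            j = random.randrange(self.num_seen)
            if j < self.capacity:
                self.buffer[j] = item

    def sample(self, batch_size):
        # Uniform minibatch for off-policy replay / behavioral cloning.
        return random.sample(self.buffer, min(batch_size, len(self.buffer)))
```

Because each stored item is a uniform draw over the whole history, the buffer's contents approximate the distribution of all past experience without task boundaries or task identities, which is what lets a small buffer behave almost like an unbounded one.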

David Rolnick, Arun Ahuja, Jonathan Schwarz, Timothy P. Lillicrap, Greg Wayne • 2018

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Image Classification | CIFAR-100 (test) | - | 3518 |
| Image Classification | CIFAR-10 (test) | - | 3381 |
| Image Classification | CIFAR-10 | - | 507 |
| Image Classification | Tiny ImageNet (test) | Accuracy: 68.15 | 265 |
| Class-incremental learning | CIFAR-100 | Averaged Incremental Accuracy: 77.02 | 234 |
| Image Classification | ImageNet-100 (test) | - | 109 |
| Image Classification | TinyImageNet | - | 108 |
| Continual Learning | CIFAR100 Split | Average Per-Task Accuracy: 36.3 | 85 |
| Image Classification | ImageNet-100 | Accuracy: 33.3 | 84 |
| Class-incremental learning | CIFAR-100 Split (test) | Avg Acc: 9.65 | 75 |

Showing 10 of 94 rows
