Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Principled Fast and Meta Knowledge Learners for Continual Reinforcement Learning

About

Inspired by the human learning and memory system, particularly the interplay between the hippocampus and cerebral cortex, this study proposes a dual-learner framework comprising a fast learner and a meta learner to address continual Reinforcement Learning~(RL) problems. These two learners are coupled to perform distinct yet complementary roles: the fast learner focuses on knowledge transfer, while the meta learner ensures knowledge integration. In contrast to traditional multi-task RL approaches that share knowledge through average return maximization, our meta learner incrementally integrates new experiences by explicitly minimizing catastrophic forgetting, thereby supporting efficient cumulative knowledge transfer for the fast learner. To facilitate rapid adaptation in new environments, we introduce an adaptive meta warm-up mechanism that selectively harnesses past knowledge. We conduct experiments in various pixel-based and continuous control benchmarks, revealing the superior performance of continual learning for our proposed dual-learner approach relative to baseline methods. The code is released in https://github.com/datake/FAME.

Ke Sun, Hongming Zhang, Jun Jin, Chao Gao, Xi Chen, Wulong Liu, Linglong Kong• 2026

Related benchmarks

TaskDatasetResultRank
Continual LearningCW10 (sequence)
Performance76.7
27
Continual Reinforcement LearningMeta-World average over three sequences
Average Performance76.7
6
Continual Reinforcement LearningMinAtar
Breakout Score14.54
6
Robotic ManipulationMeta-World (averaged over 3 sequences)
Average Performance0.767
6
Continual Reinforcement LearningALE/Freeway v5
Average Performance90
5
Continual Reinforcement LearningALE SpaceInvaders v5
Average Performance96
5
Showing 6 of 6 rows

Other info

Follow for update