Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control

About

Deep reinforcement learning for continuous control has recently achieved impressive progress. However, existing methods often suffer from primacy bias, a tendency to overfit early experiences stored in the replay buffer, which limits an RL agent's sample efficiency and generalizability. In contrast, humans are less susceptible to such bias, partly due to infantile amnesia, where the formation of new neurons disrupts early memory traces, leading to the forgetting of initial experiences. Inspired by this dual processes of forgetting and growing in neuroscience, in this paper, we propose Forget and Grow (FoG), a new deep RL algorithm with two mechanisms introduced. First, Experience Replay Decay (ER Decay) "forgetting early experience", which balances memory by gradually reducing the influence of early experiences. Second, Network Expansion, "growing neural capacity", which enhances agents' capability to exploit the patterns of existing data by dynamically adding new parameters during training. Empirical results on four major continuous control benchmarks with more than 40 tasks demonstrate the superior performance of FoG against SoTA existing deep RL algorithms, including BRO, SimBa, and TD-MPC2.

Zilin Kang, Chenyuan Hu, Yu Luo, Zhecheng Yuan, Ruijie Zheng, Huazhe Xu• 2025

Related benchmarks

TaskDatasetResultRank
Continuous ControlMuJoCo Ant v4
Average Return6.76e+3
46
Continuous ControlMuJoCo Walker2d v4--
39
Continuous ControlMuJoCo HalfCheetah v4
Average Return1.17e+4
36
Continuous ControlGym MuJoCo Humanoid v4
Average Return6.74e+3
15
Continuous ControlGym MuJoCo Suite Aggregate
IQM1.242
15
Continuous ControlGym MuJoCo Hopper v4
Average Return1.82e+3
15
Continuous ControlDMC Suite Hard v1 (test)
Dog Run Return613
12
Continuous ControlHumanoidBench (w/ Hand)
Return (Slide)201
12
Continuous ControlDMC Easy
Acrobot Swingup Return414
9
Continuous ControlHumanoidBench without dexterous hands
Pole Score893
8
Showing 10 of 10 rows

Other info

Follow for update