Continual Reinforcement Learning on CoinRun Normalized Continual Learning

1.34 Max Performance

ARROW

Updated 4mo ago

Evaluation Results

Method	Links
ARROW 2026.03		1.34	3,768,320	100
ARROW 2026.03		1.33	3,768,320	100
DV3 2026.03		1.28	3,604,480	100
DV3 2026.03		1.23	1,802,240	100
ARROW 2026.03		1.19	1,638,400	100
DV3 2026.03		1.19	491,520	100
TES-SAC 2026.03		0.9	-	0
TES-SAC 2026.03		0.86	-	0
TES-SAC 2026.03		0.85	-	0