
Offline-to-online Reinforcement Learning on D4RL Hopper medium discretized

Online Normalized Score: 47.9

DRIFT

May 12, 2026
Updated 21d ago

Evaluation Results

Date      Online Normalized Score   (unlabeled)
2026.05   47.9                      0.1
2026.05   44.1                      28.8
2026.05   43.1                      0.4
2026.05   35.8                      28.8
2026.05   27.9                      0.4
2026.05   25.3                      26
2026.05   23.7                      0.4
2026.05   3.1                       -
2026.05   0.4                       0.4