Offline-to-online Reinforcement Learning on D4RL Hopper expert discretized

47.1 Online Normalised Score

DRIFT

[Chart: Online Normalised Score]
May 12, 2026
Updated 21d ago

Evaluation Results

Method    Links

Date       Online Normalised Score
2026.05    47.1                       0.4
2026.05    39.5                       0.1
2026.05    29.3                       8.8
2026.05    25.7                       8.8
2026.05    21.7                       0.3
2026.05    17.2                       0.1
2026.05    11.2                       11.5
2026.05    3.7                        -
2026.05    0.3                        0.3