Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline-to-online Reinforcement Learning on D4RL Cheetah expert discretized

9.7Online Normalized Score

DRIFT

-0.2842.3084.97.492May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
9.70.5
2026.05
8.60.7
2026.05
8.50.7
2026.05
7.70.2
2026.05
6.80.2
2026.05
5.9-
2026.05
4.70.1
2026.05
11.1
2026.05
0.10.1