Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline-to-online Reinforcement Learning on D4RL Walker expert discretized

14.8Online Normalized Score

DRIFT

-0.3843.5587.511.442May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
14.80.1
2026.05
12.44
2026.05
10.74
2026.05
10.10.2
2026.05
99.4
2026.05
7.20.2
2026.05
7-
2026.05
6.50.2
2026.05
0.20.2