Share your thoughts, 1 month free Claude Pro on usSee more

Offline-to-online Reinforcement Learning on D4RL Walker expert discretized

14.8Online Normalized Score

DRIFT

Updated 2mo ago

Evaluation Results

Method	Links
DRIFT 2026.05		14.8	0.1
Cal-QL 2026.05		12.4	4
CQL 2026.05		10.7	4
IQL 2026.05		10.1	0.2
AWAC 2026.05		9	9.4
PEX 2026.05		7.2	0.2
PPO 2026.05		7	-
DQN 2026.05		6.5	0.2
SPA 2026.05		0.2	0.2