Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on D4RL hopper (Regret)
Loading...
293.7
Regret
CalQL/CQL
176.916
965.208
1,753.5
2,541.792
Feb 19, 2026
Regret
Updated 4d ago
Evaluation Results
Method
Method
Links
Regret
CalQL/CQL
Offline Algorithm=CalQ...
2026.02
293.7
SMAC
Offline Algorithm=SMAC...
2026.02
386.3
IQL
Offline Algorithm=IQL,...
2026.02
798
TD3+BC
Offline Algorithm=TD3+...
2026.02
3,213.3
Feedback
Search any
task
Search any
task