Offline-to-online Reinforcement Learning on D4RL walker2d
[Chart: Regret by date. Labeled point: SMAC, Regret 650.5; date shown: Feb 19, 2026.]
Evaluation Results

Method     Details                     Date      Regret
SMAC       Offline Algorithm=SMAC...   2026.02   650.5
CalQL/CQL  Offline Algorithm=CalQ...   2026.02   1,553.7
IQL        Offline Algorithm=IQL,...   2026.02   1,801.2
TD3+BC     Offline Algorithm=TD3+...   2026.02   4,739.6