Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on walker2d
Loading...
544.4
Regret
SMAC
460.476
1,026.963
1,593.45
2,159.937
Feb 19, 2026
Regret
Updated 4d ago
Evaluation Results
Method
Method
Links
Regret
SMAC
Offline Algorithm=SMAC...
2026.02
544.4
CalQL/CQL
Offline Algorithm=CalQ...
2026.02
1,136.9
SMAC
Fine-tuning algorithm=...
2026.02
1,232.3
TD3+BC
Fine-tuning algorithm=...
2026.02
1,242.6
IQL
Fine-tuning algorithm=...
2026.02
1,548.6
IQL
Offline Algorithm=IQL,...
2026.02
1,918.7
TD3+BC
Offline Algorithm=TD3+...
2026.02
1,988.4
CalQL/CQL
Fine-tuning algorithm=...
2026.02
2,642.5
Feedback
Search any
task
Search any
task