Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on hopper
Loading...
353
Regret
SMAC
268.324
839.887
1,411.45
1,983.013
Feb 19, 2026
Regret
Updated 4d ago
Evaluation Results
Method
Method
Links
Regret
SMAC
Offline Algorithm=SMAC...
2026.02
353
SMAC
Fine-tuning algorithm=...
2026.02
425.5
CalQL/CQL
Offline Algorithm=CalQ...
2026.02
533.7
TD3+BC
Offline Algorithm=TD3+...
2026.02
552.4
IQL
Offline Algorithm=IQL,...
2026.02
958
TD3+BC
Fine-tuning algorithm=...
2026.02
1,295.4
IQL
Fine-tuning algorithm=...
2026.02
1,392.6
CalQL/CQL
Fine-tuning algorithm=...
2026.02
2,469.9
Feedback
Search any
task
Search any
task