Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline-to-online Reinforcement Learning on relocate
Loading...
62.8
Regret
SMAC
61.344
71.172
81
90.828
Feb 19, 2026
Regret
Updated 4d ago
Evaluation Results
Method
Method
Links
Regret
SMAC
Offline Algorithm=SMAC...
2026.02
62.8
SMAC
Fine-tuning algorithm=...
2026.02
84.9
IQL
Fine-tuning algorithm=...
2026.02
95.3
IQL
Offline Algorithm=IQL,...
2026.02
95.8
IQL
Offline Algorithm=IQL,...
2026.02
97.4
CalQL/CQL
Offline Algorithm=CalQ...
2026.02
98.1
TD3+BC
Offline Algorithm=TD3+...
2026.02
98.1
TD3+BC
Fine-tuning algorithm=...
2026.02
98.2
SMAC
Offline Algorithm=SMAC...
2026.02
98.3
CalQL/CQL
Offline Algorithm=CalQ...
2026.02
99
CalQL/CQL
Fine-tuning algorithm=...
2026.02
99.1
TD3+BC
Offline Algorithm=TD3+...
2026.02
99.2
Feedback
Search any
task
Search any
task