Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline-to-online Reinforcement Learning on pen

5.3Regret

SMAC

3.30416.77730.2543.723Feb 19, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
5.3
2026.02
7.7
2026.02
8
2026.02
10.3
2026.02
13.3
2026.02
16.1
2026.02
17.8
2026.02
18
2026.02
28.8
2026.02
31.8
2026.02
32.9
2026.02
55.2