Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Unsupervised Reinforcement Learning on ExORL Cheetah (zero-shot)

378Average Return

One-Step FB

105.52176.26247317.74Feb 11, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
378
2026.02
271
2026.02
187
2026.02
127
2026.02
125
2026.02
116