Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Unsupervised Reinforcement Learning on ExORL Cheetah (zero-shot)

378Average Return

One-Step FB

105.52176.26247317.74Feb 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
378
2026.02
271
2026.02
187
2026.02
127
2026.02
125
2026.02
116