Unsupervised Reinforcement Learning on ExORL Cheetah (zero-shot)

Average Return: 378

One-Step FB

Updated 1mo ago

Evaluation Results

Method	Links	Average Return
One-Step FB 2026.02		378
FB 2026.02		271
ICVF 2026.02		187
BYOL-gamma 2026.02		127
Laplacian 2026.02		125
HILP 2026.02		116