Share your thoughts, 1 month free Claude Pro on usSee more

Offline Meta Reinforcement Learning on Cheetah-speed out-of-distribution

756Average Return

SPC

Updated 4mo ago

Evaluation Results

Method	Links
SPC 2026.03		756
FOCAL 2026.03		607.8
CSRO 2026.03		603.5
UNICORN-SS 2026.03		598.8
DORA 2026.03		573
UNICORN-SUP 2026.03		554.8