Share your thoughts, 1 month free Claude Pro on usSee more

Offline Meta Reinforcement Learning on Cheetah-LS (out-of-distribution)

865.5Average Return

SPC

Updated 4mo ago

Evaluation Results

Method	Links
SPC 2026.03		865.5
FOCAL 2026.03		826.6
CSRO 2026.03		813.9
UNICORN-SS 2026.03		806.1
UNICORN-SUP 2026.03		795.8
DORA 2026.03		785.8