Share your thoughts, 1 month free Claude Pro on usSee more

Offline Meta Reinforcement Learning on Walker-friction (out-of-distribution)

484.6Average Return

UNICORN-SS

Updated 4mo ago

Evaluation Results

Method	Links
UNICORN-SS 2026.03		484.6
CSRO 2026.03		475.1
SPC 2026.03		474.1
FOCAL 2026.03		473.7
DORA 2026.03		462.4
UNICORN-SUP 2026.03		435.2