Share your thoughts, 1 month free Claude Pro on usSee more

Offline Meta Reinforcement Learning on Ant-dir (out-of-distribution)

410.7Average Return

SPC

Updated 4mo ago

Evaluation Results

Method	Links
SPC 2026.03		410.7
UNICORN-SS 2026.03		405.9
CSRO 2026.03		399.2
FOCAL 2026.03		368.8
UNICORN-SUP 2026.03		211.4
DORA 2026.03		156.8