Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Meta Reinforcement Learning on Walker-friction (out-of-distribution)

484.6Average Return

UNICORN-SS

433.224446.562459.9473.238Mar 3, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
484.6
2026.03
475.1
2026.03
474.1
2026.03
473.7
2026.03
462.4
2026.03
435.2