
Offline Meta Reinforcement Learning on Hopper-mass (out-of-distribution)

583.4 Average Return

SPC

Updated 4mo ago

Evaluation Results

Method	Date	Average Return
SPC	2026.03	583.4
UNICORN-SS	2026.03	550.7
FOCAL	2026.03	547.3
CSRO	2026.03	543.6
DORA	2026.03	534.2
UNICORN-SUP	2026.03	463.9