Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Meta Reinforcement Learning on MuJoCo Ant-dir In-distribution

863.1Average Return

SPC

411.636528.843646.05763.257Mar 3, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
863.1
2026.03
812.9
2026.03
804
2026.03
798
2026.03
596.5
2026.03
429