Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on FailureBench Bounded Push

4,593.96Average Return

FARL

-84.61521,130.01492,344.6453,559.2751Jan 12, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
4,593.96
2026.01
420.79
2026.01
156.28
2026.01
95.33