Reinforcement Learning on Walker2d v4

39,641,353 Avg Return

MDC-SAN

[Chart: Avg Return over time, Jan 29, 2026 to Mar 10, 2026]
Updated 1mo ago

Evaluation Results

Date      Avg Return
2026.02   39,641,353
2026.02   33,071,514
-         18,621,450
2026.02   4,436,196
2026.02   4,340,383
2026.02   4,314,423
2026.02   4,235,354
2026.02   4,200,717
2026.01   4,497
2026.01   4,445
2026.03   4,367.1
2026.01   4,271
2026.03   4,128.8
2026.01   4,050
2026.01   4,003
2026.03   3,277
2026.03   2,923.2