Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on MuJoCo Half-Cheetah

13,907Average Return

SiMPO-Lin. Neg.

-545.883,206.316,958.510,710.69May 24, 2023Nov 10, 2023Apr 28, 2024Oct 15, 2024Apr 3, 2025Sep 20, 2025Mar 10, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
13,907
2023.05
13,300
2023.05
13,025
2026.03
9,820
2024.06
9,536.92
2024.06
9,474
2024.06
8,583.55
2024.11
8,543
2024.06
8,467.64
2026.03
8,081
2024.11
7,067
2024.11
6,995
2024.06
6,170.33
2024.06
6,130.71
2024.06
6,092.61
2024.06
4,930.18
2024.11
4,133
2024.06
4,000.98
2024.06
2,350.58
2024.06
206.71
2024.06
36.19
2026.03
13
2026.03
13
2026.03
13
2026.03
13
2026.03
12
2026.03
10
2026.03
10