Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on HalfCheetah v3

17,177Mean Reward

DACER

-979.6323,734.1098,447.8513,161.591Apr 30, 2022Dec 25, 2022Aug 21, 2023Apr 16, 2024Dec 11, 2024Aug 7, 2025Apr 4, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2024.05
17,177-
2024.05
17,025-
2024.05
16,573-
2024.05
13,970-
2024.05
8,632-
2026.04
8,608.4-
2024.05
5,789-
2026.04
5,226.4-
2026.04
5,216.3-
2026.04
4,819.5-
2024.05
4,785-
2026.04
4,551.4-
2026.04
4,266.6-
2022.04
4,211.02211.94
2026.04
4,089.4-
2026.04
3,328.5-
2022.04
3,085.8842.76
2022.04
2,935.9544.11
2022.04
2,879.46929.55
2026.04
2,861.3-
2022.04
2,549.83501.08
2022.04
2,495.37185.11
2022.04
2,423.16602.43
2026.04
2,342.3-
2026.04
2,130.9-
2026.04
1,753.1-
2022.04
1,691.22976.96
2026.04
946.2-
2026.04
555.1-
2026.04
552.5-
2026.04
110.7-
2026.04
-262.1-
2026.04
-276.3-
2026.04
-281.3-