Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on MountainCarContinuous v0 (Average Agent Reward)

97Average Agent Reward

SA-DT

-328.1-217.7375-107.3752.9875Jun 20, 2024Oct 2, 2024Jan 15, 2025Apr 30, 2025Aug 13, 2025Nov 26, 2025Mar 11, 2026
Updated 17d ago

Evaluation Results

MethodLinks
2026.03
97-
2026.03
96-
2026.03
95-
2026.03
94-
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.630.21
2024.06
93.620.35
2026.01
93.52-
2024.06
93.390.51
2024.06
93.180.41
2024.06
93.150.81
2024.06
93.030.37
2024.06
92.820.89
2024.06
92.720.75
2024.06
92.670.85
2024.06
91.981.47
2024.06
89.14.78
2024.06
81.321.63
2024.06
81.12.02
2024.06
79.341.02
2024.06
76.311.74
2024.06
76.233.03
2024.06
74.27.2
2024.06
71.3513.15
2024.06
71.2512.27
2024.06
69.5810.57
2024.06
66.3612.27
2024.06
66.1920.54
2024.06
65.019.69
2024.06
63.1219.53
2024.06
62.4225.91
2024.06
62.325.59
2024.06
6225.12
2024.06
61.9421.83
2024.06
61.6417.19
2024.06
61.5228.84
2024.06
61.0916.84
2024.06
59.627.32
2024.06
57.1334.55
2026.03
4-
2026.03
-10-
2026.01
-311.75-