Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on MuJoCo Ant

7,889.1Average Return

Oracle-TC RARL

423.8762,361.9634,300.056,238.137May 24, 2023Nov 10, 2023Apr 28, 2024Oct 15, 2024Apr 3, 2025Sep 20, 2025Mar 10, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2024.06
7,889.1
2024.06
7,739.65
2024.06
7,558.58
2024.06
7,500.88
2024.06
7,366.9
2024.06
7,123.07
2024.06
6,912.76
2026.03
5,984
2024.06
5,958.21
2026.03
5,925
2026.03
5,700
2024.06
5,577.41
2023.05
5,230
2023.05
5,118
2026.03
5,071
2026.03
4,887
2024.06
4,684.83
2024.06
4,650.55
2026.03
4,400
2026.03
2,858
2024.06
2,600.43
2026.03
2,122
2026.03
977
2026.03
711