Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Ant v3

9,108Average Final Return

DACER

-451.5762,030.2374,512.056,993.863May 24, 2024Sep 14, 2024Jan 5, 2025Apr 29, 2025Aug 20, 2025Dec 11, 2025Apr 4, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2024.05
9,108
2024.05
7,086
2024.05
6,427
2024.05
6,203
2024.05
6,184
2024.05
6,156
2024.05
4,549
2026.04
4,373.3
2026.04
3,852.3
2026.04
3,754.1
2026.04
3,279.2
2026.04
3,085.2
2026.04
3,010.6
2026.04
2,824.9
2026.04
2,694.3
2026.04
2,419.7
2026.04
2,069.5
2026.04
1,093.6
2026.04
924.3
2026.04
881.9
2026.04
880.7
2026.04
817.9
2026.04
697.4
2026.04
-74.9
2026.04
-79.2
2026.04
-83.9