Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Hopper v3

4,104Average Final Return

DACER

-135.456965.1722,065.83,166.428May 24, 2024Sep 14, 2024Jan 5, 2025Apr 29, 2025Aug 20, 2025Dec 11, 2025Apr 4, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2024.05
4,104
2024.05
3,660
2024.05
3,569
2024.05
3,474
2024.05
2,647
2024.05
2,644
2026.04
2,509.8
2024.05
2,483
2026.04
2,435.2
2026.04
2,374.8
2026.04
2,230.8
2026.04
2,136.3
2026.04
2,045.2
2026.04
1,942.3
2026.04
1,917.6
2026.04
1,818.9
2026.04
1,536.6
2026.04
1,526.7
2026.04
1,403.1
2026.04
1,359.5
2026.04
1,002.6
2026.04
595.5
2026.04
298.6
2026.04
88.8
2026.04
39.7
2026.04
27.6