Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Humanoid v3

11,888Avg Final Return

DACER

-121.2962,996.5026,114.39,232.098May 24, 2024Sep 14, 2024Jan 5, 2025Apr 29, 2025Aug 20, 2025Dec 11, 2025Apr 4, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2024.05
11,888
2024.05
10,829
2024.05
9,335
2024.05
6,869
2024.05
5,631
2024.05
5,291
2026.04
3,320.6
2026.04
3,228.1
2026.04
3,162.9
2026.04
2,996.4
2026.04
2,889.5
2026.04
2,843.2
2026.04
2,189.4
2026.04
1,829.5
2026.04
1,197.7
2024.05
965
2026.04
791.3
2026.04
629.2
2026.04
520.5
2026.04
505.9
2026.04
407.2
2026.04
398.3
2026.04
394.1
2026.04
362.9
2026.04
341.6
2026.04
340.6