Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Humanoid v5 (delta=[0.8^6, 0.5^6, 0.2^5], kappa=4.0) (test)

5,620Return

DD-SRad

2,1363,040.53,9454,849.5May 5, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.05
5,6200.0380.489
2026.05
5,4980.0370.203
2026.05
5,3130.0640.184
2026.05
5,2910.0230.179
2026.05
5,2040.0340.159
2026.05
5,1910.0670.249
2026.05
4,8680.0550.662
2026.05
4,5580.110.053
2026.05
4,4200.0260.465
2026.05
3,8100.0870.015
2026.05
2,7420.4930.024
2026.05
2,2700.4960.053