Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Hopper (forward)

982Average Episodic Return

COMBO

430.8573.9717860.1Mar 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
982
2024.03
832
2024.03
753
2024.03
746
2024.03
726
2024.03
670
2024.03
652
2024.03
566
2024.03
493
2024.03
487
2024.03
470
2024.03
452