Offline multitask Reinforcement Learning on Hopper hopper-forward

Average Episodic Return: 982

COMBO

430.8573.9717860.1
Mar 10, 2024
Updated 1mo ago

Evaluation Results

Date      Average Episodic Return
2024.03   982
2024.03   566
2024.03   493
2024.03   487
2024.03   470
2024.03   452