Share your thoughts, 1 month free Claude Pro on usSee more

Offline multitask Reinforcement Learning on Hopper hopper-forward

982Average Episodic Return

COMBO

Updated 5mo ago

Evaluation Results

Method	Links
COMBO 2024.03		982
DiSPO 2024.03		566
MOPO 2024.03		493
USFA 2024.03		487
RaMP 2024.03		470
FB 2024.03		452