Share your thoughts, 1 month free Claude Pro on usSee more

Offline multitask Reinforcement Learning on Hopper stand

800Average Episodic Return

DiSPO

Updated 5mo ago

Evaluation Results

Method	Links
DiSPO 2024.03		800
MOPO 2024.03		800
USFA 2024.03		685
FB 2024.03		670
COMBO 2024.03		600
RaMP 2024.03		255