Share your thoughts, 1 month free Claude Pro on usSee more

Offline Multitask Reinforcement Learning on Hopper jump

832Average Episodic Return

DiSPO

Updated 5mo ago

Evaluation Results

Method	Links
DiSPO 2024.03		832
MOPO 2024.03		753
USFA 2024.03		746
FB 2024.03		726
COMBO 2024.03		670
RaMP 2024.03		652