Share your thoughts, 1 month free Claude Pro on usSee more

Offline multitask Reinforcement Learning on Hopper backward

596Average Episodic Return

MOPO

Updated 5mo ago

Evaluation Results

Method	Links
MOPO 2024.03		596
MOPO 2024.03		596
DiSPO 2024.03		367
DiSPO 2024.03		367
FB 2024.03		269
FB 2024.03		269
USFA 2024.03		261
USFA 2024.03		261
RaMP 2024.03		220
RaMP 2024.03		220
COMBO 2024.03		194
COMBO 2024.03		194