Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline multitask Reinforcement Learning on Hopper backward

596Average Episodic Return

MOPO

177.92286.46395503.54Mar 10, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.03
596
2024.03
596
2024.03
367
2024.03
367
2024.03
269
2024.03
269
2024.03
261
2024.03
261
2024.03
220
2024.03
220
2024.03
194
2024.03
194