Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline multitask Reinforcement Learning on Hopper backward
Loading...
596
Average Episodic Return
MOPO
177.92
286.46
395
503.54
Mar 10, 2024
Average Episodic Return
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Episodic Return
MOPO
2024.03
596
MOPO
2024.03
596
DiSPO
2024.03
367
DiSPO
2024.03
367
FB
2024.03
269
FB
2024.03
269
USFA
2024.03
261
USFA
2024.03
261
RaMP
2024.03
220
RaMP
2024.03
220
COMBO
2024.03
194
COMBO
2024.03
194
Feedback
Search any
task
Search any
task