Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Multitask Reinforcement Learning on Hopper jump

832Average Episodic Return

DiSPO

644.8693.4742790.6Mar 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
832
2024.03
753
2024.03
746
2024.03
726
2024.03
670
2024.03
652