Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline multitask Reinforcement Learning on D4RL Antmaze large-diverse

359Avg Episodic Return

DiSPO

118.76181.13243.5305.87Mar 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
359
2024.03
244
2024.03
226
2024.03
215
2024.03
181
2024.03
132
2024.03
128