Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline multitask Reinforcement Learning on D4RL Antmaze umaze

593Average Episodic Return

DiSPO

445.32483.66522560.34Mar 10, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
593
2024.03
574
2024.03
571
2024.03
469
2024.03
462
2024.03
459
2024.03
451