Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Reinforcement Learning on D4RL v2 (MuJoCo M/MR/ME + Adroit Tasks)

87.6Average Score

DOIT

12.61632.08351.5571.017Jan 28, 2026Jan 31, 2026Feb 4, 2026Feb 7, 2026Feb 11, 2026Feb 14, 2026Feb 18, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
87.6------------
2026.02
86.69896.910854.147.693.58684.4110.7---
2026.02
86.382.4100.7110.750.647.596.185.194.3109.7---
2026.02
86.1------------
2026.02
82.1------------
2026.02
80.2------------
2026.02
76.966.394.791.547.444.286.778.373.9109.6---
2026.02
75.358.596.8107.244.242.279.879.761.2108.4---
2026.01
58.169.245.981.346.343.146.965.955.496.355.585.35.8
2026.01
49.55545.127.928.534.422.354.729.561.391.5117.825.9
2026.01
38.960.923.556.643.327.737.247.327.650.95.785.70.3
2026.01
35.86116.251.735.634.114.334.217.7385171.54.3
2026.01
31.850.113.243.241.716.339.754.113.82631.341.210.7
2026.01
30.955.619.136.842.826.333.153.715.542.510.235.8-1.1
2026.01
26.355.912.549.737.123.632.45.934.456.58.3-0.1-0.3
2026.01
24.428.819.738.240.225.233.725.42.535.133.89.50.6
2026.01
15.530.711.322.625.929.123.511.29.312.409.50.6