Reinforcement Learning on AdroitHandDoor v1

1,725 Average Return

Causal PBRS

[Chart: Average Return over time, Sep 24, 2025 to Feb 10, 2026; y-axis ticks at -109.872, 366.489, 842.85, and 1,319.211]
Updated 4d ago

Evaluation Results

Average Return    Date
1,725             2026.02
1,659             2026.02
1,472             2026.02
1,289             2026.02
415               2026.02
308               2026.02
105               2026.02
71                2026.02
-27               2026.02
-27               2026.02
-36
-39.3             2025.09