Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Goal-Conditioned Reinforcement Learning on cube-octuple-1B

3,400Success Rate

SHARSA

-1367821,7002,618Dec 11, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
3,400
2025.12
3,400
2025.12
2,800
2025.12
2,000
2025.12
900
2025.12
300
2025.12
0
2025.12
0
2025.12
0
2025.12
0