Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Goal-Conditioned Reinforcement Learning on puzzle-4x6-1B

9,100Success Rate

NS

-2602,1704,6007,030Dec 11, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
9,100
2025.12
8,300
2025.12
6,400
2025.12
3,300
2025.12
2,800
2025.12
1,900
2025.12
900
2025.12
600
2025.12
400
2025.12
100