Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Goal-Conditioned Reinforcement Learning on puzzle 4x5

9,600Success Rate

DQC

-3842,2084,8007,392Dec 11, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
9,600
2025.12
9,300
2025.12
3,300
2025.12
2,000
2025.12
2,000
2025.12
1,900
2025.12
100
2025.12
0
2025.12
0
2025.12
0