Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hierarchical Planning on BabyAI Unlock
Loading...
5,705
Token Cost
LLM + π
2,015.68
26,918.59
51,821.5
76,724.41
Jan 31, 2026
Token Cost
Success
Updated 4d ago
Evaluation Results
Method
Method
Links
Token Cost
Success
LLM + π
Agent Architecture=LLM...
2026.01
5,705
-
TheoryCoder-2
Ablation Configuration...
2026.01
33,116
-
TheoryCoder-2
Ablation Configuration...
2026.01
33,116
-
TheoryCoder-2
Ablation Configuration...
2026.01
41,734
-
LLM + P
Agent Architecture=LLM...
2026.01
50,071
-
WorldCoder
Agent Architecture=Wor...
2026.01
97,938
-
Feedback
Search any
task
Search any
task