Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hierarchical Planning on Minihack 5x5
Loading...
1,115
Token Cost
LLM + π
655.8
3,755.4
6,855
9,954.6
Jan 31, 2026
Token Cost
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Token Cost
Success Rate
LLM + π
Agent Architecture=LLM...
2026.01
1,115
-
TheoryCoder-2
Ablation Configuration...
2026.01
5,163
-
TheoryCoder-2
Ablation Configuration...
2026.01
5,163
-
TheoryCoder-2
Ablation Configuration...
2026.01
7,671
-
WorldCoder
Agent Architecture=Wor...
2026.01
8,144
-
LLM + P
Agent Architecture=LLM...
2026.01
12,595
-
Feedback
Search any
task
Search any
task