Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Maze Solving on MAZE
Loading...
84
Pass@1 Accuracy
CoM
23.16
38.955
54.75
70.545
Feb 10, 2026
Pass@1 Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
CoM
Base Model=Gemini-2.0-...
2026.02
84
Direct I/O
Base Model=Gemini-2.0-...
2026.02
76.5
MRP
Base Model=Gemini-2.0-...
2026.02
76.5
Zero-shot CoT
Base Model=Gemini-2.0-...
2026.02
76
ReAct
Base Model=Gemini-2.0-...
2026.02
71.5
Tree of Thoughts
Base Model=Gemini-2.0-...
2026.02
69.5
Meta-Reasoner
Base Model=Gemini-2.0-...
2026.02
30.5
Chain of Code
Base Model=Gemini-2.0-...
2026.02
25.5
Feedback
Search any
task
Search any
task