Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Spatial Reasoning on MAZE
Loading...
85.5
Pass@1
Chain of Mindset (CoM)
76.14
78.57
81
83.43
Feb 10, 2026
Pass@1
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1
Chain of Mindset (CoM)
Base Model=Qwen3-VL-32...
2026.02
85.5
Chain of Mindset (CoM)
Base Model=Gemini-2.0-...
2026.02
84
Direct I/O
Base Model=Qwen3-VL-32...
2026.02
81.5
Direct I/O
Base Model=Gemini-2.0-...
2026.02
76.5
Feedback
Search any
task
Search any
task