Multimodal Mathematical Reasoning on MathV
Best result: 63.16 Pass@1 Accuracy, Chain of Mindset (CoM)
[Chart: Pass@1 Accuracy, Feb 10, 2026]
Updated 3d ago
Evaluation Results
Method                  Details                    Date     Pass@1 Accuracy
Chain of Mindset (CoM)  Base Model=Qwen3-VL-32...  2026.02  63.16
Direct I/O              Base Model=Qwen3-VL-32...  2026.02  55.92
CoM                     Base Model=Gemini-2.0-...  2026.02  51
Chain of Mindset (CoM)  Base Model=Gemini-2.0-...  2026.02  51
MRP                     Base Model=Gemini-2.0-...  2026.02  49.34
Zero-shot CoT           Base Model=Gemini-2.0-...  2026.02  48.36
Direct I/O              Base Model=Gemini-2.0-...  2026.02  48.03
Direct I/O              Base Model=Gemini-2.0-...  2026.02  48.03
ReAct                   Base Model=Gemini-2.0-...  2026.02  47.37
Tree of Thoughts        Base Model=Gemini-2.0-...  2026.02  39.14
Chain of Code           Base Model=Gemini-2.0-...  2026.02  22
Meta-Reasoner           Base Model=Gemini-2.0-...  2026.02  21.05