Geometric Reasoning on Geomverse-109
Chart: Accuracy@1 by date. Top result: VisuoThink, 28.9 Accuracy@1 (Apr 12, 2025).
Evaluation Results
Method                             Backbone          Date     Accuracy@1
VisuoThink                         GPT-4o            2025.04  28.9
VisuoThink                         Claude-3.5-so...  2025.04  27.8
VisuoThink w/o rollout search      Claude-3.5-so...  2025.04  26.7
VisuoThink                         Qwen2-VL-72B-...  2025.04  25.6
VisuoThink w/o rollout search      GPT-4o            2025.04  24.4
VisuoThink w/o rollout search      Qwen2-VL-72B-...  2025.04  19
VisualSketchpad + Equation Solver  Claude-3.5-so...  2025.04  17.8
VisualSketchpad                    Claude-3.5-so...  2025.04  16.7
CoT                                Claude-3.5-so...  2025.04  14.4
VisualSketchpad + Equation Solver  GPT-4o            2025.04  13.3
CoT                                GPT-4o            2025.04  11.1
VisualSketchpad + Equation Solver  Qwen2-VL-72B-...  2025.04  11.1
VisualSketchpad                    GPT-4o            2025.04  8.9
VisualSketchpad                    Qwen2-VL-72B-...  2025.04  6.7
CoT                                Qwen2-VL-72B-...  2025.04  5.6
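
The results above list VisuoThink with and without rollout search for each backbone, so the gain attributable to rollout search can be read off by subtraction. The sketch below is only an illustration of that arithmetic: the dictionary layout and the function name rollout_gain are assumptions made here, not part of the benchmark, and the backbone names are kept truncated exactly as they appear in the listing.

```python
# Accuracy@1 values copied from the results listed above.
# Backbone names are truncated in the source and are left as-is.
# Each entry is (VisuoThink, VisuoThink w/o rollout search).
SCORES = {
    "GPT-4o": (28.9, 24.4),
    "Claude-3.5-so...": (27.8, 26.7),
    "Qwen2-VL-72B-...": (25.6, 19.0),
}


def rollout_gain(scores):
    """Return the Accuracy@1 gain from rollout search for each backbone.

    The result is rounded to one decimal place to match the precision
    of the reported scores and to hide floating-point noise.
    """
    return {
        backbone: round(with_search - without_search, 1)
        for backbone, (with_search, without_search) in scores.items()
    }


if __name__ == "__main__":
    for backbone, gain in rollout_gain(SCORES).items():
        print(f"{backbone}: +{gain} Accuracy@1")
```

Under these numbers the difference is 4.5 points for GPT-4o, 1.1 for the Claude backbone, and 6.6 for the Qwen backbone.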