Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Logical Reasoning on LogicVista (BoN@8)
Loading...
60.4
BoN@8 Accuracy
Claude-3.5-Sonnet
35.4192
41.9046
48.39
54.8754
Mar 17, 2026
BoN@8 Accuracy
Delta_8 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
BoN@8 Accuracy
Delta_8 Score
Claude-3.5-Sonnet
Reranking Strategy=Bes...
2026.03
60.4
-
InternVL2.5-38B + EVPV-PRM
Policy Model=InternVL2...
2026.03
58.74
10.84
InternVL2.5-38B + VisualPRM
Policy Model=InternVL2...
2026.03
53.7
5.8
GPT-4o
Reranking Strategy=Bes...
2026.03
52.8
-
Gemini-2.0-Flash
Reranking Strategy=Bes...
2026.03
52.3
-
InternVL2.5-26B + EVPV-PRM
Policy Model=InternVL2...
2026.03
51.72
12.08
InternVL2.5-26B + VisualPRM
Policy Model=InternVL2...
2026.03
51
11.4
InternVL2.5-38B
Policy Model=InternVL2...
2026.03
47.9
-
InternVL2.5-8B + EVPV-PRM
Policy Model=InternVL2...
2026.03
45.33
8.95
InternVL2.5-8B + VisualPRM
Policy Model=InternVL2...
2026.03
43.8
7.8
InternVL2.5-26B
Policy Model=InternVL2...
2026.03
39.64
-
InternVL2.5-8B
Policy Model=InternVL2...
2026.03
36.38
-
Feedback
Search any
task
Search any
task