
Multimodal Mathematical Reasoning on LogicVista

Accuracy: 75.2

Gemini-2.5-Pro-Thinking

[Chart: accuracy over time, Dec 19, 2025 to Mar 22, 2026]
Updated 25d ago

Evaluation Results

Date      Accuracy
2025.12   75.2
2025.12   71.8
2025.12   64.4
2025.12   63.8
2025.12   52.3
2025.12   50.9
2025.12   50.8
2025.12   50.2
2025.12   49.9
2026.03   49.7
2026.03   49.6
2026.03   49.4
2026.03   49.2
2026.03   49.1
2026.03   49
2026.03   48.8
2026.03   48.6
2025.12   48.5
2026.03   48.1
2026.03   47.9
2026.03   47.9
2026.03   47.8
2026.03   47.3
2026.03   47.2
2026.03   46.9
2026.03   46.9
2025.12   46.8
2026.03   46.5
2026.03   46.3
2026.03   46.3
2025.12   45
2025.12   44.5
2025.12   41.4
2025.12   40.7