Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Mathematical Reasoning on DynaMath

81.42Accuracy

Gemini2.5-Pro

17.959234.434650.9167.3854Jan 5, 2026Jan 15, 2026Jan 26, 2026Feb 6, 2026Feb 17, 2026Feb 28, 2026Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
81.42
2026.03
79.5
2026.03
77.41
2026.03
77.39
2026.03
75.78
2026.03
75.54
2026.03
75.05
2026.03
73.69
2026.03
73.25
2026.03
73.2
2026.03
72.4
2026.03
72.19
2026.03
71.67
2026.03
71.38
2026.03
71.22
2026.03
71
2026.03
69.4
2026.03
68.28
2026.03
68.12
2026.03
66.17
2026.03
65.44
2026.03
63.61
2026.03
62.37
2026.01
60.9
2026.02
59.04
2026.02
58.98
2026.02
58.7
2026.02
58.62
2026.02
58.4
2026.02
58.26
2026.02
58.06
56.3
2026.01
53.9
2026.02
51.24
2026.02
50.02
2026.02
46.33
2026.01
46.2
44.9
2026.01
42.5
39.7
2026.01
37.7
2026.01
37.3
2026.01
25.5
2026.01
21
2026.01
20.4