Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on DynaMath

67.2Accuracy

SwimBird

-2.4779215.6115433.70151.79046Jun 5, 2025Jul 19, 2025Sep 1, 2025Oct 15, 2025Nov 28, 2025Jan 11, 2026Feb 24, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.02
67.2
2026.02
65.3
2026.02
61.6
2026.02
57.2
2026.02
57.2
2025.11
57.2
2025.12
55.9
2025.12
55.2
2026.02
55
2026.02
55
2025.11
55
2025.12
54.4
53.3
2026.02
53.3
2025.11
53.3
2025.12
53.2
2025.12
52.2
2025.12
52
2025.12
45.2
2025.12
42.1
2025.06
42.1
2025.12
40
2025.12
39.7
2025.06
39.7
2025.12
38.6
2025.06
38.3
2025.06
37.9
2025.06
36.5
2025.06
35.9
2025.06
35.9
2025.06
35.3
2025.06
35.1
2025.06
34.9
2025.06
34.5
2025.06
33.5
2025.06
33.3
2025.06
32.7
2025.06
31.3
2025.06
30.7
2025.06
30.5
2025.06
30.5
2025.06
29.7
2025.06
29.3
2025.06
28.5
2025.06
27.5
2025.06
26.3
2025.06
25.5
2025.06
25
2025.06
22.6
2025.06
21.4
2025.06
20.8
2025.06
20.4
2025.06
19.4
2025.06
13.4
2026.02
0.262
2026.02
0.259
2026.02
0.246
2026.02
0.202