Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Reasoning on WeMath

63.8Accuracy

No Compression

19.18430.76742.3553.933Feb 3, 2026Feb 6, 2026Feb 10, 2026Feb 13, 2026Feb 17, 2026Feb 20, 2026Feb 24, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
63.8562.68.8
2026.02
63.4419.56.6
2026.02
6397.71.6
2026.02
62.3--
2026.02
61.6313.55.1
2026.02
61.58--
2026.02
60.23--
2026.02
59.2--
2026.02
58.61--
2026.02
58.53--
2026.02
58.21--
2026.02
58.15--
2026.02
58.02--
2026.02
57.99--
2026.02
57.9--
2026.02
57.59--
2026.02
57.59--
2026.02
57.54--
2026.02
57.44--
2026.02
55.8572.510.3
2026.02
55.4460.58.3
2026.02
54.999.61.8
2026.02
54.89--
2026.02
54.6334.26.1
2026.02
54.14--
2026.02
52.82--
2026.02
52.41--
2026.02
49.5--
2026.02
47.7--
2026.02
45.6--
2026.02
43.3--
2026.02
40.7--
2026.02
39.6--
2026.02
39.3--
2026.02
38.9--
2026.02
38.9--
2026.02
38.8--
2026.02
38.1--
2026.02
38.1--
34.6--
2026.02
34.6--
2026.02
34.5--
20.9--