Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Mathematical Reasoning on WeMath mini (test)
Loading...
72.6
Accuracy
Claude-3.7-Sonnet
41.088
49.269
57.45
65.631
Jun 8, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Claude-3.7-Sonnet
#Data=/
2025.06
72.6
Perception-R1-7B
#Data=1.4K
2025.06
72
Qwen2.5-VL-72B-IT
#Data=/
2025.06
71.9
GPT-4o
#Data=/
2025.06
68.8
OpenVLThinker-7B
#Data=25K
2025.06
66.3
VLAA-Thinker-7B
#Data=25K
2025.06
66.3
MM-Eureka-7B
#Data=15K
2025.06
65.6
SophiaVL-R1-7B
#Data=130K
2025.06
64.8
R1-OneVision-7B
#Data=155K
2025.06
61.9
Qwen2.5-VL-7B-IT
#Data=/
2025.06
61.4
InternVL2.5-8B
#Data=/
2025.06
53.5
Qwen2-VL-7B-IT
#Data=/
2025.06
42.3
Feedback
Search any
task
Search any
task