Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical & Geometric Reasoning on MMK12
Loading...
86.4
Accuracy@8
Qwen2.5-VL-32B + DAPO
40.744
52.597
64.45
76.303
Oct 10, 2025
Accuracy@8
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy@8
Qwen2.5-VL-32B + DAPO
Model Scale=32B, Algor...
2025.10
86.4
Qwen2.5-VL-32B + VPPO
Model Scale=32B, Algor...
2025.10
86.3
Qwen2.5-VL-7B + VPPO
Model Scale=7B, Algori...
2025.10
82.8
Qwen2.5-VL-7B + DAPO
Model Scale=7B, Algori...
2025.10
82.1
Qwen2.5-VL-32B + GRPO
Model Scale=32B, Algor...
2025.10
80.7
PAPO-D-7B
Model Scale=7B, Traini...
2025.10
80.6
MM-Eureka-32B
Model Scale=32B, Promp...
2025.10
73.4
Qwen2.5-VL-7B + GRPO
Model Scale=7B, Algori...
2025.10
72.3
R1-ShareVL-7B
Model Scale=7B, Traini...
2025.10
70.9
Qwen2.5-VL-32B
Model Scale=32B
2025.10
68.8
VL-Rethinker-7B
Model Scale=7B, Traini...
2025.10
68.3
MM-Eureka-7B
Model Scale=7B, Traini...
2025.10
67.5
ThinkLite-7B
Model Scale=7B, Traini...
2025.10
62.6
NoisyRollout-32B
Model Scale=32B, Promp...
2025.10
60.2
NoisyRollout-7B
Model Scale=7B, Traini...
2025.10
50
Qwen2.5-VL-7B
Model Scale=7B
2025.10
42.5
Feedback
Search any
task
Search any
task