| Dataset Name | SOTA Method | Metric | Value | Trend | |
|---|---|---|---|---|---|
| MathVision | Seed-1.5-thinking | Accuracy | 68.7 | 31 | 4d ago |
| WeMath | Gemini-2.5-Pro-Thinking | Accuracy | 78 | 26 | 4d ago |
| MathVista In-domain | ADPO | Overall Accuracy | 65.3 | 16 | 4d ago |
| MMMU OOD | Base (Qwen2-VL-7B) | ARD | 69.2 | 12 | 4d ago |
| V-Math | AutoTool (Qwen3-8B) | Accuracy | 53 | 5 | 4d ago |