| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MathVision | Accuracy86 | 183 | 6d ago | ||
| WeMath | Accuracy98.7 | 168 | 4d ago | ||
| MMK12 | Qwen2.5-VL-7B + PGPO | Accuracy80.83 | 24 | 16d ago | |
| MathVista In-domain | ADPO | Overall Accuracy65.3 | 16 | 1mo ago | |
| MMMU OOD | Base (Qwen2-VL-7B) | ARD69.2 | 12 | 1mo ago | |
| MathVerse | PAPO_D | Avg@8 Accuracy68.58 | 10 | 4d ago | |
| MathVista | PAPO_G | Avg@8 Accuracy69.53 | 10 | 4d ago | |
| R1-OV Bench | AVATAR | Accuracy38.9 | 8 | 19d ago | |
| V-Math | AutoTool (Qwen3-8B) | Accuracy53 | 5 | 1mo ago |