| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| EMMA full | OpenAI-o1 | Accuracy45.7 | 14 | 1mo ago | |
| MMStar (val) | Qwen2.5-VL-72B-IT | Accuracy70.8 | 13 | 1mo ago | |
| M3CoT | MiMo-VL-7B-OpenMMReasoner | Pass@1 Accuracy78.21 | 11 | 1mo ago | |
| MMMU (val) | Qwen3-VL-8B-DeepVision | Pass@1 Accuracy71.33 | 11 | 1mo ago | |
| General Benchmarks | MUPO | Top-1 Accuracy57.8 | 6 | 16d ago |