| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| NaturalBench | Qwen2.5-VL-7B + SRPO | General Score78.62 | 21 | 23d ago | |
| LogicVista | Qwen2.5-VL-7B + SRPO | Score50.56 | 21 | 23d ago | |
| MMMU-Pro | Qwen2.5-VL-7B + SRPO | Score38.09 | 21 | 23d ago | |
| EMMA full | OpenAI-o1 | Accuracy45.7 | 14 | 3mo ago | |
| MMStar (val) | Qwen2.5-VL-72B-IT | Accuracy70.8 | 13 | 3mo ago | |
| M3CoT | MiMo-VL-7B-OpenMMReasoner | Pass@1 Accuracy78.21 | 11 | 3mo ago | |
| MMMU (val) | Qwen3-VL-8B-DeepVision | Pass@1 Accuracy71.33 | 11 | 3mo ago | |
| General Benchmarks | MUPO | Top-1 Accuracy57.8 | 6 | 2mo ago |