| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| RealWorldQA | DAPO | Accuracy71.3 | 62 | 6d ago | |
| MMMU | Accuracy75.1 | 35 | 12d ago | ||
| VisNumBench | Vision-SR1 | Accuracy44.6 | 30 | 20d ago | |
| MMMU Pro | PDCR | Accuracy50.7 | 30 | 20d ago | |
| SeedBench 2+ | DPA-Qwen3-32B | SeedBench2+ Score67.7 | 11 | 16d ago | |
| ScienceQA | IREASONER | Accuracy89.92 | 6 | 3mo ago | |
| AI2D | IREASONER | Accuracy83.89 | 6 | 3mo ago | |
| InfoGraphic-VQA (val) | IREASONER | Accuracy81.56 | 6 | 3mo ago | |
| 10-benchmark Global Set | HEED | Average Score78.79 | 5 | 15d ago | |
| SQA | Unicorn | SQA Score68.81 | 4 | 6d ago | |
| MME | Unicorn | MME Score60.24 | 4 | 6d ago |