| Dataset Name | SOTA Method | Metric | | Updated | Trend |
|---|---|---|---|---|---|
| General Multimodal Evaluation Suite (MMMU, MMBench, MME, ChartQA, AI2D, HallBench) | | MMMU (Val) 72.6 | 14 | 3mo ago | |
| MMMU Pro | Qwen3-VL-32B-Instruct | Accuracy 56.9 | 13 | 8d ago | |
| Aggregated Benchmarks | Qwen2.5-VL-3B | Average Score 71 | 13 | 20d ago | |
| Combined 9 Benchmarks | | Average Accuracy 100 | 13 | 3mo ago | |
| MME | | Normalized Score 66.63 | 12 | 26d ago | |
| BLINK | Qwen3-VL-30B-A3B-Instruct | Accuracy 67.7 | 12 | 1mo ago | |
| General Benchmarks | JARVIS | Average Score 74 | 12 | 3mo ago | |
| MUIR | | Accuracy 77.6 | 10 | 2mo ago | |