| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MMMU, SEED, OCRBench, VizWiz, ScienceQA, TextVQA (test/val) | | MMMU Score 61.9 | 42 | 1mo ago | |
| MME | | MME Score 78.7 | 26 | 1mo ago | |
| MMBench Chinese (dev) | | Accuracy 72.8 | 22 | 1mo ago | |
| Image Benchmarks HallBench, MME, TextVQA, ChartQA, AI2D, RealWorldQA, CCBench, OCRVQA, SQA-IMG, POPE | Qwen2.5-VL-7B | HallBench Score 46.5 | 13 | 9d ago | |
| MM-Vet | | MM-Vet Score 44.6 | 4 | 1mo ago | |
| MMMU | | Accuracy 56.8 | 4 | 1mo ago | |