| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| MME | Jigsaw + CARE | Score | 2,565.72 | 557 | 2d ago |
| MM-Vet | InternVL3.5-8B-Masters | Accuracy | 85.6 | 122 | 2d ago |
| MMBench | MM1 | MMB Score | 79.7 | 118 | 2d ago |
| SEED-Bench | Jigsaw | Accuracy | 77.01 | 80 | 3d ago |
| MMBench CN | mPlug-Owl3 | Accuracy | 74.3 | 57 | 2d ago |
| MM-Bench | CoLLaVO | Accuracy | 83 | 57 | 3d ago |
| MMStar | Jigsaw + CARE | Accuracy | 65.8 | 46 | 3d ago |
| LLaVA-Bench | LLaVA-v1.6 (7B) w/ STIC | LLaVA-Bench Score | 79.2 | 38 | 2d ago |
| LLaVA-bench in-the-wild | MoE-LLaVA-2.7Bx4-Top2† | Score | 97.3 | 36 | 2d ago |
| MMB | | Score | 85.31 | 27 | 3d ago |
| LLaVA-Bench-Wild (LLaVA-W) | MoE-LLaVA | Overall Score | 97.3 | 24 | 3d ago |
| MME | | Total Score | 1,513.8 | 16 | 2d ago |
| SEED-Bench | LLaVA-1.5-7B | SEED-Bench Score | 66.8 | 15 | 3d ago |
| SEED Image | | Accuracy | 77.1 | 15 | 3d ago |
| MME-P | BAGEL | MME-P Score | 1,687 | 14 | 3d ago |
| MMBench EN (test) | InternVL 1.2 | Accuracy | 82.2 | 14 | 3d ago |
| MMT-Bench | Mix + CL + CARE | Accuracy | 62.65 | 13 | 3d ago |
| MME (test) | | Score | 94.9 | 12 | 2d ago |
| MME | HiMAP | Accuracy | 1,821.3 | 12 | 2d ago |
| DID-Bench | Ours | CLIP-S Score | 41.19 | 12 | 2d ago |
| MMBench-CN (test) | InternVL 1.5 | Accuracy | 0.82 | 12 | 3d ago |
| Touchstone | Emu2-Chat | Score | 703.8 | 11 | 3d ago |
| SEED-Bench Image | RADIOv2.5-H | Accuracy | 77.39 | 10 | 2d ago |
| MMBench Chinese V1.1 | | Accuracy | 80.1 | 10 | 3d ago |
| MMBench English V1.1 | | Accuracy | 78.5 | 10 | 3d ago |