| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| SEED-Bench | Accuracy74.74 | 34 | 13d ago | ||
| MMBench | Vanilla | MMB Score69.3 | 19 | 1mo ago | |
| MME | Vanilla | MME Score1,862 | 18 | 4d ago | |
| MME Leaderboard (full) | HALC | Existence Score138.33 | 18 | 1mo ago | |
| MME (test) | BLIVA | Communication Score136.43 | 17 | 1mo ago | |
| CV-Bench | VLSI-2B | Accuracy90.1 | 17 | 1mo ago | |
| VIEScore | Gemini 2.5 Flash | CV (CIG-SC)1.32 | 7 | 1mo ago | |
| MME full | REVIS | Perception Score1,717.44 | 7 | 1mo ago | |
| LLaVA-Wilder | Phantom-7B | Accuracy83.7 | 7 | 1mo ago | |
| MM-Vet | DSCA | Overall Score57.8 | 4 | 9d ago | |
| MMStar | IDPruner | Average Score33.44 | 4 | 1mo ago | |
| MMBench-CN | Score50.17 | 4 | 1mo ago | ||
| Vision-language benchmark | Location Accuracy82.2 | 2 | 1mo ago |