| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MME Leaderboard (full) | HALC | Existence Score138.33 | 18 | 4d ago | |
| MME (test) | BLIVA | Communication Score136.43 | 17 | 3d ago | |
| CV-Bench | VLSI-2B | Accuracy90.1 | 17 | 4d ago | |
| MMBench | VILA-7B | MMB Score69.2 | 12 | 4d ago | |
| SEED-Bench | VILA-7B | SEED Score62.5 | 9 | 4d ago | |
| MME | VILA-7B | MMEP Score1,559.6 | 9 | 4d ago | |
| VIEScore | Gemini 2.5 Flash | CV (CIG-SC)1.32 | 7 | 4d ago | |
| MME full | REVIS | Perception Score1,717.44 | 7 | 4d ago | |
| LLaVA-Wilder | Phantom-7B | Accuracy83.7 | 7 | 4d ago | |
| MMStar | IDPruner | Average Score33.44 | 4 | 4d ago | |
| MMBench-CN | Score50.17 | 4 | 4d ago | ||
| Vision-language benchmark | Location Accuracy82.2 | 2 | 4d ago |