| Dataset Name | SOTA Method | Metric | Trend | Updated | |
|---|---|---|---|---|---|
| MMBench | BAGEL | Accuracy 85 | 104 | 3d ago | |
| MMMU | VINO | Overall Score 67.4 | 28 | 3d ago | |
| CVBench-2D | Jigsaw + CARE | Accuracy 77.76 | 22 | 3d ago | |
| RealworldQA | | Overall Score 75.4 | 17 | 3d ago | |
| MMVP | BAGEL-7B | Accuracy 69.3 | 12 | 3d ago | |
| SEED | LLaVA-1.5-13B | Accuracy 68.2 | 11 | 2d ago | |
| LLaVA-W | VIG training + VAR | Score 63 | 10 | 3d ago | |
| LLaVA-Wild | LLaVA-FastV (k=3, r=0.75) | LLaVA-Wild Accuracy 74.2 | 8 | 3d ago | |
| VizWiz (test) | LLaVA-FastV (k=3, r=0.75) | VizWiz Score 54.7 | 8 | 3d ago | |
| MMBench v1.0 (test) | LLaVA-1.5 13B + VIG training | Accuracy 68.67 | 6 | 3d ago | |
| MMVet v1.0 (test) | LLaVA-1.5 13B + VIG training | Score 36.87 | 6 | 3d ago | |
| LLaVAW v1.0 (test) | LLaVA-1.5 13B + VIG training | Score 73.45 | 6 | 3d ago | |
| Common Visual Understanding Benchmarks (GQA, MMB, MME, POPE, SEED, SQA, VQAv2) | Upper Bound | GQA Score 61.1 | 5 | 3d ago | |