| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MMStar latest (test) | CP67.2 | 30 | 1mo ago | ||
| MME-RealWorld Lite | TTSP | Overall Score59 | 29 | 4d ago | |
| OPV2V | CodeAlign | AP3093.39 | 24 | 1mo ago | |
| TWI-oriented TreeBench online setting | RTWI | Accuracy70.5 | 16 | 1mo ago | |
| HallusionBench | AVAR-Thinker | Score59.5 | 15 | 1mo ago | |
| MMStar | Score68.8 | 15 | 1mo ago | ||
| MME total perception score | ICLA | Total Perception Score1,711 | 15 | 1mo ago | |
| TWI-oriented TreeBench (offline) | RTWI | Accuracy71.1 | 12 | 1mo ago | |
| CVBench (test) | VaLR-M | Accuracy87.6 | 11 | 1mo ago | |
| V* (test) | VaLR-M | Accuracy86.9 | 11 | 1mo ago | |
| MMStar (test) | VaLR-M | Accuracy72.3 | 11 | 1mo ago | |
| MMVP (test) | GPT-4o | Accuracy68.7 | 11 | 1mo ago | |
| BLINK (test) | VaLR-M | Accuracy0.647 | 11 | 1mo ago | |
| TSR-Suite Task 2 | TimeOmni-1 | Accuracy64 | 8 | 1mo ago | |
| TSR-Suite Task 1 | TimeOmni-1 | Accuracy87.7 | 8 | 1mo ago | |
| MME-RealWorld Lite (test) | FOCUS | OCR83.6 | 3 | 1mo ago |