| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MME-RealWorld Lite | PFlowNet | Overall Score67 | 46 | 29d ago | |
| MMStar latest (test) | CP67.2 | 30 | 3mo ago | ||
| OPV2V | CodeAlign | AP3093.39 | 24 | 3mo ago | |
| TreeBench | PFlowNet | Overall Accuracy55.3 | 17 | 29d ago | |
| HRBen 8K | AXPO | Pass@490 | 16 | 6d ago | |
| HRBen 4K | AXPO | Pass@491 | 16 | 6d ago | |
| Visual Probe | AXPO | Pass@467.9 | 16 | 6d ago | |
| HRBen 8K | Pass@178.9 | 16 | 6d ago | ||
| Visual Probe | SFT + AXPO | Pass@145.8 | 16 | 6d ago | |
| V* | Pass@190.2 | 16 | 6d ago | ||
| TWI-oriented TreeBench online setting | RTWI | Accuracy70.5 | 16 | 3mo ago | |
| HallusionBench | AVAR-Thinker | Score59.5 | 15 | 3mo ago | |
| MMStar | Score68.8 | 15 | 3mo ago | ||
| MME total perception score | ICLA | Total Perception Score1,711 | 15 | 3mo ago | |
| V* | Overall Score95.7 | 13 | 14d ago | ||
| TWI-oriented TreeBench (offline) | RTWI | Accuracy71.1 | 12 | 3mo ago | |
| CVBench (test) | VaLR-M | Accuracy87.6 | 11 | 3mo ago | |
| V* (test) | VaLR-M | Accuracy86.9 | 11 | 3mo ago | |
| MMStar (test) | VaLR-M | Accuracy72.3 | 11 | 3mo ago | |
| MMVP (test) | GPT-4o | Accuracy68.7 | 11 | 3mo ago | |
| BLINK (test) | VaLR-M | Accuracy0.647 | 11 | 3mo ago | |
| HRbench 8K | ZoomEye | FSP88.5 | 10 | 14d ago | |
| TSR-Suite Task 2 | TimeOmni-1 | Accuracy64 | 8 | 3mo ago | |
| TSR-Suite Task 1 | TimeOmni-1 | Accuracy87.7 | 8 | 3mo ago | |
| Custom Pedestrian-Crossing Dataset (Evening) | DeepIPC | Segmentation IoU80.4 | 3 | 1d ago |