| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| HRBench 4K | Kimi K2.5 + ETCHR | Pass@186.8 | 26 | 6d ago | |
| MMVP | UniMRG | Accuracy74.67 | 24 | 2d ago | |
| V* | Visual Para-Thinker | Accuracy78.8 | 13 | 3mo ago | |
| MMVP (test) | Qwen2.5-VL | MMVP Score75.33 | 11 | 3mo ago | |
| HRBench 8K | Kimi K2.5 + ETCHR | Pass@181.1 | 10 | 9d ago | |
| V*Bench | Kimi K2.5 + ETCHR | Pass@187.4 | 10 | 9d ago | |
| AI2D | HEED | Score85.3 | 10 | 15d ago | |
| InfoVQA | Teacher | Score82.7 | 10 | 15d ago | |
| TextVQA | HEED | Score83.7 | 10 | 15d ago | |
| ChartQA | HEED | Score88.6 | 10 | 15d ago | |
| DocVQA | Teacher | Score95.4 | 10 | 15d ago | |
| OCRBench v2 | HEED | Score63.9 | 10 | 15d ago | |
| EndoAgentBench | EndoCogniAgent | Localization Classification Accuracy75.29 | 8 | 15d ago | |
| VStarBench | MIRROR(ours) | Score83.77 | 7 | 3mo ago | |
| HRB8K | Qwen2.5-VL-72B | Score77.1 | 7 | 3mo ago | |
| MME-RealWorld Lite | MIRROR(ours) | Score51.49 | 6 | 3mo ago | |
| Fine-grained Perception 6 datasets | HEED | Average Score83.18 | 5 | 15d ago |