| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| High-resolution Visual Understanding | HR-Bench 8K | FSP95 | 73 | |
| High-resolution perception | HR Bench 4K | Overall Score87.87 | 44 | |
| High-Resolution Multimodal Reasoning | HR-Bench 4K | Overall Score88.3 | 40 | |
| High-Resolution Multimodal Reasoning | HR-Bench 8K | Overall Score86.4 | 40 | |
| High-Resolution Visual Perception | HR-Bench 4K | Accuracy73.25 | 40 | |
| High-resolution Visual Understanding | HR-Bench 4K | FSP96.5 | 37 | |
| Visual Reasoning | HR-Bench 4K FSP | ACC96.5 | 29 | |
| High-Resolution Visual Perception | HR-Bench 8K | Accuracy81.02 | 24 | |
| Visual Reasoning | HR-Bench 8K | Overall Score72.6 | 24 | |
| Visual Reasoning | HR-Bench 4K | Overall Score0.77 | 24 | |
| Fine-grained visual understanding | HR-Bench 4K | Score79 | 24 | |
| Visual Search | HR-Bench 8K | Accuracy76.3 | 23 | |
| Visual Search | HR-Bench 4K | Accuracy79.4 | 23 | |
| High-Resolution Visual Reasoning | HR-Bench | Score (4K)81 | 21 | |
| Visual Grounded Reasoning | HR-Bench-8K | Overall Score76.3 | 21 | |
| Visual Grounded Reasoning | HR-Bench-4K | Overall Score79.4 | 21 | |
| High-resolution Perception | HR-Bench 8K | Score82 | 19 | |
| High-Resolution Multimodal Reasoning | HR-Bench 8K FCP | Accuracy77 | 19 | |
| High-Resolution Multimodal Reasoning | HR-Bench 8K FSP | ACC94.8 | 19 | |
| High-Resolution Multimodal Reasoning | HR-Bench 4K FCP | ACC78.3 | 19 | |
| Visual Understanding | HR-Bench 8K | Avg@8 Exact Match86.6 | 17 | |
| Visual Understanding | HR-Bench 4K | Avg@8 Exact Match90.2 | 17 | |
| Fine-grained visual understanding | HR-Bench 8K | Score74.9 | 17 | |
| Visual Reasoning | HR-Bench (test) | Accuracy69.94 | 15 | |
| Visual Perception | HR-Bench | Accuracy75.12 | 11 |