| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| High-resolution perception | HR Bench 4K | Overall Score 87.9 | 103 |
| High-resolution Visual Understanding | HR-Bench 8K | FSP 95 | 83 |
| High-Resolution Visual Perception | HR-Bench 4K | Accuracy 89.63 | 79 |
| High-Resolution Visual Perception | HR-Bench 8K | Accuracy 86.88 | 63 |
| High-resolution Visual Understanding | HR-Bench 4K | FSP 96.5 | 49 |
| Visual Reasoning | HR-Bench 8K | Overall Score 72.6 | 42 |
| Visual Reasoning | HR-Bench 4K | Overall Score 0.77 | 42 |
| High-Resolution Multimodal Reasoning | HR-Bench 4K | Overall Score 88.3 | 40 |
| High-Resolution Multimodal Reasoning | HR-Bench 8K | Overall Score 86.4 | 40 |
| High-resolution Perception | HR-Bench 8K | Score 82 | 32 |
| High-Resolution Visual Reasoning | HR-Bench | Score (4K) 81 | 30 |
| Visual Reasoning | HR-Bench 4K FSP | ACC 96.5 | 29 |
| Visual Search | HR-Bench 8K | Accuracy 77.8 | 29 |
| Visual Search | HR-Bench 4K | Accuracy 79.4 | 29 |
| High-Resolution Visual Reasoning | HR-Bench 8K | Accuracy 93.5 | 28 |
| High-Resolution Image Perception | HR-Bench 8K | Overall Score 77.6 | 26 |
| Fine-grained visual search | HR-Bench 8K | Overall Score 78 | 24 |
| Fine-grained visual understanding | HR-Bench 4K | Score 79 | 24 |
| Visual Grounded Reasoning | HR-Bench-8K | Overall Score 76.3 | 21 |
| Visual Grounded Reasoning | HR-Bench-4K | Overall Score 79.4 | 21 |
| High-Resolution Multimodal Reasoning | HR-Bench 8K FCP | Accuracy 77 | 19 |
| High-Resolution Multimodal Reasoning | HR-Bench 8K FSP | ACC 94.8 | 19 |
| High-Resolution Multimodal Reasoning | HR-Bench 4K FCP | ACC 78.3 | 19 |
| Visual Search and Perception-intensive Reasoning | HR-Bench 8K | Score 70 | 18 |
| Visual Search and Perception-intensive Reasoning | HR-Bench 4K | Overall Score 76.63 | 18 |