| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Perception | BLINK | Accuracy72.7 | 71 | |
| Visual Reasoning | BLINK | Accuracy81 | 50 | |
| Adversarial Attack | BLINK | Attack Success Rate (ASR)87.65 | 37 | |
| Multi-image Understanding | BLINK (val) | Score68 | 23 | |
| Multi-image reasoning | BLINK (val) | Accuracy52.6 | 21 | |
| Low-level Visual Reasoning | BLINK | Accuracy72.3 | 19 | |
| Visual Perception | Blink 41 (val) | Score87.4 | 19 | |
| Relative Depth Estimation | BLINK RelativeDepth (test) | Accuracy87.9 | 18 | |
| Spatial Reasoning | BLINK | Depth51.61 | 15 | |
| Multimodal Multi-choice | BLINK | Accuracy60 | 15 | |
| Spatial Reasoning | BLINK Multi-view (test) | Accuracy63.91 | 15 | |
| Interleaved Image Multimodal Understanding | BLINK | Score66.3 | 15 | |
| Spatial Reasoning | BLINK | Dep. Score84.68 | 14 | |
| Visual Reasoning | BLINK-J | Accuracy88 | 14 | |
| Multi-image Understanding | Blink (test) | Accuracy56.8 | 12 | |
| Classification | Blink | Accuracy99.7 | 11 | |
| Time Series Classification | Blink | Accuracy100 | 11 | |
| Multi-image understanding | BLINK multi-img | Accuracy55.6 | 11 | |
| Perception | BLINK (test) | Accuracy0.647 | 11 | |
| Multimodal Reasoning | BLINK | Accuracy55.92 | 11 | |
| Spatial Reasoning | BLINK-R | Accuracy87.1 | 10 | |
| Spatial Reasoning | BLINK-S | Accuracy90.21 | 10 | |
| Visual Reasoning | BLINK (test) | Rel Depth78.23 | 10 | |
| Visual Perception and Reasoning | BLINK | Accuracy63 | 9 | |
| Spatial Reasoning | Blink (ood) | Accuracy60.7 | 8 |