| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Perception | MMVP | Accuracy76.67 | 118 | |
| Visual Question Answering | MMVP | Accuracy80.33 | 82 | |
| Multimodal Visual Perception | MMVP | Accuracy85.33 | 72 | |
| Visual Reasoning | MMVP | Accuracy86.3 | 58 | |
| Vision Understanding | MMVP | Accuracy86.33 | 36 | |
| Visual Pattern Recognition | MMVP | Accuracy80.9 | 30 | |
| Detail Perception | MMVP VLM | Orientation and Direction Accuracy26.7 | 27 | |
| Multimodal Reasoning | MMVP | Accuracy76 | 26 | |
| Multimodal Visual Pattern Understanding | MMVP | Accuracy80.33 | 25 | |
| Fine-Grained Perception | MMVP | Accuracy74.67 | 24 | |
| Multimodal Visual Pattern Recognition | MMVP | MMVP Score75.3 | 23 | |
| Vision-centric Reasoning | MMVP | Accuracy86.3 | 21 | |
| Multimodal Reasoning | MMVP | Accuracy57.6 | 16 | |
| Spatial Understanding | MMVP | Accuracy77 | 15 | |
| Visual Perception | MMVP (test) | MMVP Score40.3 | 13 | |
| Hallucination | MMVP | Accuracy72.1 | 13 | |
| Image Understanding | MMVP | Score72.1 | 12 | |
| Visual Question Answering | MMVP | Sentence Faithfulness (Insertion)0.8052 | 12 | |
| Vision-Centric Evaluation | MMVP | Score65.2 | 12 | |
| Multimodal Visual Pattern Understanding | MMVP-VLM (test) | Orientation & Direction Acc0.267 | 12 | |
| Fine-grained Perception | MMVP (test) | MMVP Score75.33 | 11 | |
| Perception | MMVP (test) | Accuracy68.7 | 11 | |
| Fine-grained Visual Pattern Recognition | MMVP-VLM | Orientation Score60 | 11 | |
| Multimodal Multi-choice | MMVP | Accuracy75.3 | 10 | |
| Visual Question Answering | MMVP-VLM | Orientation & Direction Score26.7 | 10 |