| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Instruction Following | MIA-Bench | Score8.86 | 12 | |
| Membership Inference | MIA-Bench 5% Forget Set | Average Performance69.6 | 12 | |
| Multimodal Reasoning | MIA-Bench | Length (tokens)4,329.3 | 9 | |
| Alignment | MIA-Bench | Accuracy93.3 | 7 | |
| Multi-modal human-preference alignment | MIA-Bench | Score89.6 | 6 | |
| Visual Question Answering | MIA-Bench | Accuracy68.8 | 4 |