| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Overall Multimodal Evaluation | Qwen2-VL 7B Evaluation Suite | Relative Accuracy100 | 5 | |
| Keyword Matching Attack | Qwen2-VL-7B | KMR Alpha89.9 | 4 | |
| Energy consumption ranking | Qwen2-VL image workload | Pairwise Accuracy89.3 | 2 | |
| Energy consumption ranking | Qwen2-VL text workload | Pairwise Accuracy95.8 | 2 |