| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Understanding | SEED-Bench | Accuracy81.7 | 516 | |
| Multimodal Understanding | SEED-Bench Image | Accuracy78 | 143 | |
| Multimodal Evaluation | SEED-Bench | Accuracy77.3 | 112 | |
| Visual Question Answering | SEED-Bench Image | Accuracy76.9 | 78 | |
| Multimodal Reasoning | SEED-Bench Image | Score78.6 | 60 | |
| Vision-Language Evaluation | SEED-Bench | Accuracy74.74 | 50 | |
| Multi-modal Understanding | SEED-Bench (overall) | Overall Score62.9 | 40 | |
| Multimodal Reasoning | SEED-BENCH | Accuracy69.9 | 36 | |
| Video Understanding | SEED-Bench Video Understanding | Accuracy74.12 | 33 | |
| Multimodal Understanding | SEED Bench Img | SEEDB Score77 | 32 | |
| Multimodal Evaluation | SEED-Bench 2 Plus | Accuracy71.67 | 29 | |
| Multimodal Evaluation | SEED-Bench | SEED-Bench Score66.8 | 28 | |
| Image Understanding | SEED-Bench image | Accuracy83.1 | 27 | |
| Video Reasoning | Seed-Bench R1 | Average Answer Score50.5 | 26 | |
| Multi-modal Benchmarking | SEED-Bench | Score60.5 | 25 | |
| Visual Understanding | SEED-Bench | SEED Score71.8 | 23 | |
| Visual Question Answering | SEED-Bench | Accuracy94.5 | 22 | |
| Visual Question Answering | SEED-Bench 2-Plus | Accuracy70.32 | 21 | |
| OCR-related Understanding Tasks | SEED-Bench-2-Plus | Accuracy76.5 | 21 | |
| Multimodal Question Answering | SEED-Bench | Accuracy (All)71.1 | 21 | |
| Benchmark Compression (Coreset selection) | SEED-Bench-2-Plus (full) | rho0.874 | 20 | |
| Multimodal Understanding | SEED-Bench SEED-I | Accuracy87.7 | 20 | |
| Multimodal Understanding | SEED-Bench Image (test) | Accuracy75.9 | 20 | |
| Multimodal Question Answering | SEED-Bench IMG | Accuracy71.56 | 18 | |
| Multimodal Large Language Model Evaluation | SEED-Bench | Accuracy71.56 | 18 |