Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SEED-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingSEED-Bench
Accuracy81.7
203
Multimodal UnderstandingSEED-Bench Image
Accuracy78
82
Multimodal EvaluationSEED-Bench
Accuracy77.01
80
Visual Question AnsweringSEED-Bench Image
Accuracy76.9
64
Multi-modal UnderstandingSEED-Bench (overall)
Overall Score62.9
40
Video UnderstandingSEED-Bench Video Understanding
Accuracy74.12
33
Multimodal ReasoningSEED-Bench Image
Score74.2
32
Multimodal UnderstandingSEED Bench Img
SEEDB Score77
32
Multimodal Question AnsweringSEED-Bench
Accuracy (All)71.1
21
Image UnderstandingSEED-Bench image
Accuracy76.55
20
Benchmark Compression (Coreset selection)SEED-Bench-2-Plus (full)
rho0.874
20
Multimodal UnderstandingSEED-Bench SEED-I
Accuracy87.7
20
Multimodal UnderstandingSEED-Bench Image (test)
Accuracy75.9
20
Visual PerceptionSEED-Bench Image
Accuracy73.7
18
Multimodal UnderstandingSEED-Bench (val)
Accuracy58.8
16
Multimodal EvaluationSEED-Bench
SEED-Bench Score66.8
15
Multimodal UnderstandingSEED-Bench 1
Image Accuracy73.5
15
Spatial UnderstandingSEED-Bench Spatial
Accuracy66.28
15
Multimodal UnderstandingSEED-Bench Image Part
Accuracy75.9
15
Multimodal ReasoningSEED-BENCH
Accuracy69.9
14
Multi-modal UnderstandingSEED-Bench all (val)
Accuracy65.6
14
General Visual Question AnsweringSEED-Bench IMG 2023a
Accuracy77
13
Visual ReasoningSEED-Bench 2-Plus
Accuracy72
11
Multimodal EvaluationSEED-Bench Image
Accuracy77.39
10
Comprehensive Multimodal EvaluationSEED-Bench Image
Accuracy77.3
10
Showing 25 of 44 rows