Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SEED-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingSEED-Bench
Accuracy81.7
343
Multimodal UnderstandingSEED-Bench Image
Accuracy78
121
Multimodal EvaluationSEED-Bench
Accuracy77.01
95
Visual Question AnsweringSEED-Bench Image
Accuracy76.9
64
Multi-modal UnderstandingSEED-Bench (overall)
Overall Score62.9
40
Vision-Language EvaluationSEED-Bench
Accuracy74.74
34
Video UnderstandingSEED-Bench Video Understanding
Accuracy74.12
33
Multimodal ReasoningSEED-Bench Image
Score74.2
32
Multimodal UnderstandingSEED Bench Img
SEEDB Score77
32
Multimodal EvaluationSEED-Bench 2 Plus
Accuracy71.67
29
Multimodal EvaluationSEED-Bench
SEED-Bench Score66.8
28
Image UnderstandingSEED-Bench image
Accuracy83.1
27
Video ReasoningSeed-Bench R1
Average Answer Score50.5
26
Multi-modal BenchmarkingSEED-Bench
Score60.5
25
Visual UnderstandingSEED-Bench
SEED Score71.8
23
Multimodal Question AnsweringSEED-Bench
Accuracy (All)71.1
21
Benchmark Compression (Coreset selection)SEED-Bench-2-Plus (full)
rho0.874
20
Multimodal UnderstandingSEED-Bench SEED-I
Accuracy87.7
20
Multimodal UnderstandingSEED-Bench Image (test)
Accuracy75.9
20
Visual PerceptionSEED-Bench Image
Accuracy73.7
18
Video ReasoningSEED-Bench L3 OOD R1
Accuracy49.3
16
Video ReasoningSEED-Bench L2 OOD R1
Accuracy51.6
16
Video ReasoningSEED-Bench-R1 L1 In-Dist.
Accuracy50.5
16
Multimodal UnderstandingSEED-Bench (val)
Accuracy58.8
16
Multimodal UnderstandingSEED-Bench 1
Image Accuracy73.5
15
Showing 25 of 58 rows