Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SeedBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multimodal UnderstandingSEEDBench2 Plus
Accuracy76.86
74
Comprehensive EvaluationSeedBench (all)
Score76.8
19
Complex Scene ReasoningSEEDBench
Accuracy (All)77.03
17
Visual PerceptionSeedBench-2-Plus
Accuracy72
15
Multidisciplinary KnowledgeSeedBench
Score76.4
15
Multimodal UnderstandingSeedBench image
Score76.7
12
Visual Question AnsweringSeedBench Avg
Accuracy77.96
11
Multi-discipline Video UnderstandingSeedBench video
Accuracy62.1
11
Multimodal UnderstandingSeedBench-2
Score81
10
Short-answer Visual Question AnsweringSEEDBench2+
Accuracy68.7
9
Visual Question AnsweringSEEDBench
BD-Rate-23.78
8
Multimodal EvaluationSeedBench (all)
Score61.82
8
VLM evaluationSEEDBench
BD-S Score12.88
6
General image understandingSEEDBench
Score72.67
4
Multimodal UnderstandingSeedBench
SeedBench Accuracy38.7
3
Showing 15 of 15 rows