Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Multi-modal Understanding benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Multi-modal Understanding
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
LLaVA-Bench Wild
GPT4V
LLaVA^W Score
91.2
86
4d ago
MMBench EN
InternVL2.5-78B
Accuracy
88.3
64
18d ago
MMBench EN
Gemini-2.5-Pro
Overall Score
86.3
55
1mo ago
LLaVA Multi-modal Evaluation Suite (GQA, MMB, MME, POPE, SQA, VQAv2, TextVQA, MMMU, SEED-I) v1.6 (test)
Vanilla 13B
Average Score
100
53
1mo ago
MMBench (dev)
Mini-Gemini-HD
Overall Score
80.6
40
1mo ago
SEED-Bench (overall)
CSR
Overall Score
62.9
40
1mo ago
MMVet
GPT-4o
Accuracy
76.2
40
18d ago
MMBench
Oryx-1.5
Mean Accuracy
86.3
32
1mo ago
MMBench V1.1
InternVL3.5-38B
Accuracy
87.03
22
1mo ago
SEED-IMG
Vanilla 7B
Accuracy
69.7
20
1mo ago
MM-Vet
LLaVA1.5-BPO
Rec
46.9
19
23d ago
MME
LLaVA-NeXT-7B
MME Score
1,842
17
25d ago
MuirBench
Vanilla
Score
59.6
16
1mo ago
MMMU Pro (Overall)
Vanilla
Score
38.3
16
1mo ago
TVL Benchmark
TVL-LLaMA (ViT-Base)
SSVTP Score
6.16
14
11d ago
SEED-Bench all (val)
LLaVA-Next
Accuracy
65.6
14
1mo ago
MMBench (test)
ScalSelect
MMBench Accuracy (En)
65.3
12
1mo ago
MMMU
iLLaVA
Accuracy (77.8% reduction ratio)
64.3
11
1mo ago
SEED-ALL
LLaVA-1.5 Vanilla 13B
Accuracy
61.6
10
1mo ago
MME v1.0 (test)
LLaVA-1.5-13B
MME^P Score
1,531.3
10
1mo ago
MME
Vanilla Attack
M(cl, nc)
90.85
9
1mo ago
General Multi-modal Evaluation Suite (VQAv2, GQA, VisWiz, ScienceQA-IMG, TextVQA, POPE, MMBench, MM-Vet) standard (test val)
MoE-LLaVA-Phi DeRS-SM
VQAv2 Accuracy
77.7
9
1mo ago
MMBench EN 1.1 (dev)
Pixtral-12B
Accuracy
84.31
8
1mo ago
Q-Bench (test)
mPLUG-Owl2
Overall Score
62.9
8
1mo ago
CV-Bench, BLINK, RealWorldQA, MathVista, MMStar, MMVet
LATTE
Average Score
53.8
8
1mo ago
Showing 25 of 32 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs