SOTA Image Understanding benchmarks and papers with code | Wizwand
Image Understanding
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| MMStar | GPT-4o-2024-11-20 | Score | 65.1 | 54 | 2d ago |
| TextVQA | MiniCPM-LLaMA3-V 2.5 | Accuracy | 725 | 40 | 11d ago |
| MMBench CN | Vanilla | Accuracy | 60.6 | 39 | 1mo ago |
| MME | Top-k Routing | Score | 2,312 | 39 | 1mo ago |
| Image Understanding Suite (TextVQA, ChartQA, MMStar, MMBench, MMVet, MME, RealWorldQA, COCO) | InternVL-3.5-30B-A3B-HF | TextVQA Score | 85.76 | 34 | 1mo ago |
| SEED-Bench image | Lumina-DiMOO | Accuracy | 83.1 | 27 | 1mo ago |
| MMBench | Top-k Routing | Score | 83.81 | 23 | 11d ago |
| Image benchmarks Aggregate | MRoPE-I | Overall Score | 64.82 | 21 | 11d ago |
| COCO | Top-k Routing | Score | 69.3 | 16 | 1mo ago |
| MMSI-Bench 68 (test) | Gemini-2.0-Flash | Average Score | 69.4 | 12 | 1mo ago |
| 3DSRBench real 45 (test) | PolyV | Average Score | 63.4 | 12 | 1mo ago |
| MMStar 11 (test) | PolyV | Average Score | 71.4 | 11 | 1mo ago |
| MME-P | BAGEL-7B | MME-P Score | 1,687 | 11 | 1mo ago |
| Image Understanding benchmarks GQA, MME, POPE, VQA^T, MMB, SQA | Vanilla | GQA Score | 62.2 | 10 | 1mo ago |
| MMBench v1.1 (test) | BAGEL | MMB^i Score | 85 | 10 | 1mo ago |
| MMEB Image v2 | MetaEmbed | Accuracy (CLS) | 68.1 | 9 | 1mo ago |
| LLaVABenchWilder | LLaVA-OneVision-7B w/ VL-MDR | Score | 77.9 | 8 | 10d ago |
| LLaVABench | LLaVA-OneVision-7B w/ Skywork-VL | Score | 101.9 | 8 | 10d ago |
| MMIU | VideoChat-TPO | MMIU Score | 40.2 | 7 | 1mo ago |
| SEED-Bench 2 | VideoChat-TPO | SEED-2 Image Score | 67.3 | 6 | 1mo ago |
| Jetson Orin Nano Performance Benchmark | Mobile-O-0.5B | Vision Encoding Time (ms) | 88 | 4 | 1mo ago |
| MacBook M2 Pro | Mobile-O-0.5B | Vision Encoding Time (ms) | 56 | 4 | 1mo ago |
| COCO GPT-based evaluation | Chat-UniVi | Conversation Score | 84.1 | 4 | 1mo ago |
| MathVerse | Sparse-LaViDa | Accuracy | 37.9 | 2 | 1mo ago |
| MathVista | LaViDa-O | Accuracy | 56.9 | 2 | 1mo ago |
Showing 25 of 28 rows