Our new X account is live! Follow @wizwand_team for updates
SOTA Visual Understanding benchmarks and papers with code | Wizwand
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| MM-Vet | GPT-4o | MM-Vet Score | 76.9 | 102 | 3d ago |
| MME | Qwen2-VL | MME Score | 2,321 | 37 | 3d ago |
| MME perception and cognition v1.0 | BAGEL | MME Perception Score | 1,687 | 24 | 3d ago |
| V* Bench | SenseNova-MARS-32B | Avg@8 EM | 0.942 | 18 | 3d ago |
| HR-Bench 8K | SenseNova-MARS-32B | Avg@8 Exact Match | 86.6 | 17 | 3d ago |
| HR-Bench 4K | SenseNova-MARS-32B | Avg@8 Exact Match | 90.2 | 17 | 3d ago |
| V* Bench, HR-Bench, and MME RealWorld | SenseNova-MARS-32B | Average Score | 85.9 | 13 | 3d ago |
| MME RealWorld | SenseNova-MARS-32B | Pass@1 Exact Match | 72.7 | 13 | 3d ago |
| JARVIS-VLA Benchmark 1.0 (test) | GPT-4o | Accuracy | 76.7 | 10 | 3d ago |
| MMBench-EN (full) | Bagel | Score | 85 | 9 | 3d ago |
| SEED-Bench | MMAR-7B | SEED Score | 68.63 | 9 | 3d ago |
| R-Bench (test) | Robust-R1 (SFT and RL) | MCQ (low) | 65.29 | 8 | 3d ago |
| MMT | LLaVA | Score | 1,075.5 | 8 | 3d ago |
| RealWorldQA | Robust-R1 (SFT) | Accuracy (Clean) | 68.23 | 7 | 3d ago |
| MMStar | Robust-R1 (SFT and RL) | Accuracy (Clean) | 56.86 | 7 | 3d ago |
| MMMB | Robust-R1 (SFT and RL) | Accuracy (Clean) | 81.41 | 7 | 3d ago |
| MathVista mini (full) | InternVL3.5 | Score | 77.1 | 7 | 3d ago |
| OCRBench (full) | VINO | Score | 881 | 6 | 3d ago |
| MVBench (full) | InternVL3.5 | Score | 71.2 | 6 | 3d ago |
| 18 Visual Understanding Assessments VLMEvalKit | MMAR-7B | AVE@18Und. | 48.25 | 6 | 3d ago |
| Visual Understanding (VQAv2, NLVR2, MME) (held-out) | SKILLRATER | Accuracy | 71.54 | 4 | 3d ago |
| CV-Bench | ERNIE 5.0-Base | Accuracy | 86.96 | 1 | 3d ago |
Showing 22 of 22 rows