Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Multimodal Question Answering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Multimodal Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
ScienceQA (test)
LLaVa + GPT-4 (judge)
Accuracy
92.53
65
4d ago
ScienceQA
CASHEW
Accuracy
97.8
41
15d ago
MMQA
Original Model
Accuracy
70.5
36
1mo ago
SUPERGLASSES 1.0 (Leaderboard)
SUPERLENS‡ (Ours)
Accuracy (Easy)
49.68
28
8d ago
MMBench en (test)
Vanilla
Accuracy
89
26
11d ago
MM-Vet
Qwen3-VL-4B
Total Score
68.3
24
1mo ago
MMBench CN
MergeMix
Accuracy
81.18
23
25d ago
Aggregate (Open-WikiTable, 2WikiMQA, InfoSeek, Dyn-VQA, TabFact, WebQA)
MoRE-7B
Average Score
55.93
22
11d ago
WebQA
MoRE-7B
F1-Recall
90.92
22
11d ago
TabFact
MoRE-3B
F1-Recall
52.6
22
11d ago
Dyn-VQA
R1-Distill-Qwen-32B
F1-Recall
39.98
22
11d ago
2WikiMQA
MoRE-7B
F1-Recall
55.47
22
11d ago
Open-WikiTable
MoRE-7B
F1 Recall
53.9
22
11d ago
ScienceQA v1.3 (test)
Full Precision (Baseline)
NAT Score
0.9019
21
1mo ago
SEED-Bench
QMoSLoRA
Accuracy (All)
71.1
21
1mo ago
MME-RealWorld-Lite 1.0 (test)
HART-7B
Perception (AD) Acc
57.7
19
1mo ago
9 Multimodal Benchmarks (VQAv2, GQA, VizWiz, SQA-IMG, TextVQA, POPE, MME, MMB, MMB-CN) (test val)
LLaVA-1.5-13B
VQAv2 Accuracy
80
15
1mo ago
Recap-COCO
TCAP (Ours)
CP
65.94
15
1mo ago
MM-Lifelong @day (test)
Human
Accuracy
99.2
14
1mo ago
MM-Lifelong week (test)
Human
Accuracy
95.6
14
1mo ago
MM-Lifelong (val@month)
Human
Accuracy
80.4
14
1mo ago
DesignQA
GPT-5-MCERF-Hybrid
Retrieval F1 (BoW)
95
13
4d ago
MMBench English
CoVFT
MMBen
70.45
13
25d ago
CaReSound
AudioTSLM (1.4B)
Yes/No Accuracy
93.5
13
1mo ago
MULTIMODALQADoc
FIF
EM
65.15
12
1mo ago
Showing 25 of 47 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs