VQA-RAD

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Medical Visual Question Answering | VQA-RAD | Accuracy 80.4 | 228 |
| Medical Visual Question Answering | VQA-RAD (Closed) | ECE 1.3 | 96 |
| Visual Question Answering | VQA-RAD (Open) | AUROC 0.819 | 96 |
| Visual Question Answering | VQA-RAD Closed | AUROC 70.2 | 96 |
| Visual Question Answering | VQA-RAD | Closed Accuracy 86.8 | 64 |
| Hallucination Detection | VQA-RAD (All) | AUC 78.23 | 57 |
| Hallucination Detection | VQA-RAD Open-Ended | AUC 83.13 | 57 |
| Medical Visual Question Answering | VQA-RAD (test) | Closed Accuracy 87.9 | 50 |
| Visual Question Answering | VQA-RAD (test) | Overall Accuracy 90.4 | 48 |
| Medical Visual Question Answering | VQA-RAD closed-end | Accuracy 84.86 | 45 |
| Multimodal Medical Reasoning | VQA-RAD | Accuracy (%) 80.45 | 36 |
| Medical Visual Question Answering | VQA-RAD Open | Accuracy 61.5 | 26 |
| Visual Question Answering | VQA-RAD open-ended | Exact Match (EM) 29 | 25 |
| Visual Question Answering | VQA-RAD Open | Token Recall 73.7 | 16 |
| Visual Question Answering (Closed-ended) | VQA-RAD closed-ended | Accuracy 82.5 | 12 |
| Multi-modal Question Answering | VQA-RAD | Accuracy 87.1 | 12 |
| Visual Question Answering | VQA-RAD Closed | Exact Match 88 | 11 |
| Medical Visual Question Answering | VQA-RAD cross-domain | Accuracy 0.789 | 10 |
| Medical Visual Question Answering | VQA-RAD (in-domain) | Accuracy 83.3 | 10 |
| Question Selection | VQA-RAD (test) | Risk 60.6 | 7 |
| Visual Question Answering | VQA-RAD Closed | Accuracy 88.2 | 7 |
| Medical Visual Question Answering | VQA-RAD | BLEU-1 0.695 | 7 |
| Medical Visual Question Answering | VQA-Rad 2018 | Accuracy 87.05 | 7 |
| Medical Visual Question Answering | VQA-RAD | L-VASE 94.4 | 6 |
| Reasoning | VQA-RAD | Correctness 47.34 | 6 |
Showing 25 of 30 rows