Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SlideVQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Document Visual Question AnsweringSlideVQA
Accuracy0.849
53
Visual Question AnsweringSlideVQA
Overall Accuracy78.74
46
Slide Question AnsweringSlideVQA
Overall Score79.5
29
Document Question AnsweringSlideVQA (test)
EM63.2
19
Visual Document RetrievalSlideVQA
Recall@1097.87
13
Visual Document RetrievalSlideVQA
NDCG@582.5
13
Document UnderstandingSlideVQA
F1 Score77.1
8
Evidence SelectionSlideVQA
ES EM97.7
6
Multimodal Document RetrievalSlideVQA
MRR93.91
6
RetrievalSlideVQA
R@392.81
6
Local Question AnsweringSlideVQA 2k
Accuracy64.85
5
Visual Question AnsweringSlideVQA (test)
Overall Accuracy90.5
4
Document Question AnsweringSlideVQA
TFLOPS (Encoder)9.9
2
Showing 13 of 13 rows