Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OCR-VQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Question AnsweringOCR-VQA (test)
Accuracy77.8
77
OCR-based Visual Question AnsweringOCR-VQA
Accuracy65.6
35
Image question answeringOCR-VQA
Accuracy75.8
27
Visual Question AnsweringOCR-VQA (val)
Accuracy71.1
17
QA over IllustrationsOCR-VQA (test)
F1 Score74
5
Visual Question AnsweringOCR-VQA
Exact Match (EM)77.8
4
Showing 6 of 6 rows