Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TextVQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Question AnsweringTextVQA
Accuracy85.4
1,285
Text-based Visual Question AnsweringTextVQA
Accuracy88.5
807
Visual Question AnsweringTextVQA (val)
VQA Score7,040
343
Text-based Visual Question AnsweringTextVQA (val)
Accuracy86.5
262
Visual Question AnsweringTextVQA (test)
Accuracy81.1
124
Text-based Visual Question AnsweringTextVQA
Score67.32
112
Text-based Visual Question AnsweringTextVQA (VQA^T)
Accuracy78
96
Visual Question AnsweringTextVQA
Accuracy88.7
94
Visual Question AnsweringTextVQA v1.0 (val)
Accuracy85.5
84
Visual Question AnsweringTextVQA
Accuracy97.15
79
Visual Question AnsweringTextVQA
TextVQA Accuracy80.12
67
OCR-related Understanding TasksTextVQA (val)
Accuracy86.62
57
OCR Visual Question AnsweringTextVQA
Accuracy83.69
45
Image UnderstandingTextVQA
Accuracy725
40
Visual Question AnsweringTextVQA v1.0 (test)
Accuracy86.79
40
Visual Question AnsweringTextVQA
Clean Accuracy70.3
37
Visual Question AnsweringTextVQA
VQA Accuracy39
33
Refusal Rate EvaluationTextVQA
Refusal Rate70
30
Text-based Visual Question AnsweringTextVQA VQAT
Accuracy69.74
30
Visual Question AnsweringTextVQA (test val)
Accuracy58.2
30
Visual Question AnsweringTextVQA
Accuracy81.57
26
Visual Question AnsweringTextVQA
Exact Match (EM)82.74
23
Text-based Visual Question AnsweringTextVQA (test)
Accuracy83.8
23
Visual Question AnsweringTextVQA 130 (val)
Score86.5
23
Text-based Visual Question AnsweringTextVQA 52
Accuracy63.8
23
Showing 25 of 64 rows