
ST-VQA

Benchmarks

Task Name                             Dataset Name                 SOTA Result      Trend
Visual Question Answering             ST-VQA                       Accuracy 80.5    30
Scene Text Visual Question Answering  ST-VQA (val)                 ANLS 0.845       30
Scene Text Visual Question Answering  ST-VQA (test)                ANLS 0.799       21
Visual Question Answering             ST-VQA (test)                ANLS 75.8        15
Scene-Text Visual Question Answering  ST-VQA 1.0 (val)             ANLS 72.9        15
Scene-Text Visual Question Answering  ST-VQA 1.0 (test)            ANLS 71.8        14
Copyright tracking                    ST-VQA                       ASR 56           13
Scene Text Visual Question Answering  ST-VQA                       Accuracy 68.96   10
Scene Text Visual Question Answering  ST-VQA 8 (test)              ANLS 69.6        10
Copyright Tracking                    ST-VQA full (train)          ASR 77           8
Scene Text Visual Question Answering  ST-VQA 8 (val)               Accuracy 0.6164  8
Image question answering              ST-VQA public server (test)  Accuracy 75.8    3
Image question answering              ST-VQA public server         Accuracy -       0