Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Scene Text-Centric Visual Question Answering on STVQA

0.622Accuracy

InternVL

0.000080.161540.3230.48446Jul 23, 2024
Updated 2d ago

Evaluation Results

MethodLinks
2024.07
0.622
2024.07
0.596
2024.07
0.547
2024.07
0.513
2024.07
0.498
2024.07
0.497
2024.07
0.472
2024.07
0.455
2024.07
0.392
2024.07
0.381
2024.07
0.352
2024.07
0.264
2024.07
0.201
2024.07
0.024