
Text-based Visual Question Answering on TextVQA (VQA^T)

70.4 Accuracy

CogVLM-17B

[Chart: accuracy over time, May 28, 2024 to Dec 20, 2025]
Updated 3d ago

Evaluation Results

Date      Accuracy
2024.07   70.4
2024.05   68.8
2024.08   68
2024.05   67.2
2024.05   67.1
2024.05   67.1
2024.05   67
2024.07   66.6
2024.07   65.9
2024.05   65.5
2024.07   65.2
2024.07   65.1
2024.05   65
2024.07   64.4
2024.05   64.2
2024.07   63.8
2024.08   63.8
2024.05   63.8
2025.12   63.8
2024.05   63
2024.05   62.6
2024.08   61.5
2024.07   61.3
2024.08   61.3
2024.05   61.1
2024.08   58.7
2024.08   58.5
2024.07   58.2
2024.07   58.2
2024.08   58.2
2025.12   58.2
2024.08   57.1
2024.08   57.1
2024.08   57
2024.06   57
2024.08   56.9
2024.06   56.7
2024.05   51.8
2024.08   51.4
2024.07   50.7
2024.08   50.7
2025.12   50.7
2024.07   50.1
2024.08   50.1
2025.12   50.1
2024.05   49.6
2024.05   49
2025.12   48.58
2024.05   48.1
2024.05   47.6
2024.05   47.5
2025.12   47.14
2025.12   46.91
2025.12   46.58
2024.08   42.5
2025.12   42.5
2025.12   40.98
2025.12   40.75
2025.12   40.36
2025.12   40.28
2024.08   30.9
2025.12   30.9
2024.07   25.9
2024.08   25.9
2025.12   25.9