Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-based Visual Question Answering on TextVQA (TQA)
Loading...
66.6
Score
VILA
24.272
35.261
46.25
57.239
Feb 15, 2026
Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Score
VILA
Pretrained-LLM=LLaMA2-...
2026.02
66.6
LLaVA-1.6(HD)
Pretrained-LLM=Vicuna-...
2026.02
64.9
Emu3-Chat
Model Category=Unified...
2026.02
64.7
mPLUG-Owl2
Pretrained-LLM=LLaMA2-...
2026.02
58.2
InternVL-Chat
Pretrained-LLM=Vicuna-...
2026.02
57
EVE-7B(HD)
Pretrained-LLM=Vicuna-...
2026.02
56.8
TokenFlow-L
Pretrained-LLM=Vicuna-...
2026.02
54.1
UniWeTok-Chat
Pretrained-LLM=Qwen3-8...
2026.02
53.7
UniTok
Pretrained-LLM=LLaMA-2...
2026.02
51.6
InstructBLIP
Pretrained-LLM=Vicuna-...
2026.02
50.1
VILA-U
Pretrained-LLM=LLaMA2-...
2026.02
48.3
LWM-7B
Model Category=Unified...
2026.02
47.7
LLaVA-1.5
Pretrained-LLM=Vicuna-...
2026.02
46.1
IDEFICS-9B
Pretrained-LLM=LLaMA-7...
2026.02
25.9
Feedback
Search any
task
Search any
task