Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scene Text-Centric Visual Question Answering on OCRVQA
Loading...
64.4
Accuracy
Monkey
-0.184
16.583
33.35
50.117
Jul 23, 2024
Accuracy
Updated 2d ago
Evaluation Results
Method
Method
Links
Accuracy
Monkey
Model Generation Capab...
2024.07
64.4
mPLUG-Owl2
Model Generation Capab...
2024.07
58.7
LLaVA1.5-7B
Model Generation Capab...
2024.07
58.1
TextHarmony-Chat
Model Generation Capab...
2024.07
57.6
DocPedia
Model Generation Capab...
2024.07
57.2
TextHarmony
Model Generation Capab...
2024.07
55.3
TextHarmony*
Model Generation Capab...
2024.07
51.9
InternLM-XComposer2
Model Generation Capab...
2024.07
49.6
UniDoc
Model Generation Capab...
2024.07
36.8
InternVL
Model Generation Capab...
2024.07
30.5
LLaVAR
Model Generation Capab...
2024.07
24
SEED-LLaMA-14B
Model Generation Capab...
2024.07
16.3
MM-Interleaved
Model Generation Capab...
2024.07
11.7
MiniGPT5
Model Generation Capab...
2024.07
2.3
Feedback
Search any
task
Search any
task