Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Document Visual Question Answering on ArxiVQA
Loading...
60
Accuracy
GPT-4o+Ours (Spot-IT)
37.12
43.06
49
54.94
Aug 7, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4o+Ours (Spot-IT)
zero-shot=true
2025.08
60
Gem-1.5-Flash+Ours (Spot-IT)
zero-shot=true
2025.08
54
Gem-1.5-Flash
zero-shot=true
2025.08
53
GPT-4o
zero-shot=true
2025.08
52
GPT-4o-mini+Ours (Spot-IT)
zero-shot=true
2025.08
52
GPT-4o+CoT
zero-shot=true
2025.08
51
GPT-4o-mini
zero-shot=true
2025.08
47
Qwen2-7B
zero-shot=true
2025.08
44
Llama-3.2+Ours (Spot-IT)
zero-shot=true
2025.08
44
Qwen2-7B+Ours (Spot-IT)
zero-shot=true
2025.08
44
Llama-3.2+CoT
zero-shot=true
2025.08
42
GPT-4o+OCR
zero-shot=true
2025.08
41
Llama-3.2-VL-11B
zero-shot=true
2025.08
41
Llama-3.2+OCR
zero-shot=true
2025.08
38
Feedback
Search any
task
Search any
task