Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Document and chart understanding on AI2D (Pass@1)
Loading...
88.7
Pass@1
Qwen 2.5-VL 72B
79.132
81.616
84.1
86.584
May 11, 2025
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Qwen 2.5-VL 72B
thinking=false, decodi...
2025.05
88.7
Seed 1.5-VL
thinking=false, decodi...
2025.05
88.5
Gemini 1.5 Pro
thinking=true, decodin...
2025.05
88.4
Seed 1.5-VL
thinking=true, decodin...
2025.05
87.3
GPT-4o
thinking=false, decodi...
2025.05
84.9
Claude 3.7 Sonnet
thinking=true, decodin...
2025.05
82.1
OpenAI o1
thinking=true, decodin...
2025.05
79.5
Feedback
Search any
task
Search any
task