Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Document and chart understanding on CharXiv RQ
Loading...
69.9
Pass@1
Gemini 1.5 Pro
48.892
54.346
59.8
65.254
May 11, 2025
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Gemini 1.5 Pro
thinking=true, decodin...
2025.05
69.9
Claude 3.7 Sonnet
thinking=true, decodin...
2025.05
68.9
Seed 1.5-VL
thinking=true, decodin...
2025.05
60.2
Seed 1.5-VL
thinking=false, decodi...
2025.05
59.8
OpenAI o1
thinking=true, decodin...
2025.05
55.1
GPT-4o
thinking=false, decodi...
2025.05
52
Qwen 2.5-VL 72B
thinking=false, decodi...
2025.05
49.7
Feedback
Search any
task
Search any
task