Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Visual Question Answering on RealWorldQA (Pass@1)
Loading...
78.4
Pass@1
Seed 1.5-VL
67.376
70.238
73.1
75.962
May 11, 2025
Pass@1
Updated 3d ago
Evaluation Results
Method
Method
Links
Pass@1
Seed 1.5-VL
thinking=true, decodin...
2025.05
78.4
Gemini 1.5 Pro
thinking=true, decodin...
2025.05
78
OpenAI o1
thinking=true, decodin...
2025.05
77.1
Seed 1.5-VL
thinking=false, decodi...
2025.05
77
GPT-4o
thinking=false, decodi...
2025.05
76.2
Qwen 2.5-VL 72B
thinking=false, decodi...
2025.05
75.7
Claude 3.7 Sonnet
thinking=true, decodin...
2025.05
67.8
Feedback
Search any
task
Search any
task