Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Grounding & counting on BLINK
Loading...
72.1
Pass@1
Seed 1.5-VL
62.116
64.708
67.3
69.892
May 11, 2025
Pass@1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1
Seed 1.5-VL
thinking=true, decodin...
2025.05
72.1
Gemini 1.5 Pro
thinking=true, decodin...
2025.05
70.6
Seed 1.5-VL
thinking=false, decodi...
2025.05
70.2
OpenAI o1
thinking=true, decodin...
2025.05
66.1
GPT-4o
thinking=false, decodi...
2025.05
65.9
Qwen 2.5-VL 72B
thinking=false, decodi...
2025.05
64.4
Claude 3.7 Sonnet
thinking=true, decodin...
2025.05
62.5
Feedback
Search any
task
Search any
task