Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
STEM & Reasoning on VisualPuzzle
Loading...
71.48
Accuracy
Gemini 3-Pro
53.5504
58.2052
62.86
67.5148
Feb 4, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini 3-Pro
2026.02
71.48
ERNIE 5.0
2026.02
64.82
Gemini 2.5-Pro
2026.02
61.51
GPT-5
tier=High
2026.02
57.75
Qwen3-VL
mode=Thinking
2026.02
57.01
ERNIE 5.0-Base
Model type=pre-trained
2026.02
54.24
Feedback
Search any
task
Search any
task