Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Search on HLE text
Loading...
45.8
Score
Gemini-3-Pro
7.32
17.31
27.3
37.29
Feb 15, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Gemini-3-Pro
Params=-, Evaluation T...
2026.02
45.8
GPT-5
Params=-
2026.02
41.7
Seed1.8
Params=-
2026.02
40.9
REDSearcher-MM-RL
Params=30B
2026.02
25.3
REDSearcher-MM-SFT
Params=30B
2026.02
24.4
Qwen3-VL Thinking
Params=235B
2026.02
14.5
Qwen3-VL Thinking
Params=30B
2026.02
8.8
Feedback
Search any
task
Search any
task