Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on Probe 1
Loading...
99
Accuracy
Gemini 2.5 Pro
35.56
52.03
68.5
84.97
Jan 21, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini 2.5 Pro
2026.01
99
ChatGPT 5 Thinking
2026.01
98
Anthropic Claude-3-Haiku
2026.01
96
Naive Human
medical knowledge=none
2026.01
38
Feedback
Search any
task
Search any
task