Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on BBQ (ambiguous)
Loading...
95
Accuracy
gpt-5-thinking
87.72
89.61
91.5
93.39
Dec 19, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
gpt-5-thinking
web search=with web se...
2025.12
95
OpenAI o3
web search=with web se...
2025.12
94
gpt-5-thinking
web search=without web...
2025.12
93
gpt-5-main
web search=without browse
2025.12
93
GPT-4o
web search=without browse
2025.12
88
Feedback
Search any
task
Search any
task