Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Interactive Question Answering on QuestBench Math
Loading...
53.9
Accuracy
OursI
16.564
26.257
35.95
45.643
Feb 4, 2026
Accuracy
Coverage
Uniqueness
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
Coverage
Uniqueness
OursI
Evaluation Protocol=Mu...
2026.02
53.9
83.5
9.7
OursO
Evaluation Protocol=Mu...
2026.02
38.8
68.2
35.4
Gemini
Evaluation Protocol=Mu...
2026.02
35.4
33.5
4.4
GPT
Evaluation Protocol=Mu...
2026.02
32
31.6
2.4
Qwen
Evaluation Protocol=Mu...
2026.02
32
37.9
0.5
FATA
Evaluation Protocol=Mu...
2026.02
32
41.9
1.5
AskToAct
Evaluation Protocol=Mu...
2026.02
18
27.2
5.3
Feedback
Search any
task
Search any
task