Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Interactive Question Answering on AskOverconfidence
Loading...
84
Accuracy
Gemini
42.712
53.431
64.15
74.869
Feb 4, 2026
Accuracy
Coverage
Uniqueness
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
Coverage
Uniqueness
Gemini
Evaluation Protocol=Mu...
2026.02
84
74.9
2.5
GPT
Evaluation Protocol=Mu...
2026.02
73
60.2
1.5
OursI
Evaluation Protocol=Mu...
2026.02
62.8
64.1
21
OursO
Evaluation Protocol=Mu...
2026.02
54.8
89.4
46.3
Qwen
Evaluation Protocol=Mu...
2026.02
44.3
18.8
0.8
Feedback
Search any
task
Search any
task