Factual Question Answering on SimpleQA (test)
[Chart: Accuracy over time. Best result: HEART, 79.07 Accuracy (Sep 26, 2025)]
Updated 4d ago
Evaluation Results
Method           Method details             Links     Accuracy
HEART            Model=Gemini 3 Pro, Pr...  2025.09   79.07
Wait             Model=Gemini 3 Pro, Pr...  2025.09   73.61
Self Reflection  Model=Gemini 3 Pro, Pr...  2025.09   72.91
CoT              Model=Gemini 3 Pro, Pr...  2025.09   72.73
Vanilla          Model=Gemini 3 Pro, Pr...  2025.09   70.7
HEART            Model=Gemini 3 Flash,...   2025.09   59.85
CoT              Model=Gemini 3 Flash,...   2025.09   58.38
Wait             Model=Gemini 3 Flash,...   2025.09   55.1
Self Reflection  Model=Gemini 3 Flash,...   2025.09   53.1
Vanilla          Model=Gemini 3 Flash,...   2025.09   34.19