Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Factuality on SimpleQA
Loading...
35.3
Factuality Score
Kimi-K2
20.012
23.981
27.95
31.919
Feb 11, 2026
Feb 12, 2026
Feb 13, 2026
Feb 15, 2026
Feb 16, 2026
Feb 17, 2026
Feb 19, 2026
Factuality Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Factuality Score
Kimi-K2
Model Variant=Base, #...
2026.02
35.3
Step 3.5 Flash
Model Variant=Base, #...
2026.02
31.6
GLM-4.5
Model Variant=Base, #...
2026.02
30
DeepSeek V3.2
Model Variant=Exp Base...
2026.02
27
DeepSeek V3.1
Model Variant=Base, #...
2026.02
26.3
Trinity Large Preview
tuning=instruct-tuned
2026.02
23.92
MiMo-V2 Flash
Model Variant=Base, #...
2026.02
20.6
Feedback
Search any
task
Search any
task