Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Evaluation on NaturalQuestions (Accuracy)
Loading...
0.433
Accuracy
Yuan3.0-1T Base
0.39868
0.40759
0.4165
0.42541
Jan 20, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Yuan3.0-1T Base
#Shots=1-shot, Archite...
2026.01
0.433
LLaMA-3.1-405B Base
#Shots=5-shot, Archite...
2026.01
0.415
DeepSeek-V3-Base
#Shots=5-shot, Archite...
2026.01
0.4
Feedback
Search any
task
Search any
task