Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Modeling on Pile (test)
Loading...
59.4
Accuracy
Yuan3.0-1T Base
53.992
55.396
56.8
58.204
Jan 20, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Yuan3.0-1T Base
#Shots=-, Architecture...
2026.01
59.4
DeepSeek-V3-Base
#Shots=-, Architecture...
2026.01
54.8
LLaMA-3.1-405B Base
#Shots=-, Architecture...
2026.01
54.2
Feedback
Search any
task
Search any
task