Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Language Modeling Evaluation benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Language Modeling Evaluation
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Open LLM Leaderboard
Xwin-LM v0.1
ARC
70.22
14
4d ago
Bolmo 1B evaluation suite
BLT 1B
Overall Average Score
58.5
5
4d ago
ARC, HellaSwag, MMLU, TruthfulQA, WinoGrande
BOFT
ARC Accuracy
34.64
4
4d ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task