Prominent Language Benchmarks

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Zero-shot Language Modeling | Prominent Language Benchmarks (ARC, BoolQ, HellaSwag, OpenBookQA, PIQA, SciQ, TriviaQA, Winogrande) | ARC-Challenge Acc: 28.16 | 5