HuggingFace Open LLM Leaderboard

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Large Language Model Evaluation | HuggingFace Open LLM Leaderboard | ARC: 65.96 | 21
Large Language Model Evaluation | HuggingFace Open LLM Leaderboard lm-eval-harness default (various) | HellaSwag: 84.34 | 18