
Aggregated Benchmarks

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Quantization Performance Summary | Aggregated Benchmarks (HellaSwag, MMLU, Arc-C, MATH-500) | Average Score: 1.014 | 22
Overall Language Model Evaluation | Aggregated Benchmarks (STEM, Code, IF, General) | Average Score: 61.7 | 7