Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Standard benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language Modeling and Question AnsweringStandard Benchmarks (ARC-E, ARC-C, BoolQ, HellaSwag, OBQA, PIQA, WinoGrande, MMLU, SciQ) (test)
ARC-E Acc (Norm)49.75
8
Text-to-imageStandard text-to-image benchmarks
CLIP Score97.28
6
Showing 2 of 2 rows