Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Accuracy-based tasks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Zero-shot downstream reasoning and question answeringAccuracy-based tasks (ARC-E, PIQA, SciQ, HellaSwag, LAMBADA, WinoGrande, BoolQ) zero-shot
ARC-E51.22
2
Showing 1 of 1 rows