Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Accuracy Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Zero-shot ClassificationAccuracy Benchmarks (PIQA, HellaSwag, LAMBADA, ARC-e, ARC-c, SciQ, Race, MMLU) Zero-shot
PIQA77.7
39
Showing 1 of 1 rows