Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language tasks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Few-shot Language UnderstandingLanguage tasks (SST-2, SST-5, SNLI, MNLI, RTE, TREC) few-shot k=16
Accuracy (SST-2)91.8
5
Showing 1 of 1 rows