
AdaptEval

Benchmarks

Task Name                 | Dataset Name     | SOTA Result       | Trend
Language Modeling         | AdaptEval (test) | NLL 1.6598        | 32
Language Model Evaluation | AdaptEval        | ROUGE-Lsum 0.2733 | 14