Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AdaptEval

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingAdaptEval (test)
NLL1.6598
32
Language Model EvaluationAdaptEval
ROUGE-Lsum0.2733
14
Showing 2 of 2 rows