Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Modeling

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingLanguage Modeling
Perplexity8.005
26
Language ModelingLanguage Modeling Evaluation
Perplexity (PPL)1.71
16
Language ModelingLanguage Modeling Average
PPL5.67
12
Membership Inference AttackLanguage Modeling PII-annotated (train)
TPR @ 0.1% FPR21.5
9
Language ModelingLanguage Modeling (LM)
CE (128-255 tokens)2.69
7
Language ModelingLanguage Modeling (test)
PPL6.2
7
Showing 6 of 6 rows