Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

OWT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingOWT
Gen. PPL47.13
61
Text GenerationOWT
GPT2 Perplexity5.33
41
Conditional GenerationOWT
Perplexity (PPL)19.99
24
Language ModelingOWT L=1024 (test)
NELBO PPL20.96
11
Language ModelingOWT (val)
PPL17.5
7
Generation LatencyOWT L=8192
Generation Latency (s)54
5
Generation LatencyOWT L=2048
Sampling Latency (s)13.3
5
Language ModelingOWT L=10240 (test)
Gen. PPL23.4
2
Showing 8 of 8 rows