Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

One Billion Word

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingOne Billion Word (OBW) 100% train set (test)
PPL39.15
11
Language ModelingOne Billion Word (OBW) 1% train set (test)
PPL63.83
11
Language ModelingOne Billion Word corpus 1M sentences
Perplexity71.8
5
Text GenerationOne Billion Word (test)
4-gram JSD0.22
2
Language ModelingOne Billion Word Benchmark (train)
Perplexity36.39
2
Language GenerationOne Billion Word 6-gram
JSD0.74
2
Language GenerationOne Billion Word 4-gram
JSD0.35
2
Showing 7 of 7 rows