Language Modeling on 100 Billion Word Google News Dataset (test)

38.2 Test Perplexity (0.1 epochs)

MoE-16384-h

[Chart: Test Perplexity (0.1 epochs) by date; data point at Jan 23, 2017]
Updated 4d ago

Evaluation Results

Date      Test Perplexity (0.1 epochs)   (unlabeled)
2017.01   38.2                           29.7
2017.01   38.2                           28.9
2017.01   38.9                           30.9
2017.01   39.8                           29.2
2017.01   40.3                           32.7
2017.01   42.8                           35.3
2017.01   48.5                           40.4
2017.01   54.5                           47
2017.01   67.1                           45.3