Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Masked Language Modeling on BERT Pretraining Corpus
Loading...
2.89
Perplexity
gMLP_xlarge
2.8344
3.2097
3.585
3.9603
May 17, 2021
Perplexity
Updated 4d ago
Evaluation Results
Method
Method
Links
Perplexity
gMLP_xlarge
Params (M)=941
2021.05
2.89
aMLP_large
Attn Size=128, Params...
2021.05
3.19
gMLP_large
Params (M)=365
2021.05
3.32
BERT_large (ours)
Attn Size=1024 (64 x 1...
2021.05
3.35
aMLP_base
Attn Size=64, Params (...
2021.05
3.95
BERT_base (ours)
Attn Size=768 (64 x 12...
2021.05
4.17
gMLP_base
Params (M)=130
2021.05
4.28
Feedback
Search any
task
Search any
task