Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Character-level Language Modeling benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Character-level Language Modeling
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
enwik8 (test)
Transformer-XL + RMS dynamic eval + decay
BPC
0.94
195
3mo ago
text8 (test)
Transformer-XL + RMS dynamic eval + decay
BPC
1.038
128
3mo ago
Penn Treebank (test)
3 layer LSTM
BPC
1.175
113
3mo ago
Shakespeare modern
SGHMC
Accuracy
55.63
48
3mo ago
Hutter Prize Wikipedia (test)
mLSTM
Bits/Char
1.08
28
3mo ago
Shakespeare (val)
Byz-NSGDM
Perplexity
10.08
27
2mo ago
Penn Treebank char-level (test)
dense-IndRNN
BPC
1.16
25
3mo ago
Tiny Shakespeare (val)
EGA-MORLET
Validation Loss
1.355
19
20h ago
Enwik8 (val)
Adaptive Transformer
BPC
1.04
17
25d ago
text8
GPT2
BPC
0.98
16
3mo ago
text8 (held-out 1M tokens)
SHARP
BPC
2.3
14
21h ago
text8 (dev)
Transformer + adaptive span
BPC
1.01
13
3mo ago
Shakespeare (train)
Adam
Accuracy
59.8
12
1mo ago
Shakespeare (test)
Adam
Accuracy
50.2
12
1mo ago
enwik8 (train)
RWKV-RNN
BPC
0.72
12
3mo ago
enwik8 (dev)
Adaptive
BPC
1
10
3mo ago
Penn Treebank character-level (val)
LayerNorm HM-LSTM
BPC
1.24
10
11d ago
text8 (most recent 1M tokens)
SHARP
BPC
2.23
7
21h ago
text8 100M regime Backward stream
Transformer (ctx=1024)
Backward BPC
2.17
7
21h ago
text8 100M regime (Current stream split)
Transformer (ctx=1024)
Current BPC
2.12
7
21h ago
text8 100M regime (Forward split)
Transformer (ctx=1024)
Forward BPC
2.19
7
21h ago
Shakespeare
MINGRU + αCMRU
Cross-entropy Loss
1.441
6
20d ago
Billion Words (test)
DGflow
JS Divergence (Context 4)
0.186
4
3mo ago
Character-level domain-switching dataset
β-MoE
BPC (all)
1.659
3
29d ago
WikiText-2 (val)
Transformer
PPL (Validation)
3.49
3
3mo ago
Showing 25 of 29 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs