
Masked Language Modeling on WikiText-103 (train)

Training Loss: 6.6188

GEM (N = 1)

[Chart: Training Loss; y-axis ticks 6.617808, 6.624504, 6.6312, 6.637896; x-axis label Apr 23, 2026]
Updated 1mo ago

Evaluation Results

Date      Training Loss   Method   Links
2026.04   6.6188
2026.04   6.6267
2026.04   6.628
2026.04   6.6341
2026.04   6.6413
2026.04   6.6436