
Language Modeling on WikiText-2, C4, and PTB Average

Average Perplexity: 26.68

Shortened LLaMA + Ghost Layer

May 15, 2026
Updated 16d ago

Evaluation Results

Date      Average Perplexity   Method   Links
2026.05   26.68
2026.05   39.8
2026.05   76.76
2026.05   87.61
2026.05   124
2026.05   138.14