Self-attention inverse temperature scaling analysis on SlimPajama

6 Tie Percentage

nanoGPT

2.6723.5364.45.264
May 12, 2026
Updated 20d ago

Evaluation Results

Method Links
2026.05
60.350.520.860.38
2026.05
2.80.40.540.90.44