Large Language Model Inference on Llama 3.2 1B
[Chart: TPOTH over time. Series: TPOTH, TPOT (BF16), TPOT (INT4); dashed baseline at 1.94. Latest point: Mar 15, 2026. Updated 1mo ago.]
Evaluation Results
| Method | Backbone | Date | TPOTH | TPOT (BF16) | TPOT (INT4) |
|---|---|---|---|---|---|
| Baseline | Llama-3.2-1B | 2026.03 | 1.94 | 7.69 | 3.6 |
| Vocab. Trimming | Llama-3.2-1B | 2026.03 | 1.07 | 6.82 | 2.73 |
| SVDSoftmax | Llama-3.2-1B | 2026.03 | 0.61 | 6.36 | 2.27 |
| FlashHead | Llama-3.2-1B | 2026.03 | 0.4 | 6.15 | 2.06 |