
Inference Efficiency on LLaMA 3.1 8B (8K context length)

Theoretical Compute (TFLOPs): 159

SpecKV

Mar 11, 2026
Updated 1mo ago

Evaluation Results

Method  Links
2026.03
1598133779.53411121
2026.03
1598133779.53411120.51
2026.03
137132581.0330211
2026.03
137445492234.59800509
137132581.0330210.88
2026.03
137445492234.59800509.38
2026.03
13613257-291-
2026.03
136132570.0131120
13613257-291-
2026.03
136132570.0131120.17