Share your thoughts, 1 month free Claude Pro on usSee more

LLM Decoding on ShareGPT

2.4Latency (ms/token)

Llama2-7B

Updated 4mo ago

Evaluation Results

Method	Links
Llama2-7B 2026.03		2.4	12.9
AdaFuse 2026.03		3.1	13.8
PESC 2026.03		8.5	13.1
MoRAL 2026.03		8.6	13.3
MOLA 2026.03		25.3	26.3