
Multi-path Speculative Decoding on Average across models (Qwen, Gemma, Llama) (test)

Throughput (tokens/s): 16.69

SpecInfer NDE

[Plot: throughput (tokens/s); y-axis ticks 14.7556, 15.2578, 15.76, 16.2622; x-axis date Feb 19, 2026]
Updated 1mo ago

Evaluation Results

Date      Throughput (tokens/s)
2026.02   16.69
2026.02   16.4
2026.02   16.18
2026.02   15.89
2026.02   15.05
2026.02   14.83