
Multi-path Speculative Decoding on Average across models (Qwen, Gemma, Llama) (test)

Throughput (tokens/s): 16.69

SpecInfer NDE

[Chart: throughput (tokens/s), y-axis from 14.7556 to 16.2622; Feb 19, 2026]
Updated 4d ago

Evaluation Results

Date       Throughput (tokens/s)
2026.02    16.69
2026.02    16.4
2026.02    16.18
2026.02    15.89
2026.02    15.05
2026.02    14.83