Share your thoughts, 1 month free Claude Pro on usSee more

LLM Inference on Aggregate Suite (Alpaca, CodeAlpaca, HumanEval, LiveCodeBench, Math500, MBPP, MT-Bench)

2.87Mean Speedup

DART

Updated 4mo ago

Evaluation Results

Method	Links
DART 2026.01		2.87	3.87
DART 2026.01		2.85	4.08
DART 2026.01		2.77	3.67
DART 2026.01		2.71	3.61
Hydra 2026.01		2.66	3.55
DART 2026.01		2.61	3.6
DART 2026.01		2.47	3.61
DART 2026.01		2.42	3.76
Medusa 2026.01		2.24	2.68
EAGLE3 2026.01		2.2	3.72
DART 2026.01		2.19	3.55
EAGLE3 2026.01		2.12	3.54
EAGLE3 2026.01		2.11	3.85
EAGLE3 2026.01		2.02	3.48
EAGLE3 2026.01		2.01	3.8
EAGLE3 2026.01		1.97	3.67
EAGLE3 2026.01		1.89	3.38
PLD 2026.01		1.74	1.92
Lookahead 2026.01		1.61	1.81
SPS 2026.01		1.09	3.45
SPS 2026.01		0.98	4.17