Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Inference Scheduling on Alpaca

0.52Average Per-token Latency (s/token)

TIE

0.361.442.523.6Apr 1, 2026
Updated 17d ago

Evaluation Results

MethodLinks
2026.04
0.520.9354.18123.02
2026.04
0.711.4762.93140.8
2026.04
0.831.7665.33142.17
2026.04
1.453.6283.65159.72
2026.04
1.542.81146.14359.08
2026.04
2.064.32171.14395.78
2026.04
2.365.14174.31394.1
2026.04
4.5211.36235.64444.52