Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Inference Scheduling on LMSYS-Chat-1M

2.41Average Per-token Latency (s/token)

TIE

2.14323.94415.7457.5459Apr 1, 2026
Updated 17d ago

Evaluation Results

MethodLinks
2026.04
2.414.05204.03475.1
2026.04
4.347.03252.2507.35
2026.04
5.58.24273.3551.15
2026.04
9.0816.13319.51618.21