Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Language Model Inference on Synthetic heavy-tail workload Pareto distribution

17.02Throughput (req/s)

BatchLLM

4.78967.964811.1414.3152Nov 29, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.11
17.023.2
2024.11
5.261