Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Inference Performance on Context Length 120K

5.66Prefill Time (s)

IndexCache

5.54366.32937.1157.9007Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
5.6688498
2026.03
6.5777431
2026.03
8.5763341