LLM Inference Performance on Context Length 200K
[Chart: Prefill Time (s) by method; series: Prefill Time (s), Decode Throughput (tok/s, per request), Decode Throughput (tok/s, full KV cache). Last updated Mar 12, 2026.]
Evaluation Results
| Method | Retention Ratio | Links | Prefill Time (s) | Decode Throughput (tok/s, per request) | Decode Throughput (tok/s, full KV cache) |
|---|---|---|---|---|---|
| IndexCache | 1/4 | 2026.03 | 10.7 | 86 | 297 |
| IndexCache | 1/2 | 2026.03 | 13.7 | 73 | 253 |
| DSA | Full | 2026.03 | 19.5 | 58 | 197 |
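The table above invites a direct comparison of the sparse IndexCache configurations against the full-cache DSA baseline. A minimal sketch (Python; the method labels and tuple layout are assumptions for illustration, the numbers are taken from the table) that computes the relative speedups:

```python
# Speedups of sparse-retention configurations over the DSA (Full) baseline,
# using the benchmark numbers from the table above.

# (prefill_time_s, decode_tok_s_per_request, decode_tok_s_full_kv)
results = {
    "IndexCache (1/4)": (10.7, 86, 297),
    "IndexCache (1/2)": (13.7, 73, 253),
    "DSA (Full)":       (19.5, 58, 197),
}

baseline = results["DSA (Full)"]

for method, (prefill, decode_req, decode_full) in results.items():
    # Prefill: lower time is better, so speedup = baseline / method.
    prefill_x = baseline[0] / prefill
    # Decode: higher throughput is better, so speedup = method / baseline.
    decode_req_x = decode_req / baseline[1]
    decode_full_x = decode_full / baseline[2]
    print(f"{method}: prefill {prefill_x:.2f}x, "
          f"decode/req {decode_req_x:.2f}x, "
          f"decode/full-cache {decode_full_x:.2f}x")
```

Under these numbers, retaining 1/4 of the cache yields roughly a 1.8x faster prefill and about 1.5x higher decode throughput than the full-cache baseline at 200K context.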