
Long-Context LLM Inference Prefill Performance

Prefill Latency (ms): 0.62

Kascade

Updated 1mo ago

Evaluation Results

Method	Links
Kascade 2025.12		0.62	-	1.23	1.62
FA3 2025.12		0.76	-	-	-
Tilelang (TL) 2025.12		1	-	-	-
Kascade 2025.12		408.3	-	2.12	2.57
Kascade 2025.12		727.55	-	1.19	1.44
Anchor 2025.12		2,192.39	2.09	-	-