Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-Context LLM Inference Prefill Performance

0.62Prefill Latency (ms)

Kascade

-87.0508504.72711,096.5051,688.2829Dec 18, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
0.62-1.231.62
2025.12
0.76---
2025.12
1---
2025.12
408.3-2.122.57
2025.12
727.55-1.191.44
2025.12
2,192.392.09--