Sparse Decoding (MHA) on Synthetic 128K Context (H100, FP16)

[Chart: FlashInfer Latency (ms) over time. Plotted point: FlashInfer, 0.7 ms, May 22, 2026]

Evaluation Results

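Date    | FlashInfer Latency (ms) |      |      |      |      |      |
2026.05 | 0.7                     | -    | -    | -    | -    | -    | -
2026.05 | 2.79                    | -    | -    | -    | -    | -    | -
2026.05 | 5.61                    | -    | -    | -    | -    | -    | -
2026.05 | 11.18                   | -    | -    | -    | -    | -    | -
2026.05 | -                       | 0.91 | 1.62 | 2.18 | 2.68 | 3.14 | 3.37
2026.05 | -                       | 1.02 | 1.86 | 2.56 | 3.17 | 3.74 | 4
2026.05 | -                       | 1.12 | 2    | 2.71 | 3.32 | 3.86 | 4.11
2026.05 | -                       | 1.22 | 2.13 | 2.84 | 3.43 | 3.94 | 4.17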