PrefixLM Attention on Qwen2.5 72B (q=64, k=8, 1k Context)
[Chart: PrefixLM Attention Throughput (TFLOPS). Leading entry: CuBridge, 103.61 TFLOPS, May 6, 2026.]
Evaluation Results

Method          Details                     Date      PrefixLM Attention Throughput (TFLOPS)
CuBridge        Implementation=CuBridge     2026.05   103.61
FlexAttention   Implementation=PyTorch...   2026.05   99.82
Qimeng Attn     Implementation=Qimeng...    2026.05   72.37
Torch           Implementation=Standar...   2026.05   14.58
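
To make the gaps between the entries easier to read, the reported throughputs can be expressed as ratios to one of the methods. The sketch below is only an illustration: the numbers are copied from the results above, the choice of Torch as the baseline is an assumption made here, and the helper name `speedup_over` is hypothetical rather than anything the benchmark page defines.

```python
# Throughput values (TFLOPS) copied from the evaluation results above.
throughput_tflops = {
    "CuBridge": 103.61,
    "FlexAttention": 99.82,
    "Qimeng Attn": 72.37,
    "Torch": 14.58,
}


def speedup_over(baseline: str, results: dict[str, float]) -> dict[str, float]:
    """Return each method's throughput divided by the baseline's throughput.

    `baseline` is an illustrative choice, not something the benchmark defines.
    """
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


# Assumption: use the Torch entry as the reference point.
speedups = speedup_over("Torch", throughput_tflops)

# Print methods from fastest to slowest relative to the baseline.
for name, ratio in sorted(speedups.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<14} {ratio:.2f}x")
```

Passing a different method name as the first argument re-bases the comparison, for example `speedup_over("FlexAttention", throughput_tflops)`.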