
PrefixLM Attention Performance on Llama2-7B (8k Context, 32/32 Heads)

163.7 TFLOPS (PrefixLM Attention)

CuBridge

Updated 2mo ago

Evaluation Results

Method	Links	PrefixLM Attention (TFLOPS)
CuBridge 2026.05		163.7
FlexAttention 2026.05		142.77
Qimeng Attn 2026.05		122.44
Torch 2026.05		14.56