Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Attention Operator Throughput on Qwen2.5 72B (64 Q-heads/8 KV-heads/128 Head-dimension)

222.5Attention Throughput (TFLOPS)

flash-attn v2

0.25257.951115.65173.349Jun 14, 2025
Updated 27d ago

Evaluation Results

MethodLinks
2025.06
222.5----
2025.06
216----
2025.06
207.9----
2025.06
205.1----
2025.06
202.1----
2025.06
201.8----
2025.06
201.8----
2025.06
192.9----
2025.06
191.8----
2025.06
186.4----
2025.06
181.5----
2025.06
176.1----
2025.06
172.4----
2025.06
168.1----
2025.06
165.1----
2025.06
157.8----
2025.06
154.7----
2025.06
148.3----
2025.06
140.2----
2025.06
136.7----
2025.06
127.1----
2025.06
117.4----
2025.06
116.3----
2025.06
86.1----
2025.06
11.2----
2025.06
11.1----
2025.06
10.9----
2025.06
10.2----
2025.06
8.8----
2026.05
-28.419.9925.6729.77
2026.05
-248.79333.7382.13402.38
2026.05
-113.24163.79207.47232.61
2026.05
-304.35486.19599.14615.26
2026.05
-72.5282.388.8792.94
2026.05
-253.49346.74415443.19
2026.05
-521.33632.6622.52637.52