Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-Head Attention Performance on NVIDIA L40S GPU (FP8, hd=128, Causal Mask)

257.9Performance (TFLOPS)

LLM-TL

223.476232.413241.35250.287Jun 14, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.06
257.9
2025.06
255.1
2025.06
254.6
2025.06
248.3
2025.06
241.1
2025.06
224.8