Multi-Head Attention Performance on NVIDIA L40S GPU (FP8, hd=128, Causal Mask)
[Chart: Performance (TFLOPS) over time; peak 257.9 TFLOPS as of Jun 14, 2025]
Evaluation Results

| Method | Configuration             | Date    | Performance (TFLOPS) |
|--------|---------------------------|---------|----------------------|
| LLM-TL | Sequence Length=16k, H... | 2025.06 | 257.9                |
| LLM-TL | Sequence Length=8k, He... | 2025.06 | 255.1                |
| LLM-TL | Sequence Length=4k, He... | 2025.06 | 254.6                |
| LLM-TL | Sequence Length=2k, He... | 2025.06 | 248.3                |
| LLM-TL | Sequence Length=1k, He... | 2025.06 | 241.1                |
| LLM-TL | Sequence Length=512, H... | 2025.06 | 224.8                |
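The TFLOPS figures above rise with sequence length because longer sequences amortize fixed overheads over larger score matrices. As a point of reference, a common convention (used by FlashAttention-style benchmarks, though this page does not state its exact accounting) counts two GEMMs per head, QKᵀ and PV, and halves the total under a causal mask. A minimal sketch of that conversion from measured wall time to TFLOPS, with all parameter names hypothetical:

```python
def attention_tflops(batch: int, heads: int, seqlen: int,
                     head_dim: int, seconds: float,
                     causal: bool = True) -> float:
    """Convert a measured attention kernel time to TFLOPS.

    Counts two GEMMs per head (Q @ K^T and P @ V), each costing
    2 * seqlen^2 * head_dim FLOPs, for 4 * seqlen^2 * head_dim total.
    A causal mask computes only the lower triangle of the score
    matrix, so the count is halved.
    """
    flops = 4.0 * batch * heads * seqlen * seqlen * head_dim
    if causal:
        flops /= 2.0  # causal mask skips the upper triangle
    return flops / seconds / 1e12
```

For example, with hd=128 as in the benchmark header, a causal kernel over a 16k sequence that sustains the table's 257.9 TFLOPS would be doing `2 * batch * heads * 16384**2 * 128` FLOPs in the measured interval; the exact batch and head count for these runs is not given on the page.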