Multi-Head Attention Performance on A100 GPU (hd=64, sl=1024)

175.6TFLOPS

LLM-TL

Updated 5mo ago

Evaluation Results

Method	Links
LLM-TL 2025.06		175.6	10
Human Expert 2025.06		162.7	-