Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-Head Attention Performance on A100 GPU (hd=64, sl=1024)
Loading...
175.6
TFLOPS
LLM-TL
162.184
165.667
169.15
172.633
Jun 14, 2025
TFLOPS
Latency
Updated 4d ago
Evaluation Results
Method
Method
Links
TFLOPS
Latency
LLM-TL
Implementation Library...
2025.06
175.6
10
Human Expert
Implementation Library...
2025.06
162.7
-
Feedback
Search any
task
Search any
task