Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Grouped-Query Attention (GQA) on NVIDIA A100 GPU Synthetic Performance
Loading...
154.7
TFLOPS
DeepSeek-V3 + Ours
-1.612
38.969
79.55
120.131
Jun 14, 2025
TFLOPS
Speedup
Updated 4d ago
Evaluation Results
Method
Method
Links
TFLOPS
Speedup
DeepSeek-V3 + Ours
Head Dimension=64, Seq...
2025.06
154.7
35.16
DeepSeek-V3
Head Dimension=64, Seq...
2025.06
4.4
-
Feedback
Search any
task
Search any
task