Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Masked Grouped Query Attention on T4 GPU Synthetic Performance Benchmark
Loading...
21.58
Performance (TFLOPS)
DeepSeek-V3 + Ours
7.6752
11.2851
14.895
18.5049
Jun 14, 2025
Performance (TFLOPS)
Updated 4d ago
Evaluation Results
Method
Method
Links
Performance (TFLOPS)
DeepSeek-V3 + Ours
Head Dimension=64, Seq...
2025.06
21.58
Flex Attention
Head Dimension=128, Se...
2025.06
15.13
cuDNN
Head Dimension=64, Seq...
2025.06
8.21
Feedback
Search any
task
Search any
task