Grouped-Query Attention (GQA) on NVIDIA A100 GPU Synthetic Performance

154.7 TFLOPS

DeepSeek-V3 + Ours

Jun 14, 2025
Updated 1mo ago

Evaluation Results

Method    Links
2025.06
154.7    35.16
2025.06
4.4    -