
Multi-Query Attention (MQA) Performance on NVIDIA A100 GPU

211.4 TFLOPS

cuDNN

[Chart: MQA throughput (TFLOPS) over time; y-axis ticks 154.304, 169.127, 183.95, 198.773; x-axis date Jun 14, 2025]
Updated 1mo ago

Evaluation Results

Method  Links
2025.06
211.4-
2025.06
156.55.36