Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Masked Grouped Query Attention on T4 GPU Synthetic Performance Benchmark

21.58Performance (TFLOPS)

DeepSeek-V3 + Ours

7.675211.285114.89518.5049Jun 14, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.06
21.58
2025.06
15.13
2025.06
8.21