Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Masked Multi-Head Attention on T4 GPU Synthetic Performance Benchmark

19.07Performance (TFLOPS)

DeepSeek-V3 + Ours

1.58766.126310.66515.2037Jun 14, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.06
19.07
2025.06
13.45
2025.06
12.81
2025.06
8.11
2025.06
2.26