Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Matrix Multiplication Latency on Llama-3 70B
Loading...
293.82
Kernel Latency (µs)
CodeGEMM
214.1528
751.9064
1,289.66
1,827.4136
Dec 19, 2025
Kernel Latency (µs)
Updated 4d ago
Evaluation Results
Method
Method
Links
Kernel Latency (µs)
CodeGEMM
config=m1v4g128
2025.12
293.82
LUTGEMM
quantization=q2-g128
2025.12
299.9
CodeGEMM
config=m2v8g128
2025.12
373.49
QuIP#
config=e8p
2025.12
403.59
QTIP
config=r2
2025.12
477.04
AQLM
config=2x8
2025.12
674.67
cuBLAS
precision=fp16
2025.12
1,111.36
AQLM
config=1x16
2025.12
2,285.5
Feedback
Search any
task
Search any
task