Share your thoughts, 1 month free Claude Pro on usSee more

Matrix Multiplication Latency on Llama-3 8B

152.69Kernel-level latency (µs)

CodeGEMM

Updated 5mo ago

Evaluation Results

Method	Links
CodeGEMM 2025.12		152.69
LUTGEMM 2025.12		160.1
QuIP# 2025.12		162.63
CodeGEMM 2025.12		172.18
QTIP 2025.12		189.94
AQLM 2025.12		250.12
cuBLAS 2025.12		332.45
AQLM 2025.12		645.51