Quantization on LLAMA v1 (train)
[Chart: Processing Time (hr) by method; highlighted point: OPTQ, 0.25 hr, Feb 14, 2024]
Evaluation Results
| Method | Target | Date | Processing Time (hr) |
|---|---|---|---|
| OPTQ | Target=layer-wise reco... | 2024.02 | 0.25 |
| OPTQ | Target=layer-wise reco... | 2024.02 | 0.45 |
| OPTQ | Target=layer-wise reco... | 2024.02 | 1.08 |
| Z-FOLD | Target=layer-wise reco... | 2024.02 | 1.13 |
| OmniQuant | Target=attention-wise... | 2024.02 | 2.37 |
| Z-FOLD | Target=layer-wise reco... | 2024.02 | 2.48 |
| OmniQuant | Target=attention-wise... | 2024.02 | 4.2 |
| aespa | Target=attention-wise... | 2024.02 | 6.84 |
| OmniQuant | Target=attention-wise... | 2024.02 | 9.84 |
| Z-FOLD | Target=layer-wise reco... | 2024.02 | 10.51 |
| aespa | Target=attention-wise... | 2024.02 | 15.89 |
| AffineQuant | Target=attention-wise... | 2024.02 | 18.76 |
| AffineQuant | Target=attention-wise... | 2024.02 | 18.76 |
| AffineQuant | Target=attention-wise... | 2024.02 | 47.84 |
| aespa | Target=attention-wise... | 2024.02 | 53.69 |
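Each method appears several times above (one row per run configuration), so a quick way to compare methods is by their fastest run. Below is a minimal sketch that summarizes the rows above; `fastest_run_per_method` is a hypothetical helper written for this page, not part of the benchmark site, and the numbers are transcribed directly from the table.

```python
# Processing times (hr) per run, transcribed from the evaluation table above.
RESULTS = [
    ("OPTQ", 0.25), ("OPTQ", 0.45), ("OPTQ", 1.08),
    ("Z-FOLD", 1.13), ("OmniQuant", 2.37), ("Z-FOLD", 2.48),
    ("OmniQuant", 4.2), ("aespa", 6.84), ("OmniQuant", 9.84),
    ("Z-FOLD", 10.51), ("aespa", 15.89), ("AffineQuant", 18.76),
    ("AffineQuant", 18.76), ("AffineQuant", 47.84), ("aespa", 53.69),
]

def fastest_run_per_method(results):
    """Return {method: minimum processing time in hours}, sorted ascending."""
    best = {}
    for method, hours in results:
        # Keep the smallest time seen for each method.
        best[method] = min(hours, best.get(method, float("inf")))
    return dict(sorted(best.items(), key=lambda kv: kv[1]))
```

On the rows above this ranks OPTQ fastest (0.25 hr) and AffineQuant slowest among best runs (18.76 hr).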