Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Weight Compression Latency on 4096x4096 Weight Tensor
Loading...
0.69
Encoding Latency (s)
GPTQ
-0.1508
5.5246
11.2
16.8754
Oct 13, 2025
Encoding Latency (s)
Decoding Latency (ms)
Synthesis Latency (ms)
Entropy Decoding Latency (ms)
Updated 6d ago
Evaluation Results
Method
Method
Links
Encoding Latency (s)
Decoding Latency (ms)
Synthesis Latency (ms)
Entropy Decoding Latency (ms)
GPTQ
Hardware=NVIDIA RTX 60...
2025.10
0.69
0.08
-
-
NWC
Hardware=NVIDIA RTX 60...
2025.10
1.64
1.17
1.07
0.1
QTIP
Hardware=NVIDIA RTX 60...
2025.10
19.84
0.52
-
-
QuIP#
Hardware=NVIDIA RTX 60...
2025.10
21.71
1.33
-
-
Feedback
Search any
task
Search any
task