Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Inference Latency on OPT-30B
Loading...
15.7
Latency (ms)
LUT-GEMM
14.708
21.404
28.1
34.796
Jun 4, 2025
Latency (ms)
Updated 4d ago
Evaluation Results
Method
Method
Links
Latency (ms)
LUT-GEMM
Kernel=LUT-GEMM, bit w...
2025.06
15.7
LUT-GEMM
Kernel=LUT-GEMM, bit w...
2025.06
16.7
LUT-GEMM
Kernel=LUT-GEMM, bit w...
2025.06
17.8
LUT-GEMM
Kernel=LUT-GEMM, bit w...
2025.06
18.5
cuBLAS
Kernel=cuBLAS, bit wid...
2025.06
40.5
Feedback
Search any
task
Search any
task