Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
CPU Inference Performance Evaluation on Qwen3-30B-A3B
Loading...
16.2
Memory Usage (GB)
Llama.cpp (CPU)
14.572
25.561
36.55
47.539
Apr 12, 2026
Memory Usage (GB)
Latency (s)
Updated 5d ago
Evaluation Results
Method
Method
Links
Memory Usage (GB)
Latency (s)
Llama.cpp (CPU)
Bit Width=A8W4, Hardwa...
2026.04
16.2
20.1
CodeQuant (CPU)
Bit Width=A8W4, Hardwa...
2026.04
16.5
15.9
Llama.cpp (CPU)
Bit Width=BF16, Hardwa...
2026.04
56.9
66.1
Feedback
Search any
task
Search any
task