vLLM Inference Performance on Qwen3-32B

Model Load Time (s): 7.641

Safetensors

Updated 5mo ago

Evaluation Results

Method	Links	Model Load Time (s)
Safetensors 2025.12		7.641	218.374	43.09	12,752.61	61.57
CryptoTensors 2025.12		7.668	217.488	42.15	12,757.07	61.57
CryptoTensors 2025.12		111.85	218.584	42.93	12,795.65	61.57
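
As a rough way to compare the rows, the sketch below computes each row's load time relative to the first (Safetensors) row. It assumes the first numeric column is Model Load Time (s), which is inferred from the headline figure of 7.641 matching that cell. The other four numeric columns are unlabeled in the source and are left uninterpreted. The name `load_time_ratios` is hypothetical.

```python
# Rows copied from the results table: method, empty Links cell, five numeric columns.
ROWS = """\
Safetensors 2025.12\t\t7.641\t218.374\t43.09\t12,752.61\t61.57
CryptoTensors 2025.12\t\t7.668\t217.488\t42.15\t12,757.07\t61.57
CryptoTensors 2025.12\t\t111.85\t218.584\t42.93\t12,795.65\t61.57
"""


def load_time_ratios(text):
    """Return (method, load time / first row's load time) for each row."""
    rows = [line.split("\t") for line in text.strip().splitlines()]
    # Assumption: field index 2 is Model Load Time (s); index 1 is the empty Links cell.
    times = [(fields[0], float(fields[2])) for fields in rows]
    baseline = times[0][1]  # first row (Safetensors) is used as the baseline
    return [(name, seconds / baseline) for name, seconds in times]


if __name__ == "__main__":
    for name, ratio in load_time_ratios(ROWS):
        print(f"{name}: {ratio:.3f}x baseline load time")
```

The two CryptoTensors rows share the same label in the source, so the sketch keeps them in table order instead of keying on the method name.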