Share your thoughts, 1 month free Claude Pro on usSee more

vLLM Inference Performance on Qwen3-0.6B

0.336Model Load Time (s)

Safetensors

Updated 4mo ago

Evaluation Results

Method	Links
Safetensors 2025.12		0.336	90.512	197.15	10,557.58	1.17
CryptoTensors 2025.12		0.34	91.1	195.3	10,558.89	1.17
CryptoTensors 2025.12		2.073	91.062	193.92	10,613.48	1.17