Share your thoughts, 1 month free Claude Pro on usSee more

vLLM Inference Performance on Qwen3-1.7B

0.54Model Load Time (s)

Safetensors

Updated 4mo ago

Evaluation Results

Method	Links
Safetensors 2025.12		0.54	101.911	198.45	10,547.66	3.25
CryptoTensors 2025.12		0.594	101.727	190.9	10,560.64	3.25
CryptoTensors 2025.12		5.65	102.108	195.19	10,666.21	3.25