Share your thoughts, 1 month free Claude Pro on usSee more

vLLM Inference Performance on Qwen3-4B

1.101Model Load Time (s)

Safetensors

Updated 4mo ago

Evaluation Results

Method	Links
Safetensors 2025.12		1.101	123.277	126.13	11,075.59	7.58
CryptoTensors 2025.12		1.104	123.406	125.23	11,088.67	7.58
CryptoTensors 2025.12		11.918	123.467	126.84	11,608.82	7.58