Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

vLLM Inference Performance on Qwen3-1.7B

0.54Model Load Time (s)

Safetensors

0.33561.71533.0954.4747Dec 4, 2025
Updated 4d ago

Evaluation Results

MethodLinks
0.54101.911198.4510,547.663.25
2025.12
0.594101.727190.910,560.643.25
2025.12
5.65102.108195.1910,666.213.25