Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

vLLM Inference Performance on Qwen3-4B

1.101Model Load Time (s)

Safetensors

0.668323.588916.50959.43009Dec 4, 2025
Updated 4d ago

Evaluation Results

MethodLinks
1.101123.277126.1311,075.597.58
2025.12
1.104123.406125.2311,088.677.58
2025.12
11.918123.467126.8411,608.827.58