vLLM Inference Performance on Qwen3-32B

Model Load Time (s): 7.641

Safetensors

Dec 4, 2025
Updated 4d ago

Evaluation Results

Method    Links
7.641     218.374    43.09    12,752.61    61.57    2025.12
7.668     217.488    42.15    12,757.07    61.57    2025.12
111.85    218.584    42.93    12,795.65    61.57