vLLM Inference Performance on Qwen3-8B

Model Load Time (s): 1.732

Safetensors

Dec 4, 2025
Updated 4d ago

Evaluation Results

Method  Links
1.732122.039115.8711,095.9815.3
2025.12
1.806122.563115.6111,108.9815.3
2025.12
27.056123.134112.8511,233.9915.3