LLM Inference on WebLLM Windows 11 RTX PRO 2000 Blackwell 8GB
[Chart: Decode Throughput (tok/s) for Qwen2.5-0.5B; 51.1 tok/s as of Feb 9, 2026. Prefill Throughput (tok/s) is available as an alternate series.]
Updated 13d ago
Evaluation Results
| Method | Configuration | Date | Decode Throughput (tok/s) | Prefill Throughput (tok/s) |
| --- | --- | --- | --- | --- |
| Qwen2.5-0.5B | Browser=Chrome 144, Ba... | 2026.02 | 51.1 | 650 |
| Qwen2.5-1.5B | Browser=Chrome 144, Ba... | 2026.02 | 45.7 | 350 |
| Qwen2.5-0.5B | Browser=Firefox 147, B... | 2026.02 | 9.1 | 73 |
| Qwen2.5-1.5B | Browser=Firefox 147, B... | 2026.02 | 9.1 | 55 |