LLM Inference on WebLLM macOS Apple M2 16GB unified memory
[Chart: Decode Throughput (tok/s) for Qwen2.5-0.5B, 46.4 tok/s as of Feb 9, 2026. Selectable metrics: Decode Throughput (tok/s), Prefill Throughput (tok/s).]
Updated 13d ago
Evaluation Results
Method        Configuration              Date     Decode Throughput (tok/s)  Prefill Throughput (tok/s)
Qwen2.5-0.5B  Browser=Chrome 143, Ba...  2026.02  46.4                       510
Qwen2.5-0.5B  Browser=Safari 26.2, B...  2026.02  41.7                       257
Qwen2.5-1.5B  Browser=Chrome 143, Ba...  2026.02  36                         225
Qwen2.5-1.5B  Browser=Safari 26.2, B...  2026.02  29.7                       93
Qwen2.5-0.5B  Browser=Firefox 147, B...  2026.02  9.6                        77
Qwen2.5-1.5B  Browser=Firefox 147, B...  2026.02  9.6                        58