Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Inference on Llama 3.2 Samsung Galaxy S25 Ultra 1B (test)

2,813.19Prefill Min Throughput (tokens/sec)

ET QNN

36.702757.5211,478.342,199.159May 5, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2026.05
2,813.192,976.7446.546.571,434
2026.05
2,277.92,392.3452.0352.721,229
2026.05
1,207.551,207.5566.2566.41676
2026.05
1,064.81,092.736.742728
2026.05
927.54930.9159.1959.4920
2026.05
649.75658.171.6971.97742
2026.05
524.59528.9365.8867.1821
2026.05
512.7537.865.666.5728
2026.05
490.63538.274.676.56659
2026.05
329.9374.423.625.6728
2026.05
284.12328.7660.3764.73769
2026.05
185.35190.2634.334.56612
2026.05
143.49149.6331.3633.03667