LLM Inference on Phi4 Mini Samsung Galaxy S25 Ultra 3.8B (test)

Prefill Throughput (min, tokens/sec): 1,161.29

ET QNN

May 5, 2026
Updated 22d ago

Evaluation Results

Method  Links
2026.05  1,161.29  1,229.27  18.12  19.63  3,584
2026.05    343.57    372.6   18.75  19.67  2,088
2026.05    339.1     341.5   11.6   13.1   2,216
2026.05    195.7     202.6   20.5   22     2,201
2026.05    191.2     238.92  16.03  16.38  2,829
2026.05    151.2     153.3   22.1   22.9   2,216
2026.05    143.6     159.7   18.5   19.9   2,428
2026.05    114.4     130     12.1   12.5   2,216
2026.05     92.08    106.54  18.81  20.02  1,994
2026.05     60.02     65.72  17.37  18.29  2,337