Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inference Latency on VLM prefill 1024 tokens

92.7Latency (ms)

W4A8

92.04896.449100.85105.251Dec 27, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.12
92.7
2024.12
109