
Inference Latency on VLM prefill 512 tokens

Prefill Latency (ms): 59.4

W4A8

[Chart: Prefill Latency (ms) over time; y-axis ticks 59.024, 61.562, 64.1, 66.638; x-axis label Dec 27, 2024]

Evaluation Results

Method    Date       Prefill Latency (ms)    Links
          2024.12    59.4
          2024.12    68.8