Inference Latency on ViT encoder prefill 729 tokens

Latency (ms): 9.7

W4A8

[Chart: Latency (ms) over time; Dec 27, 2024]
Updated 4d ago

Evaluation Results

Date       Latency (ms)    Method    Links
2024.12    9.7
2024.12    11.2