Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Inference Latency on ViT encoder prefill 729 tokens
Loading...
9.7
Latency (ms)
W4A8
9.64
10.045
10.45
10.855
Dec 27, 2024
Latency (ms)
Updated 4d ago
Evaluation Results
Method
Method
Links
Latency (ms)
W4A8
Model=LLaVA-onevision-...
2024.12
9.7
FP16
Model=LLaVA-onevision-...
2024.12
11.2
Feedback
Search any
task
Search any
task