Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Inference Latency on VLM prefill 1024 tokens
Loading...
92.7
Latency (ms)
W4A8
92.048
96.449
100.85
105.251
Dec 27, 2024
Latency (ms)
Updated 4d ago
Evaluation Results
Method
Method
Links
Latency (ms)
W4A8
Model=LLaVA-onevision-...
2024.12
92.7
FP16
Model=LLaVA-onevision-...
2024.12
109
Feedback
Search any
task
Search any
task