Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Inference Latency on VLM prefill 1024 tokens
Loading...
92.7
Latency (ms)
W4A8
92.048
96.449
100.85
105.251
Dec 27, 2024
Latency (ms)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Latency (ms)
W4A8
Model=LLaVA-onevision-...
2024.12
92.7
FP16
Model=LLaVA-onevision-...
2024.12
109
Feedback
Search any
task
Search any
task