Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Inference Latency Measurement on H100 GPU (16k inputs, test)
Loading...
0.052
Latency (s)
Lemon
0.051032
0.057566
0.0641
0.070634
Dec 14, 2025
Latency (s)
Updated 4d ago
Evaluation Results
Method
Method
Links
Latency (s)
Lemon
Backbone Size=7B
2025.12
0.052
Qwen2.5-VL
Backbone Size=7B
2025.12
0.0588
LLaVA-1.5
Backbone Size=13B
2025.12
0.0672
ShapeLLM
Backbone Size=7B
2025.12
0.0745
3D-LLM
Backbone Size=7B
2025.12
0.0762
Feedback
Search any
task
Search any
task