
Inference Latency Measurement on H100 GPU (16k inputs, test)

Best latency: 0.052 s (Lemon)

Updated 5mo ago

Evaluation Results

Method	Links	Latency (s)
Lemon 2025.12		0.052
Qwen2.5-VL 2025.12		0.0588
LLaVA-1.5 2025.12		0.0672
ShapeLLM 2025.12		0.0745
3D-LLM 2025.12		0.0762
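
To make the gaps between entries easier to read, the latencies can be expressed relative to the fastest method. The sketch below is only an illustration: the numbers are copied from the table above, while the helper name `relative_slowdown` and the choice of Lemon as the baseline are assumptions made for this example, not part of the benchmark.

```python
# Latency values in seconds, copied from the results table above.
latencies = {
    "Lemon": 0.052,
    "Qwen2.5-VL": 0.0588,
    "LLaVA-1.5": 0.0672,
    "ShapeLLM": 0.0745,
    "3D-LLM": 0.0762,
}


def relative_slowdown(values, baseline):
    """Return each method's latency divided by the baseline's latency.

    Hypothetical helper for illustration; a ratio of 1.0 means
    "as fast as the baseline", larger means slower.
    """
    base = values[baseline]
    return {name: value / base for name, value in values.items()}


ratios = relative_slowdown(latencies, "Lemon")

# Print methods from fastest to slowest with their ratio to the baseline.
for name, ratio in sorted(ratios.items(), key=lambda kv: kv[1]):
    print(f"{name:12s} {latencies[name]:.4f} s  {ratio:.2f}x")
```

A ratio is used here rather than an absolute difference because the table reports a single hardware and input setting (H100, 16k inputs), so relative gaps are the more portable comparison.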