Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
LMM Inference Efficiency on 8x RTX 4090 GPUs
Loading...
1.93
FLOPs (T)
LLaVA-FA-2B
1.8892
2.1646
2.44
2.7154
Jan 28, 2026
FLOPs (T)
Latency (ms)
Updated 4d ago
Evaluation Results
Method
Method
Links
FLOPs (T)
Latency (ms)
LLaVA-FA-2B
#I (Number of image to...
2026.01
1.93
30.6
DeepSeek-VL-1.3B
#I (Number of image to...
2026.01
2.01
35.1
Imp-2B
#I (Number of image to...
2026.01
2.14
36.2
MoE-LLaVA-2B
#I (Number of image to...
2026.01
2.48
39.3
Bunny-2B
#I (Number of image to...
2026.01
2.81
40.1
Mini-Gemini-2B
#I (Number of image to...
2026.01
2.95
41.5
Feedback
Search any
task
Search any
task