Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Visual Question Answering on GQA
Loading...
45.5
Accuracy
GenLIP
39.572
41.111
42.65
44.189
May 1, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
GenLIP
Arch=g/16, Data=8.0B
2026.05
45.5
SigLIP2
Arch=g/16, Data=40.0B
2026.05
45.2
GenLIP
Arch=So/16, Data=8.0B
2026.05
44
AIMv2
Arch=L/14, Data=12.0B
2026.05
43.9
SigLIP2
Arch=So/16, Data=40.0B
2026.05
43.5
OVision2
Arch=L/16, Data=12.8B
2026.05
42.7
SigLIP2
Arch=L/16, Data=40.0B
2026.05
42.6
SigLIP
Arch=L/16, Data=40.0B
2026.05
41.7
GenLIP
Arch=L/16, Data=8.0B
2026.05
41.5
CLIP
Arch=L/14, Data=12.8B
2026.05
39.8
Feedback
Search any
task
Search any
task