Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Question AnsweringGQA
Accuracy83.2
1,425
Visual Question AnsweringGQA
Accuracy74.9
524
Visual Question AnsweringGQA (test-dev)
Accuracy73.9
236
Visual Question AnsweringGQA (test)
Accuracy89.3
197
Visual Question AnsweringGQA
Mean Accuracy65.8
196
Visual Question AnsweringGQA
Score71.3
193
Performance EstimationGQA
MAE0
184
Visual Question AnsweringGQA
Accuracy77.5
155
Visual Question AnsweringGQA
GQA Score64.83
139
Visual ReasoningGQA
Accuracy64.54
93
Visual Question AnsweringGQA (test-std)
Accuracy65.65
74
Visual Question AnsweringGQA
GQA Score63.4
53
Multi-modal Vision-Language UnderstandingGQA
Accuracy64.2
51
Object Hallucination ProbingGQA POPE Popular
Accuracy86.07
49
Object Hallucination ProbingGQA POPE Random
Accuracy (GQA POPE)89.93
42
Object Hallucination ProbingGQA Adversarial
Accuracy82.73
40
Visual Question AnsweringGQA
Accuracy75.77
36
Multi-turn Visual Question AnsweringMT-GQA
Acc165.45
33
Visual Question AnsweringGQA balanced (test-dev)
Accuracy77.4
32
Visual Question AnsweringGQA (val)
Accuracy83.39
32
Visual Question AnsweringGQA
Accuracy61.97
31
Visual Question AnsweringGQA v1.0 (test)
Accuracy63.3
31
Refusal Rate EvaluationGQA
Refusal Rate77
30
Visual Question AnsweringGQA
Accuracy65.4
30
Visual Question AnsweringGQA
Accuracy63.9
29
Showing 25 of 130 rows