
Visual Question Answering on SimpleVQA (Accuracy)

0.737 Accuracy

Gemini-3-Flash

[Chart: accuracy over time, Nov 7, 2025 to Apr 15, 2026]
Updated 2d ago

Evaluation Results

Date      Accuracy
2026.04   0.737
2026.04   0.702
2026.04   0.688
2026.04   0.686
2026.03   0.676
2026.01   0.6644
2026.03   0.659
2026.04   0.659
2026.01   0.6565
2026.04   0.649
2026.01   0.6486
2026.03   0.636
2026.04   0.633
2026.04   0.63
2026.01   0.6269
2026.04   0.625
2026.03   0.623
2026.03   0.62
2026.01   0.619
2026.04   0.619
2026.03   0.617
2026.04   0.617
2026.04   0.616
2026.02   0.6154
2026.01   0.6061
2026.04   0.605
2026.04   0.604
2026.03   0.594
2025.11   0.594
2026.04   0.594
2026.04   0.594
2026.04   0.593
2026.01   0.5913
2026.03   0.59
2026.04   0.59
2026.03   0.587
2026.04   0.586
2026.03   0.58
2026.03   0.574
2025.11   0.574
2026.04   0.574
2026.04   0.567
2026.03   0.559
2026.04   0.559
2026.04   0.559
2026.03   0.558
2026.04   0.555
2026.04   0.555
2026.04   0.553
2026.01   0.5479
2026.03   0.543
2025.11   0.543
2026.04   0.543
2026.04   0.543
2026.03   0.541
2025.11   0.534
2026.04   0.534
2025.12   0.527
2025.12   0.522
2026.03   0.52
2026.03   0.517
2025.11   0.516
2026.04   0.516
2025.12   0.508
2025.12   0.502
2026.04   0.485
2025.12   0.481
2026.03   0.471
2025.11   0.466
2026.04   0.466
2026.01   0.4373
2026.01   0.4344
2026.04   0.429
2026.01   0.4284
2026.03   0.424
2026.01   0.4235
2026.01   0.4176
2026.01   0.4166
2026.01   0.4126
2026.01   0.4018
2026.04   0.401
2026.03   0.4
2026.03   0.397
2026.03   0.395
2025.12   0.39
2026.03   0.39
2026.03   0.387
2025.11   0.384
2026.03   0.38
2026.03   0.38
2026.03   0.38
2026.03   0.376
2026.03   0.375
2026.03   0.363
2026.03   0.358
2026.04   0.358
2026.03   0.355
2026.03   0.344
2026.04   0.304