Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual Reasoning on Bongard-OpenWorld 500-sample

93.6Overall Score

Gemini 2.0

51.58462.49273.484.308Jan 23, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.01
93.690.896.4
2025.01
92.892.892.8
2025.01
91--
2025.01
8882.893.2
2025.01
87.288.885.6
2025.01
86.885.787.9
2025.01
82.280.583.9
2025.01
8066.493.6
2025.01
66.265.267.2
2025.01
55.158.751.5
2025.01
53.442.864
2025.01
53.29313.1