Visual Question Answering on RealWorldQA (test)

79 Accuracy

GPT5 mini

[Chart: accuracy over time, Oct 11, 2024 to May 14, 2026]
Updated 15d ago

Evaluation Results

Date      Accuracy
2026.01   79        -
2026.01   78.4      -
2026.01   77.4      -
2026.01   76        -
2026.01   75.6      -
2024.10   75.4      -
2026.01   74.9      -
2026.01   73.5      -
2026.01   71        -
2025.10   70.58     -
2026.01   70.5      -
2024.10   69.7      -
2026.01   69.4      -
2025.10   68.88     -
2025.12   68.5      -
2026.01   68.4      -
2026.01   68.4      -
2026.01   68.2      -
2026.01   68.2      -
2025.12   68        -
2026.01   67.2      -
2024.10   67.1      -
2025.12   66.3      -
2025.10   65.88     -
2025.12   65.4      -
2026.01   64.4      -
2025.12   64.2      -
2024.10   63.5      -
2025.12   63.2      -
2024.10   62.6      -
2025.12   61.6      -
2026.01   61.4      -
2024.10   59        -
2026.01   57.5      -
2026.01   56.7      -
2026.05   56.6      0
2026.01   56        -
2026.01   55.1      -
2026.05   54.9      5.9
2026.05   53.6      3.9
2026.01   52.9      -
2026.05   52.9      3.7
2026.05   51.7      4.9
2026.05   49.7      6.9
2026.01   41.9      -
2026.05   36.6      19.6
2026.05   27.5      39.8