Multimodal Question Answering on ScienceQA (test)

Accuracy: 92.53

LLaVa + GPT-4 (judge)

[Chart: accuracy over time, Jul 3, 2023 to Feb 28, 2026]
Updated 4d ago

Evaluation Results

Date     Accuracy
2023.07  92.53  -      91.56  96.74  91.09  90.62  88.99  93.52  92.73  92.16
2023.07  91.68  -      95.91  82     90.82  95.26  88.8   92.89  92.44  90.31
2023.12  91.2   88.6   -      -      -      -      -      -      -      -
2023.07  90.92  -      90.36  95.95  88     89.49  88     90.66  90.93  90.9
2023.07  90.03  -      89.3   95.61  87     93.08  86.67  91.75  84.37  91.3
2023.12  89.8   85.8   -      -      -      -      -      -      -      -
2023.12  89.2   84.5   -      -      -      -      -      -      -      -
2023.07  88.4   -      90.23  84.97  87.48  89.6   87.5   88.1   91.59  82.42
2023.07  86.54  -      89.83  74.13  89.82  88.27  77.64  92.13  88.03  83.72
2023.07  86.11  -      84.5   94.15  82.91  88.35  83.64  88.74  85.05  85.6
2023.07  85.61  -      84.36  92.23  82.81  89.56  81.26  88.29  81.28  86.03
2023.07  85.19  -      84.37  88.3   84.36  83.72  80.32  86.9   85.83  84.05
2023.07  84.91  -      87.52  77.17  85.82  87.88  82.9   86.83  84.65  85.37
2023.07  83.99  -      85.48  72.44  90.27  82.65  71.49  92.89  86.66  79.04
2023.07  79.93  -      81.62  70.64  84     79.77  70.8   86.62  81.86  76.53
2026.02  79.84  -      -      -      -      -      -      -      -      -
2026.02  79.84  -      -      -      -      -      -      -      -      -
2026.02  79.84  -      -      -      -      -      -      -      -      -
2026.02  79.64  -      -      -      -      -      -      -      -      -
2026.02  79.64  -      -      -      -      -      -      -      -      -
2026.02  79.24  -      -      -      -      -      -      -      -      -
2026.02  79.24  -      -      -      -      -      -      -      -      -
2023.07  78.31  -      78.82  70.98  83.18  77.37  67.92  86.13  80.72  74.03
2026.02  78.24  -      -      -      -      -      -      -      -      -
2026.02  78.24  -      -      -      -      -      -      -      -      -
2026.02  78.04  -      -      -      -      -      -      -      -      -
2026.02  78.04  -      -      -      -      -      -      -      -      -
2026.02  78.04  -      -      -      -      -      -      -      -      -
2026.02  77.45  -      -      -      -      -      -      -      -      -
2026.02  76.45  -      -      -      -      -      -      -      -      -
2026.02  76.25  -      -      -      -      -      -      -      -      -
2026.02  76.25  -      -      -      -      -      -      -      -      -
2026.02  75.25  -      -      -      -      -      -      -      -      -
2023.07  75.17  -      75.44  70.87  78.09  74.68  67.43  79.93  78.23  69.68
2026.02  74.25  -      -      -      -      -      -      -      -      -
2026.02  74.25  -      -      -      -      -      -      -      -      -
2026.02  74.25  -      -      -      -      -      -      -      -      -
2023.07  74.11  -      71     76.04  78.91  66.42  66.53  81.81  77.06  68.82
2023.07  74.04  -      75.04  66.59  78     74.24  65.74  79.58  76.36  69.87
2026.02  73.85  -      -      -      -      -      -      -      -      -
2026.02  73.65  -      -      -      -      -      -      -      -      -
2026.02  72.85  -      -      -      -      -      -      -      -      -
2026.02  72.85  -      -      -      -      -      -      -      -      -
2026.02  72.85  -      -      -      -      -      -      -      -      -
2026.02  72.26  -      -      -      -      -      -      -      -      -
2026.02  72.26  -      -      -      -      -      -      -      -      -
2026.02  71.06  -      -      -      -      -      -      -      -      -
2026.02  71.06  -      -      -      -      -      -      -      -      -
2026.02  70.46  -      -      -      -      -      -      -      -      -
2026.02  70.46  -      -      -      -      -      -      -      -      -
2026.02  70.46  -      -      -      -      -      -      -      -      -
2026.02  70.26  -      -      -      -      -      -      -      -      -
2023.07  70.12  -      68.16  69.18  74.91  63.78  61.38  77.84  72.98  65
2026.02  70.06  -      -      -      -      -      -      -      -      -
2026.02  69.86  -      -      -      -      -      -      -      -      -
2026.02  69.86  -      -      -      -      -      -      -      -      -
2026.02  69.66  -      -      -      -      -      -      -      -      -
2026.02  67.81  -      -      -      -      -      -      -      -      -
2026.02  67.27  -      -      -      -      -      -      -      -      -
2026.02  66.27  -      -      -      -      -      -      -      -      -
2026.02  63.52  -      -      -      -      -      -      -      -      -
2026.02  62.28  -      -      -      -      -      -      -      -      -
2026.02  61.88  -      -      -      -      -      -      -      -      -
2026.02  59.28  -      -      -      -      -      -      -      -      -
2023.07  39.83  -      40.28  46.13  29.25  47.45  40.08  33.66  39.35  40.67