Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on ScienceQA

97.85Average Accuracy

MG2-RAG

71.48678.330585.17592.0195Nov 4, 2025Dec 5, 2025Jan 6, 2026Feb 6, 2026Mar 10, 2026Apr 10, 2026May 12, 2026
Updated 7d ago

Evaluation Results

MethodLinks
2026.04
97.8598.4595.2898.7398.4496.4398.6898.4296.84-
2026.04
97.3498.2795.1697.1897.996.4397.4997.7696.57-
2026.04
9797.7894.6597.0697.9396.2497.0597.3396.43-
2026.04
96.397.693.0595.9697.7395.3596.1796.5495.88-
2026.04
92.993.5394.1290.8192.9894.2590.2794.0390.93-
2026.04
89.2590.8187.9687.0989.9386.6188.9290.6486.75-
2026.04
88.490.2384.9787.4889.687.588.191.5982.42-
2026.04
87.6288.3781.2191.278778.2894.0188.8485.43-
2026.05
87.58---------
2026.04
87.588.7788.8683.8287.3988.0586.0689.9483.12-
2025.11
87.5--------65.2
2026.05
86.83---------
2026.05
85.73---------
2026.05
85.49---------
2026.04
84.7785.6175.9390.2784.474.1792.3385.7982.98-
2026.04
84.5385.1781.3385.8283.4376.189.4186.1281.67-
2026.05
84.39---------
2026.04
83.9985.4872.4490.2782.6571.4992.8986.6679.04-
2025.11
83.4--------240.2
2026.04
83.1684.1575.1487.6482.9973.1889.6984.480.95-
2025.11
82.7--------203.1
2025.11
82.5--------55
2026.05
82.29---------
2026.05
82---------
2025.11
81.7--------172.6
2026.05
81.55---------
2026.05
80.81---------
2025.11
80.6--------205.5
2025.11
80.3--------101.4
2025.11
80.1--------195.7
2026.04
79.9381.6270.648479.7770.886.6281.8676.53-
2025.11
79.9--------200.9
2025.11
79.5--------186.2
2025.11
79.2--------93.8
2025.11
78.8--------97.5
2026.05
78.6---------
2026.04
78.2181.0868.6280.0979.5268.9683.6980.8773.43-
2026.05
77.49---------
2025.11
77--------174.2
2025.11
76.8--------104.3
2026.04
75.0876.7867.0478.0974.0566.1979.7278.08--
2026.05
74.66---------
2025.11
74.4--------182.5
2026.04
73.4773.9866.3778.1871.6564.379.6576.5168.03-
2025.11
72.5--------195.7