Multimodal Reasoning on ScienceQA

98.45 Natural Science Accuracy

MG2-RAG

[Chart: y-axis ticks 73.0012, 79.6081, 86.215, 92.8219; x-axis label Apr 4, 2026]
Updated 11d ago

Evaluation Results

Method  Links

2026.04  98.45  95.28  98.73  98.44  96.43  98.68  98.42  96.84  97.85
2026.04  98.27  95.16  97.18  97.9   96.43  97.49  97.76  96.57  97.34
2026.04  97.78  94.65  97.06  97.93  96.24  97.05  97.33  96.43  97
2026.04  97.6   93.05  95.96  97.73  95.35  96.17  96.54  95.88  96.3
2026.04  93.53  94.12  90.81  92.98  94.25  90.27  94.03  90.93  92.9
2026.04  90.81  87.96  87.09  89.93  86.61  88.92  90.64  86.75  89.25
2026.04  90.23  84.97  87.48  89.6   87.5   88.1   91.59  82.42  88.4
2026.04  88.77  88.86  83.82  87.39  88.05  86.06  89.94  83.12  87.5
2026.04  88.37  81.21  91.27  87     78.28  94.01  88.84  85.43  87.62
2026.04  85.61  75.93  90.27  84.4   74.17  92.33  85.79  82.98  84.77
2026.04  85.48  72.44  90.27  82.65  71.49  92.89  86.66  79.04  83.99
2026.04  85.17  81.33  85.82  83.43  76.1   89.41  86.12  81.67  84.53
2026.04  84.15  75.14  87.64  82.99  73.18  89.69  84.4   80.95  83.16
2026.04  81.62  70.64  84     79.77  70.8   86.62  81.86  76.53  79.93
2026.04  81.08  68.62  80.09  79.52  68.96  83.69  80.87  73.43  78.21
2026.04  76.78  67.04  78.09  74.05  66.19  79.72  78.08  -      75.08
2026.04  73.98  66.37  78.18  71.65  64.3   79.65  76.51  68.03  73.47