Scientific Multimodal Reasoning on Galaxy-10

Best accuracy: 57.72 (GPT-5)

[Figure: accuracy chart, Apr 23, 2026]
Updated 1mo ago

Evaluation Results

Accuracy  Date
57.72     2026.04
51.52     2026.04
48.37     2026.04
42.45     2026.04
39.06     2026.04
33.93     2026.04
30.44     2026.04
27.84     2026.04
25.14
15.78