Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Multimodal Reasoning on SFE

44.06Accuracy

GPT-5

24.622429.668734.71539.7613Apr 23, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
44.06
2026.04
43.98
2026.04
43.1
43
2026.04
42.58
2026.04
39.98
37.6
2026.04
37.5
2026.04
36.93
2026.04
25.37