Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Question Answering on SuperGPQA*

62.4Accuracy

GPT-5

39.62445.53751.4557.363Aug 26, 2025
Updated 5d ago

Evaluation Results

MethodLinks
2025.08
62.43.8
2025.08
60.40.3
2025.08
60.1-
2025.08
59.54.6
2025.08
58.6-
2025.08
57.18.5
2025.08
54.9-
2025.08
5413.5
2025.08
49.84.6
2025.08
48.6-
2025.08
45.2-
2025.08
40.5-