Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scientific Question Answering on GPQA (Accuracy (%), Δ)

82.4Accuracy

GPT-5

62.6467.7772.978.03Aug 26, 2025
Updated 5d ago

Evaluation Results

MethodLinks
2025.08
82.43.1
2025.08
80.1-
2025.08
79.94.5
2025.08
79.5-0.6
2025.08
79.2-
2025.08
75.4-
2025.08
74.65.2
2025.08
73.910.5
2025.08
69.4-
2025.08
695.2
2025.08
63.8-
2025.08
63.4-