Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning on GPQA (Error %, ErrorGap %, STP %)

10.82Error Rate (%)

G-PAC

10.789210.997111.20511.4129Jan 30, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
10.82019.6
2026.01
10.86017.07
2026.01
11.24308.73
2026.01
11.34030.54
2026.01
11.57034.21
2026.01
11.59013.78