
Expert Scientific Reasoning on GPQA-D

Full Length: 8.7

Minimal-core extraction

Updated 2mo ago

Evaluation Results

Method	Links	Full Length
Minimal-core extraction 2026.05		8.7	4.4	51	49	66	84
Minimal-core extraction 2026.05		8.3	4.7	57	43	62	82
Minimal-core extraction 2026.05		8.1	4.8	59	41	60	81
Minimal-core extraction 2026.05		7.8	5	64	36	55	76