Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Expert-level Science Reasoning on GPQA

18.81LLMcritic Calls

VecCISC + HAC

10.2312.457514.68516.9125May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2026.05
18.81-5.94
2026.05
17.95-10.24
2026.05
16.41-17.93
2026.05
16-20.02
2026.05
10.56-47.19