Compositional Reasoning on HLE

23.1 Accuracy

RCE

Updated 5mo ago

Evaluation Results

Method	Links	Accuracy
RCE 2026.02		23.1
RCE 2026.02		20.2
RCE 2026.02		18.7
Base 2026.02		14.3
DisCO 2026.02		13.8
GRPO 2026.02		12.6
ToT 2026.02		11.9
SC 2026.02		11.4
CoT 2026.02		10.1
Base 2026.02		9.6
Base 2026.02		8.2