Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME24 (Full Len., Core Len., CR, RM, Top-3 Mass, Retention)

11.4Full Length Score

Minimal-core extraction

Updated 2mo ago

Evaluation Results

Method	Links
Minimal-core extraction 2026.05		11.4	6	53	47	64	85
Minimal-core extraction 2026.05		10.9	6.3	58	42	59	83
Minimal-core extraction 2026.05		10.5	6.5	62	38	56	81
Minimal-core extraction 2026.05		10.2	6.7	66	34	53	77