Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2025 (Pass@32, Memory, Length)

96Pass@32

Vanilla

Updated 1mo ago

Evaluation Results

Method	Links
Vanilla 2025.12		96	-	-
MNL 2025.12		96	10	0
TFGO 2025.12		90	-	696
TFGO 2025.12		90	-	1,452
MNL 2025.12		83	9	60
Vanilla 2025.12		80	-	-
ACE 2025.12		67	163	21,318
MNL 2025.12		30	51	67
Memento 2025.12		27	100	3,100
Vanilla 2025.12		23	-	-
TFGO 2025.12		23	-	703
ACE 2025.12		10	100	7,355