Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on AIME 2024 (Pass@32, Mem, Len)

93Pass@32

TFGO

Updated 1mo ago

Evaluation Results

Method	Links
TFGO 2025.12		93	-	696
Vanilla 2025.12		93	-	-
MNL 2025.12		93	10	0
MNL 2025.12		90	9	60
TFGO 2025.12		90	-	1,452
Vanilla 2025.12		87	-	-
ACE 2025.12		80	163	21,318
MNL 2025.12		33	51	67
Vanilla 2025.12		30	-	-
ACE 2025.12		27	100	7,355
TFGO 2025.12		23	-	703
Memento 2025.12		20	100	3,100