
Mathematical Reasoning on OlympiadBench (pass@1, pass@5)

Pass@1: 0.1132 (Base Model)

Updated 1mo ago

Evaluation Results

Method	Date	Pass@1	Pass@5
Base Model	2025.10	0.1132	0.1956
FA	2025.10	0.1121	0.2021
CAA	2025.10	0.1076	0.2047
ToT	2025.10	0.0919	0.1836
RS	2025.10	0.0642	0.1083
STaR	2025.10	0.0582	0.1062
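
This page does not say how the pass@1 and pass@5 scores were computed. As a rough sketch, one common choice is the unbiased pass@k estimator: draw n samples per problem, count the c correct ones, and estimate the chance that at least one of k randomly chosen samples is correct. The function names and the sample counts below are illustrative assumptions, not details taken from this leaderboard.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: total samples drawn, c: number of correct samples, k: budget.
    Assumes k <= n.
    """
    if n - c < k:
        # Fewer than k incorrect samples: every size-k subset has a correct one.
        return 1.0
    # 1 minus the probability that all k chosen samples are incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems; results holds (n, c) pairs (hypothetical input)."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


if __name__ == "__main__":
    # Made-up counts for three problems, 5 samples each (not real benchmark data).
    demo = [(5, 1), (5, 0), (5, 3)]
    print(mean_pass_at_k(demo, k=1))
    print(mean_pass_at_k(demo, k=5))
```

Under this estimator pass@5 can never be lower than pass@1 for the same samples, which matches the pattern in every row of the table.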