Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AMC23 (Length and Retention Analysis)
Loading...
10.9
Full Length
Minimal-core extraction
9.756
10.053
10.35
10.647
May 14, 2026
Full Length
Core Length
CR
RM
Top-3 Mass
Retention
Updated 19d ago
Evaluation Results
Method
Method
Links
Full Length
Core Length
CR
RM
Top-3 Mass
Retention
Minimal-core extraction
Model=GPT-5
2026.05
10.9
5.2
48
52
69
90
Minimal-core extraction
Model=DeepSeek-R1-Dist...
2026.05
10.6
5.5
52
48
65
87
Minimal-core extraction
Model=Qwen3-32B
2026.05
10
5.7
57
43
61
85
Minimal-core extraction
Model=DeepSeek-R1-Dist...
2026.05
9.8
6
61
39
58
81
Feedback
Search any
task
Search any
task