Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Downstream Accuracy on MATH500
Loading...
79.8
Accuracy
Victim-Trace (oracle)
40.592
50.771
60.95
71.129
Mar 7, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Victim-Trace (oracle)
Surrogate=-, Student M...
2026.03
79.8
Synthesized-Trace
Surrogate=R1-Distill,...
2026.03
71.8
No finetuning
Surrogate=-, Student M...
2026.03
71.2
Victim-Trace (oracle)
Surrogate=-, Student M...
2026.03
64.4
Surrogate-Trace
Surrogate=R1-Distill,...
2026.03
63.2
Answer+Summary
Surrogate=-, Student M...
2026.03
63
Answer-only
Surrogate=-, Student M...
2026.03
61
Synthesized-Trace
Surrogate=R1-Distill,...
2026.03
50.2
Surrogate-Trace
Surrogate=R1-Distill,...
2026.03
48.8
Answer+Summary
Surrogate=-, Student M...
2026.03
45.8
Answer-only
Surrogate=-, Student M...
2026.03
44
No finetuning
Surrogate=-, Student M...
2026.03
42.1
Feedback
Search any
task
Search any
task