Formal Theorem Proving on STP In-domain
[Chart: Average Token Cost. Highlighted entry: Segment-level, 2,604.24 (May 12, 2026). Available metrics: Average Token Cost, Average Time Cost (s), Proof Success Rate.]
Updated 21d ago
Evaluation Results
| Method | Setting | Date | Average Token Cost | Average Time Cost (s) | Proof Success Rate |
| --- | --- | --- | --- | --- | --- |
| Segment-level | Training data=STP | 2026.05 | 2,604.24 | 41.55 | - |
| Whole-proof-seg | Training data=STP | 2026.05 | 2,906.41 | 50.52 | - |
| Step-level | Training data=STP | 2026.05 | 4,768.51 | 73.5 | - |
| Whole-proof | Training data=STP | 2026.05 | 12,169.05 | 42.67 | - |
| Step-level | Training data=STP, Bas... | 2026.05 | - | - | 97.8 |
| Whole-proof | Training data=STP, Bas... | 2026.05 | - | - | 98.12 |
| Whole-proof-seg | Training data=STP, Bas... | 2026.05 | - | - | 95.72 |
| Segment-level | Training data=STP, Bas... | 2026.05 | - | - | 97.32 |