Automated Theorem Proving on CombiBench Hard Mode
[Chart: Total Solved (Pass@32) and Solution-style Solved (Pass@32); labeled point: DAP, Apr 17, 2026]
Updated 1mo ago
Evaluation Results
| Method | Setting | Date | Total Solved (Pass@32) | Solution-style Solved (Pass@32) |
|---|---|---|---|---|
| DAP | Agent Usage=w/o Agent,... | 2026.04 | 10 | 2 |
| DAP | Agent Usage=w/ Agent,... | 2026.04 | 9 | 1 |
| Kimina-Prover Preview | Pass@k=Pass@32 | 2026.04 | 8 | - |