Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Theorem Proving on CombiBench

8Proof Length

Claude 4.6 Opus

4.89225.87146.8567.829Apr 29, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
8
2026.04
15.8
2026.04
22.3
2026.04
23.8
2026.04
29.6
2026.04
31.3
2026.04
37.4
2026.04
41.4
2026.04
45.3
2026.04
63.4
2026.04
76.5
2026.04
76.7
2026.04
85.7