Theorem Proving on ProofNet (test)

47.3 Pass@32

LongCat-Flash-Prover

[Chart omitted; x-axis date: Mar 22, 2026]
Updated 25d ago
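
This page does not say how the Pass@32 figure is computed. As a minimal sketch, assuming the widely used unbiased pass@k estimator (1 - C(n-c, k) / C(n, k), averaged over problems), the calculation could look like the following. The function names `pass_at_k` and `benchmark_pass_at_k` and the `(n, c)` input layout are hypothetical and are not taken from this leaderboard.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: proof attempts sampled, c: attempts that were verified, k: attempt budget.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the chance that a random size-k subset
    # of the n attempts contains no verified proof.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results, k):
    """Mean pass@k in percent over (n, c) pairs, one pair per problem."""
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

With exactly 32 attempts per problem, this reduces to the fraction of problems that have at least one verified proof, so `benchmark_pass_at_k([(32, 0), (32, 3)], 32)` gives 50.0.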

Evaluation Results

Method  Links
2026.03   47.3    -       -       -       -       -
2026.03   36.1    -       -       -       -       -
2026.03   35      -       -       -       -       -
2026.03   30.5    -       -       -       -       -
2026.03   23.7    -       -       -       -       -
2026.03   23      -       -       -       -       -
2026.03   22      -       -       -       -       -
2026.03   22      -       -       -       -       -
2026.03   20.4    -       -       -       -       -
2026.03   19.9    -       -       -       -       -
2026.03   19.9    -       -       -       -       -
2026.03   18.3    -       -       -       -       -
2026.03   16.7    -       -       -       -       -
2026.03   13.4    -       -       -       -       -
2026.03   11.8    -       -       -       -       -
2025.04   -       9.6     -       -       -       -
2025.04   -       11.3    -       -       -       -
2025.04   -       10.17   2       -       -       -
2025.04   -       12.8    2       1       31.25   -
2025.04   -       13.56   2.4     -       -       -
2025.04   -       14.12   1.8     0.75    21.74   -
2025.04   -       11.86   -       -       -       -
2025.04   -       11.86   -       -       -       -
2025.04   -       13.56   1.83    -       -       -
2025.04   -       14.69   1.83    1       28.57   -
2025.04   -       13.56   2       -       -       -
2025.04   -       15.25   1.67    0.84    26.09   -
2026.03   -       -       -       -       -       37.1
2026.03   -       -       -       -       -       26.9
2026.03   -       -       -       -       -       18.2
2026.03   -       -       -       -       -       25.2
2026.03   -       -       -       -       -       51.1
2026.03   -       -       -       -       -       52.2