Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Theorem Proving on DeepTheorem

54False Rate

DeepSeek-V3.2-Thinking (Agentic)

52.9659.986774.02Jan 24, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
541818
581614
622222
2026.01
723226
2026.01
764236
762222
2026.01
762016
803632