Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ProofW

Benchmarks

Task NameDataset NameSOTA ResultTrend
Logical ReasoningProofW
Accuracy83.7
80
Logical ReasoningProofW
Accuracy77.5
11
Showing 2 of 2 rows