Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ProverQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Deductive logical reasoningProverQA hard (test)
Error Rate0
12
Deductive logical reasoningProverQA OOD hard subset 500 records (test)
Error Rate-
0
Showing 2 of 2 rows