| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Logical Reasoning | ProofWriter (test) | Accuracy92.32 | 36 | |
| Logical Reasoning | ProofWriter | Accuracy98.4 | 32 | |
| Logical Reasoning | ProofWriter | Accuracy99.7 | 24 | |
| Deductive Reasoning | ProofWriter | Pass@197.4 | 18 | |
| Explanation Refinement | ProofWriter | Initial Score92 | 15 | |
| Reasoning | ProofWriter | Accuracy65 | 14 | |
| Logical Reasoning | ProofWriter (held-out) | Performance0.5483 | 14 | |
| Deductive logical reasoning | ProofWriter (test) | ExcRate100 | 12 | |
| Deductive Reasoning | ProofWriter | Calibrated Accuracy92.1 | 8 | |
| Deductive logical reasoning | ProofWriter 600 records (test) | Exc. Rate- | 0 |