| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Conclusion Generation | EntailmentBank (test) | BLEU67 | 26 | |
| Natural Language Inference | EntailmentBank (test) | BLEU54 | 20 | |
| Explanation Refinement | EntailmentBank | Initial Score25.33 | 15 | |
| Explanatory Inference | EntailmentBank | BLEU57 | 12 | |
| Entailment Tree Generation | EntailmentBank Task 3 (Full Unseen) | Leaves F147.1 | 10 | |
| Entailment Tree Generation | EntailmentBank Task 2 (Distractors) | Leaves F190.3 | 6 | |
| Entailment Tree Generation | EntailmentBank Task 1 (No Distractors) | Leaves F1100 | 6 | |
| Entailment tree generation | EntailmentBank (test) | Leaves F145.6 | 5 | |
| Entailment tree generation | EntailmentBank 50 samples (test) | FV100 | 4 | |
| Question Answering | EntailmentBankQA Easy (test) | Answer Accuracy70.8 | 3 | |
| Entailment Reasoning | EntailmentBank | Accuracy81.8 | 2 |