| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Natural Language Inference | NLI adversarial benchmark (test) | Average Score 75.4 | 18 |
| Natural Language Inference | NLI | Accuracy 91.2 | 14 |
| Natural Language Inference | NLI ANLI and HANS (unseen) | ANLI Score 32.4 | 9 |
| Natural Language Inference | NLI domain average | Best Accuracy 87.5 | 8 |
| Natural Language Inference | NLI (test) | Relative CPU Speed 2.89 | 2 |