| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| General NLP Evaluation | Natural Language Benchmarks Aggregate | Average Score: 62.31 | 30 |
| Constrained Text Generation | Natural Language Benchmarks Word Length constraint | Saturation: 95 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Between (ubd.) constraint | Saturation: 93.8 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Between-n constraint | Saturation: 85.5 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Appearance constraint | Saturation: 92.5 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Suffix constraint | Saturation: 96.8 | 3 |
| Constrained Text Generation | Natural Language Benchmarks Prefix constraint | Saturation: 95.7 | 3 |