| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Story Ending Generation | ROCStories (test) | BLEU-157.47 | 43 | |
| Multi-label emotion classification | ROCStories Plutchik emotion labels (test) | Precision67.29 | 11 | |
| Storytelling | RocStories 8:1:1 (test) | BLEU-10.3552 | 10 | |
| Unconditional Language Generation | ROCStories (test) | MAUVE0.951 | 9 | |
| Narrative Salience Detection | ROCStories (test) | Rho0.53 | 8 | |
| Conditional Text Generation | ROCStories (test) | UNION85.31 | 8 | |
| Story Cloze Test | ROCStories Story Cloze Test | Accuracy0.9133 | 8 | |
| Infilling | ROCStories | ROUGE-123.3 | 7 | |
| Story infilling | ROCStories (test) | BLEU-40.031 | 7 | |
| Narrative Incoherence Detection | ROCStories | Accuracy0.7503 | 7 | |
| Story Cloze Test | ROCStories (test) | Accuracy86.5 | 7 | |
| Language Modeling | ROCstories (test) | ELBO5.94 | 6 | |
| Story Generation | ROCStories 2016 | Repetition Score (rep-2)24.26 | 5 | |
| Conditional Text Generation | ROCStories | Grammaticality Win Rate36.2 | 5 | |
| Narrative Incoherence Detection | ROCStories (test) | ACC75.03 | 5 | |
| Machine Reading Comprehension | ROCStories (test) | Accuracy0.883 | 5 | |
| Length Control | ROCStories (test) | Control Success Rate100 | 4 | |
| Syntax Spans Control | ROCStories (test) | Control Score93.8 | 4 | |
| Syntax Tree Control | ROCStories (test) | Ctrl Score86 | 4 | |
| Parts-of-speech Control | ROCStories (test) | Control Score93 | 4 | |
| Reading Comprehension | ROCStories Spring 2016 (test) | Accuracy91.4 | 4 | |
| Story Generation Evaluation | ROCStories (test) | Fascination0.6909 | 2 | |
| Length Control | ROCStories | Control Score- | 0 | |
| Syntax Tree Control | ROCStories | Control Score- | 0 | |
| Parts-of-speech Control | ROCStories | Control Score- | 0 |