Hierarchical Neural Story Generation
About
We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval | -- | 1036 | |
| Mathematical Reasoning | GSM8K (test) | Accuracy81.43 | 900 | |
| Question Answering | GPQA | Accuracy35.35 | 258 | |
| Arithmetic Reasoning | GSM8K | -- | 173 | |
| Question Answering | CommonsenseQA | Accuracy82.75 | 148 | |
| Code Generation | HumanEval | Accuracy (%)54.88 | 77 | |
| Scientific Reasoning | GPQA Main | Accuracy28.35 | 67 | |
| Mathematical Reasoning | GSM8K (test) | Exact Match Accuracy (GSM8K Test)95.83 | 60 | |
| Mathematics Problem Solving | MATH500 (test) | Exact Match Accuracy61.2 | 60 | |
| Mathematical Reasoning | MATH 500 | Exact Match60 | 60 |