Hierarchical Neural Story Generation
About
We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Code Generation | HumanEval | -- | 1043 | |
| Mathematical Reasoning | GSM8K (test) | Accuracy81.43 | 954 | |
| Visual Question Answering | ChartQA | Accuracy80.78 | 519 | |
| Arithmetic Reasoning | GSM8K | -- | 272 | |
| Question Answering | GPQA | Accuracy35.35 | 258 | |
| Visual Perception | BLINK | Accuracy39.77 | 241 | |
| Question Answering | CommonsenseQA | Accuracy82.75 | 150 | |
| Scientific Reasoning | GPQA Main | Accuracy28.35 | 101 | |
| Code Generation | HumanEval | Accuracy (%)54.88 | 77 | |
| Multimodal Understanding | MMMU | Accuracy47.78 | 76 |