Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hierarchical Neural Story Generation

About

We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.

Angela Fan, Mike Lewis, Yann Dauphin• 2018

Related benchmarks

TaskDatasetResultRank
Code GenerationHumanEval--
1036
Mathematical ReasoningGSM8K (test)
Accuracy81.43
900
Question AnsweringGPQA
Accuracy35.35
258
Arithmetic ReasoningGSM8K--
173
Question AnsweringCommonsenseQA
Accuracy82.75
148
Code GenerationHumanEval
Accuracy (%)54.88
77
Scientific ReasoningGPQA Main
Accuracy28.35
67
Mathematical ReasoningGSM8K (test)
Exact Match Accuracy (GSM8K Test)95.83
60
Mathematics Problem SolvingMATH500 (test)
Exact Match Accuracy61.2
60
Mathematical ReasoningMATH 500
Exact Match60
60
Showing 10 of 40 rows

Other info

Follow for update