
Hierarchical Neural Story Generation

About

We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and by adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.
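As a rough illustration of the two-stage idea described above (generate a premise first, then expand it into a passage), the following Python sketch wires two toy generators together. It is only a minimal sketch: the premise list, the sentence templates, and the function names `generate_premise`, `generate_story`, and `hierarchical_generate` are hypothetical stand-ins, not the neural models, fusion mechanism, or code from the paper.

```python
import random
from typing import Tuple

# Hypothetical stand-in data. The paper trains neural sequence models on
# prompt/story pairs; nothing here reproduces those models.
PREMISES = [
    "a lighthouse keeper finds a door in the sea",
    "the last robot on earth learns to paint",
]

TEMPLATES = [
    "It began when {p}.",
    "Nobody believed that {p}.",
    "Years later, people still tell how {p}.",
]


def generate_premise(rng: random.Random) -> str:
    """Stage 1: produce a short premise (here, sampled from a fixed list)."""
    return rng.choice(PREMISES)


def generate_story(premise: str, rng: random.Random, n_sentences: int = 3) -> str:
    """Stage 2: expand the premise into a passage conditioned on it."""
    chosen = rng.sample(TEMPLATES, k=min(n_sentences, len(TEMPLATES)))
    return " ".join(t.format(p=premise) for t in chosen)


def hierarchical_generate(seed: int = 0) -> Tuple[str, str]:
    """Run both stages in order and return (premise, story)."""
    rng = random.Random(seed)
    premise = generate_premise(rng)
    story = generate_story(premise, rng)
    return premise, story


if __name__ == "__main__":
    premise, story = hierarchical_generate(seed=0)
    print("Premise:", premise)
    print("Story:", story)
```

The point of the structure is that the second stage only ever sees the premise produced by the first, so every sentence of the toy story stays tied to it. A non-hierarchical baseline would generate the passage directly, without this intermediate step.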

Angela Fan, Mike Lewis, Yann Dauphin • 2018

Related benchmarks

Task | Dataset | Result | Rank
Code Generation | HumanEval | -- | 1043
Mathematical Reasoning | GSM8K (test) | Accuracy 81.43 | 954
Visual Question Answering | ChartQA | Accuracy 80.78 | 519
Arithmetic Reasoning | GSM8K | -- | 272
Question Answering | GPQA | Accuracy 35.35 | 258
Visual Perception | BLINK | Accuracy 39.77 | 241
Question Answering | CommonsenseQA | Accuracy 82.75 | 150
Scientific Reasoning | GPQA Main | Accuracy 28.35 | 101
Code Generation | HumanEval | Accuracy (%) 54.88 | 77
Multimodal Understanding | MMMU | Accuracy 47.78 | 76
Showing 10 of 60 rows
