Hierarchical Neural Story Generation

About

We explore story generation: creative systems that can build coherent and fluent passages of text about a topic. We collect a large dataset of 300K human-written stories paired with writing prompts from an online forum. Our dataset enables hierarchical story generation, where the model first generates a premise, and then transforms it into a passage of text. We gain further improvements with a novel form of model fusion that improves the relevance of the story to the prompt, and adding a new gated multi-scale self-attention mechanism to model long-range context. Experiments show large improvements over strong baselines on both automated and human evaluations. Human judges prefer stories generated by our approach to those from a strong non-hierarchical model by a factor of two to one.

Angela Fan, Mike Lewis, Yann Dauphin• 2018

Related benchmarks

Task	Dataset	Result
Code Generation	HumanEval	--	1043
Mathematical Reasoning	GSM8K (test)	Accuracy81.43	954
Visual Question Answering	ChartQA	Accuracy80.78	519
Arithmetic Reasoning	GSM8K	--	272
Question Answering	GPQA	Accuracy35.35	258
Visual Perception	BLINK	Accuracy39.77	241
Question Answering	CommonsenseQA	Accuracy82.75	150
Scientific Reasoning	GPQA Main	Accuracy28.35	101
Code Generation	HumanEval	Accuracy (%)54.88	77
Multimodal Understanding	MMMU	Accuracy47.78	76

Showing 10 of 60 rows

Other info

Follow for update

@wizwand_team Discord