
Story

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Creative Writing | Story | Semantic Diversity: 38.6 | 20 |
| Open-ended Text Generation | Story (test) | Diversity (DIV): 0.96 | 12 |
| Machine Text Detection | Story | Rewrite AUC (Claude 3.5): 0.998 | 11 |
| Story generation | Story | Quality: 122.6 | 9 |
| Text Generation | Story | Coherence Win Rate: 63.6 | 4 |