Story

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Creative Writing | Story | Semantic Diversity: 38.6 | 20 |
| Story generation | Story | Diversity: 8.36 | 19 |
| Single change-point detection | Story | WD: 0.207 | 12 |
| Open-ended Text Generation | Story (test) | Diversity (DIV): 0.96 | 12 |
| Machine Text Detection | Story | Rewrite AUC (Claude 3.5): 0.998 | 11 |
| Multiple change-point detection | Story dataset GPT-5-mini K=5 | WD: 0.44 | 6 |
| Text Generation | Story | Coherence Win Rate: 63.6 | 4 |