Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WritingPrompts

Benchmarks

Task NameDataset NameSOTA ResultTrend
LGT DetectionWritingPrompts small Fast-DetectGPT benchmark (test)
AUROC99.9
54
LGT DetectionWritingPrompts-small Fast-DetectGPT benchmark
AUROC99.9
54
Machine-generated text detectionWritingPrompts
AUROC1
30
LLM Text AttributionWritingPrompts
TPR (FPR=0.01)100
18
Watermark DetectionWritingPrompts English (test)
TPR@FPR5%98.8
15
Language ModelingWritingPrompts (test)
Diversity (div)88
14
Text generationWritingPrompts
F1 Score22.11
10
Open-ended Text GenerationWritingPrompts
PPL1.76
10
Text GenerationWritingPrompts (WP) (test)
BLEU-10.224
10
Narrative Script RefinementWritingPrompts
Character Development20.64
8
Output Sequence Length PredictionWritingPrompts super-long sequences (> 17k tokens) OOD
MAE195.89
8
LLM-generated text detectionWritingPrompts Fast-DetectGPT
AUROC98.8
5
Story Generation EvaluationWritingPrompts (WP) (test)
Fascination73.88
2
Open-ended Text GenerationWritingPrompts (test)
Same Count85
2
Showing 14 of 14 rows