Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Writing

Benchmarks

Task NameDataset NameSOTA ResultTrend
AI-generated text detectionWriting Generated by Claude3 (test)
AUROC99.5
15
AI-generated text detectionWriting Generated by GPT-4 (test)
AUROC0.9768
15
AI-generated text detectionWriting Generated by ChatGPT (test)
AUROC0.9916
15
Idea GenerationWriting
Ideas Accepted1,000
3
Downstream classificationWriting Unconstrained
F1 Score22.1
3
Downstream classificationWriting Category-controlled top-K
F1 Score14.2
3
Showing 6 of 6 rows