
RAID

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| AI-generated text detection | RAID (test) | TP @ 20% Error Threshold: 78.4 | 42 |
| Machine-text detection | RAID-MovieReviews | AUROC: 0.96 | 21 |
| Machine-text detection | RAID ArXiv | AUROC: 99 | 21 |
| Machine Text Detection | RAID | AUC: 0.954 | 10 |
| LLM-generated text detection | RAID Wikipedia-related samples | GPT-4 Performance Score: 99.35 | 8 |
| LLM-generated text detection | RAID Wikipedia Paraphrased Phi-4 | ROC AUC: 0.8675 | 8 |
| LLM-generated text detection | RAID Wikipedia Paraphrased Grok-3-mini | ROC AUC: 0.8906 | 8 |
| LLM-generated text detection | RAID Wikipedia Paraphrased DeepSeek-V3-0324 | ROC AUC: 0.8926 | 8 |
| LLM-generated text detection | RAID Wikipedia Paraphrased GPT-4.1 | ROC AUC: 0.9173 | 8 |
| LLM-generated text detection | RAID Wikipedia Paraphrased GPT-4o-mini | ROC AUC: 0.9073 | 8 |
| LLM-generated text detection | RAID Wikipedia-related (all) | GPT-4 Score: 89.12 | 8 |
| LLM-generated text detection | RAID Wikipedia-related | GPT-4 Score: 99.94 | 8 |
| Composite Text Detection | RAID Paraphrase and Revise | ROC AUC: 89.84 | 8 |
| Composite Text Detection | RAID Human and Revise | ROC AUC: 0.7907 | 8 |
| Composite Text Detection | RAID Human and Paraphrase | ROC AUC: 0.7866 | 8 |
| LLM-generated text detection | RAID Reviews | ROC AUC: 1 | 8 |
| LLM-generated text detection | RAID Reddit | ROC AUC: 99.92 | 8 |
| LLM-generated text detection | RAID Recipe | ROC AUC: 0.9999 | 8 |
| LLM-generated text detection | RAID Poetry | ROC AUC: 100 | 8 |
| LLM-generated text detection | RAID Abstract | ROC AUC: 100 | 8 |
| LLM-generated text detection | RAID Books | ROC AUC: 100 | 8 |
| LLM-generated text detection | RAID News | ROC AUC: 100 | 8 |
| AI Text Detection | RAID (in-domain) | Accuracy: 95.98 | 5 |
| Machine-Generated Text Detection | RAID | TP @ 20%: 78.5 | 4 |