Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RAID

Benchmarks

Task NameDataset NameSOTA ResultTrend
AI-generated text detectionRAID (test)
TP @ 20% Error Threshold78.4
42
AI-text detector attack effectivenessRAID (evaluation)
MAGE ASR96
22
Machine-text detectionRAID-MovieReviews
AUROC0.96
21
Machine-text detectionRAID ArXiv
AUROC99
21
AI-generated text detectionRAID GPT-4 classic (test)
TPR @ 0.1% Error30.79
14
AI-generated text detectionRAID ChatGPT Classic (test)
TPR@0.1%70.77
14
AI-generated text detectionRAID
AUROC85.21
14
AI-text detector evasionRAID
ASR (τ=0.5)98.3
10
Machine Text DetectionRAID
AUC0.954
10
AI-text detectionRAID No attack clean (test)
AUROC99.97
9
Machine-generated text detectionRAID (test)
Abstracts FPR0.2
9
AI-text detectionRAID All settings with attack (test)
AUROC99.87
8
LLM-generated text detectionRAID Wikipedia-related samples
GPT-4 Performance Score99.35
8
LLM-generated text detectionRAID Wikipedia Paraphrased Phi-4
ROC AUC0.8675
8
LLM-generated text detectionRAID Wikipedia Paraphrased Grok-3-mini
ROC AUC0.8906
8
LLM-generated text detectionRAID Wikipedia Paraphrased DeepSeek-V3-0324
ROC AUC0.8926
8
LLM-generated text detectionRAID Wikipedia Paraphrased GPT-4.1
ROC AUC0.9173
8
LLM-generated text detectionRAID Wikipedia Paraphrased GPT-4o-mini
ROC AUC0.9073
8
LLM-generated text detectionRAID Wikipedia-related (all)
GPT-4 Score89.12
8
LLM-generated text detectionRAID Wikipedia-related
GPT-4 Score99.94
8
Composite Text DetectionRAID Paraphrase and Revise
ROC AUC89.84
8
Composite Text DetectionRAID Human and Revise
ROC AUC0.7907
8
Composite Text DetectionRAID Human and Paraphrase
ROC AUC0.7866
8
LLM-generated text detectionRAID Reviews
ROC AUC1
8
LLM-generated text detectionRAID Reddit
ROC AUC99.92
8
Showing 25 of 43 rows