RAID

Benchmarks

Task Name	Dataset Name	Metric	SOTA Result
AI-generated text detection	RAID (test)	TP @ 20% Error Threshold	78.4	42
AI-text detector attack effectiveness	RAID (evaluation)	MAGE ASR	96	22
Machine-text detection	RAID-MovieReviews	AUROC	0.96	21
Machine-text detection	RAID ArXiv	AUROC	99	21
AI-generated text detection	RAID GPT-4 classic (test)	TPR @ 0.1% Error	30.79	14
AI-generated text detection	RAID ChatGPT Classic (test)	TPR@0.1%	70.77	14
AI-generated text detection	RAID	AUROC	85.21	14
AI-generated Image Detection	RAID (test)	Accuracy	94.1	13
Detection and Calibration	RAID Reddit domain	AUC@1%	77.5	10
Detection and Calibration	RAID News domain	AUC @ 1%	0.93	10
AI-text detector evasion	RAID	ASR (τ=0.5)	98.3	10
Machine Text Detection	RAID	AUC	0.954	10
AI-text detection	RAID No attack clean (test)	AUROC	99.97	9
Machine-generated text detection	RAID (test)	Abstracts FPR	0.2	9
Machine-Generated Text Detection	RAID 1.0 (test)	AUROC (ChatGPT)	99.44	8
AI-text detection	RAID All settings with attack (test)	AUROC	99.87	8
LLM-generated text detection	RAID Wikipedia-related samples	GPT-4 Performance Score	99.35	8
LLM-generated text detection	RAID Wikipedia Paraphrased Phi-4	ROC AUC	0.8675	8
LLM-generated text detection	RAID Wikipedia Paraphrased Grok-3-mini	ROC AUC	0.8906	8
LLM-generated text detection	RAID Wikipedia Paraphrased DeepSeek-V3-0324	ROC AUC	0.8926	8
LLM-generated text detection	RAID Wikipedia Paraphrased GPT-4.1	ROC AUC	0.9173	8
LLM-generated text detection	RAID Wikipedia Paraphrased GPT-4o-mini	ROC AUC	0.9073	8
LLM-generated text detection	RAID Wikipedia-related (all)	GPT-4 Score	89.12	8
LLM-generated text detection	RAID Wikipedia-related	GPT-4 Score	99.94	8
Composite Text Detection	RAID Paraphrase and Revise	ROC AUC	89.84	8

Showing 25 of 48 rows