Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DetectRL

Benchmarks

Task NameDataset NameSOTA ResultTrend
AI-generated text detectionDetectRL Multi-Domain
AUROC96.19
27
AI-generated text detectionDetectRL Multi-LLM
AUROC97.17
27
LLM-generated text detectionDetectRL Out-of-Domain Multi-Topic 1.0 (test)
Average Detection Score91.1
18
LLM-generated text detectionDetectRL Out-of-Domain Multi-LLM 1.0 (test)
Average Performance Score90.6
16
Machine-generated text detectionDetectRL Multi-LLM (in-domain)
Score (GPT-3.5)99.7
14
Machine-generated text detectionDetectRL Multi-Topic (in-domain)
arXiv Score1
14
LLM-generated text detectionDetectRL
AUROC (Multi-Domain)97.97
12
Binary AIGC DetectionDetectRL
Accuracy97.2
12
Machine-generated text detectionDetectRL Training Text: Llama-2-70b (test)
Detection Score (Llama-2-70b)90.2
12
Machine-Generated Text DetectionDetectRL (test)
Detection Score (Llama-2-70b)50.56
12
Machine-Generated Text DetectionDetectRL Google-PaLM (train)
TPR@FPR-1% (Llama-2-70b)50.58
12
Machine-Generated Text DetectionDetectRL Training Text: ChatGPT
TPR@FPR-1% (Llama-2-70b)50.66
12
Machine-generated text detectionDetectRL-arXiv cross-source corruption (test)
AUROC93.86
9
Machine-generated text detectionDetectRL Google-PaLM
AUROC77.8
6
Machine-generated text detectionDetectRL Llama-2-70b
AUROC0.8122
6
LLM-generated text detectionDetectRL Word-level perturbation (OOD)
AUROC99.59
3
LLM-generated text detectionDetectRL Character-level perturbation (OOD)
AUROC99.8
3
LLM-generated text detectionDetectRL Yelp_review domain
AUROC90.18
3
LLM-generated text detectionDetectRL WritingPrompts domain
AUROC81.32
3
LLM-generated text detectionDetectRL Mixed-domain Source: GPT-3.5-turbo (test)
AUROC80.57
3
LLM-generated text detectionDetectRL Mixed-domain Source: Claude-instant (test)
AUROC80.92
3
LLM-generated text detectionDetectRL Mixed-domain Source: Llama-2-70B (test)
AUROC94.7
3
Machine-Generated Text DetectionDetectRL Back-translation paraphrase (test)
AUROC99.79
3
Machine-Generated Text DetectionDetectRL DIPPER paraphrase (test)
AUROC99.45
3
Machine-Generated Text DetectionDetectRL Polish paraphrase (test)
AUROC99.21
3
Showing 25 of 26 rows