| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| AI-generated text detection | DetectRL Multi-Domain | AUROC96.19 | 27 | |
| AI-generated text detection | DetectRL Multi-LLM | AUROC97.17 | 27 | |
| LLM-generated text detection | DetectRL Out-of-Domain Multi-Topic 1.0 (test) | Average Detection Score91.1 | 18 | |
| LLM-generated text detection | DetectRL Out-of-Domain Multi-LLM 1.0 (test) | Average Performance Score90.6 | 16 | |
| Machine-generated text detection | DetectRL Multi-LLM (in-domain) | Score (GPT-3.5)99.7 | 14 | |
| Machine-generated text detection | DetectRL Multi-Topic (in-domain) | arXiv Score1 | 14 | |
| LLM-generated text detection | DetectRL | AUROC (Multi-Domain)97.97 | 12 | |
| Binary AIGC Detection | DetectRL | Accuracy97.2 | 12 | |
| Machine-generated text detection | DetectRL Training Text: Llama-2-70b (test) | Detection Score (Llama-2-70b)90.2 | 12 | |
| Machine-Generated Text Detection | DetectRL (test) | Detection Score (Llama-2-70b)50.56 | 12 | |
| Machine-Generated Text Detection | DetectRL Google-PaLM (train) | TPR@FPR-1% (Llama-2-70b)50.58 | 12 | |
| Machine-Generated Text Detection | DetectRL Training Text: ChatGPT | TPR@FPR-1% (Llama-2-70b)50.66 | 12 | |
| Machine-generated text detection | DetectRL-arXiv cross-source corruption (test) | AUROC93.86 | 9 | |
| Machine-generated text detection | DetectRL Google-PaLM | AUROC77.8 | 6 | |
| Machine-generated text detection | DetectRL Llama-2-70b | AUROC0.8122 | 6 | |
| LLM-generated text detection | DetectRL Word-level perturbation (OOD) | AUROC99.59 | 3 | |
| LLM-generated text detection | DetectRL Character-level perturbation (OOD) | AUROC99.8 | 3 | |
| LLM-generated text detection | DetectRL Yelp_review domain | AUROC90.18 | 3 | |
| LLM-generated text detection | DetectRL WritingPrompts domain | AUROC81.32 | 3 | |
| LLM-generated text detection | DetectRL Mixed-domain Source: GPT-3.5-turbo (test) | AUROC80.57 | 3 | |
| LLM-generated text detection | DetectRL Mixed-domain Source: Claude-instant (test) | AUROC80.92 | 3 | |
| LLM-generated text detection | DetectRL Mixed-domain Source: Llama-2-70B (test) | AUROC94.7 | 3 | |
| Machine-Generated Text Detection | DetectRL Back-translation paraphrase (test) | AUROC99.79 | 3 | |
| Machine-Generated Text Detection | DetectRL DIPPER paraphrase (test) | AUROC99.45 | 3 | |
| Machine-Generated Text Detection | DetectRL Polish paraphrase (test) | AUROC99.21 | 3 |