DNA-DetectLLM: Unveiling AI-Generated Text via a DNA-Inspired Mutation-Repair Paradigm
About
The rapid advancement of large language models (LLMs) has blurred the line between AI-generated and human-written text. This progress brings societal risks such as misinformation, authorship ambiguity, and intellectual property concerns, highlighting the urgent need for reliable AI-generated text detection methods. However, recent advances in generative language modeling have resulted in significant overlap between the feature distributions of human-written and AI-generated text, blurring classification boundaries and making accurate detection increasingly challenging. To address the above challenges, we propose a DNA-inspired perspective, leveraging a repair-based process to directly and interpretably capture the intrinsic differences between human-written and AI-generated text. Building on this perspective, we introduce DNA-DetectLLM, a zero-shot detection method for distinguishing AI-generated and human-written text. The method constructs an ideal AI-generated sequence for each input, iteratively repairs non-optimal tokens, and quantifies the cumulative repair effort as an interpretable detection signal. Empirical evaluations demonstrate that our method achieves state-of-the-art detection performance and exhibits strong robustness against various adversarial attacks and input lengths. Specifically, DNA-DetectLLM achieves relative improvements of 5.55% in AUROC and 2.08% in F1 score across multiple public benchmark datasets. Code and data are available at https://github.com/Xiaoweizhu57/DNA-DetectLLM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine-generated text detection | TruthfulQA | TPR@FPR-1% (ChatGLM)70.21 | 54 | |
| Machine-generated text detection | Xsum | AUROC99.35 | 40 | |
| Machine-generated text detection | Essay (test) | GPT4All Score3.01 | 39 | |
| AI-generated text detection | Essay | AUROC (GPT4All)54.7 | 35 | |
| Machine-generated text detection | WritingPrompts | -- | 30 | |
| AI-generated text detection | M4 | AUROC91.74 | 27 | |
| AI-generated text detection | RealDet | AUROC94.48 | 27 | |
| AI-generated text detection | DetectRL Multi-LLM | AUROC89.01 | 27 | |
| AI-generated text detection | DetectRL Multi-Domain | AUROC88.29 | 27 | |
| Machine-generated text detection | MGTEVAL SemEval 2024 human text & Qwen3 machine-generated text 1.0 (test) | Accuracy89.5 | 26 |