SeqXGPT: Sentence-Level AI-Generated Text Detection

About

Widely used large language models (LLMs) can generate human-like content, which raises concerns about their abuse and makes strong AI-generated text (AIGT) detectors important. Existing work considers only document-level AIGT detection. In this paper, we first introduce a sentence-level detection challenge by synthesizing a dataset of documents polished with LLMs, that is, documents that mix sentences written by humans with sentences modified by LLMs. We then propose Sequence X (Check) GPT (SeqXGPT), a novel method that uses log probability lists from white-box LLMs as features for sentence-level AIGT detection. These features are composed like waves in speech processing and cannot be studied by LLMs, so we build SeqXGPT on convolution and self-attention networks. We test it on both sentence-level and document-level detection challenges. Experimental results show that previous methods struggle with sentence-level AIGT detection, while our method not only significantly surpasses baseline methods on both sentence-level and document-level detection but also exhibits strong generalization capabilities.
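To make the phrase "log probability lists" concrete, here is a minimal sketch of how such a list could be computed from a white-box model's raw logits. It is not the authors' implementation. The function names (`log_softmax`, `token_log_probs`, `split_by_sentence`) are hypothetical, and the toy logits stand in for the output of a real LLM. Cutting the list at sentence boundaries is likewise an assumption about how per-token features might be grouped for sentence-level detection.

```python
import math
from typing import List, Sequence


def log_softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable log-softmax over one vocabulary-sized logit vector."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def token_log_probs(
    step_logits: Sequence[Sequence[float]], token_ids: Sequence[int]
) -> List[float]:
    """Return the log probability assigned to each observed token.

    Assumption: step_logits[t] is already aligned so that row t scores
    token_ids[t] (a real causal LM needs a one-position shift first).
    """
    if len(step_logits) != len(token_ids):
        raise ValueError("need exactly one logit row per token")
    return [log_softmax(row)[tok] for row, tok in zip(step_logits, token_ids)]


def split_by_sentence(
    values: Sequence[float], sentence_lengths: Sequence[int]
) -> List[List[float]]:
    """Cut a flat per-token list into one sub-list per sentence (hypothetical grouping)."""
    if sum(sentence_lengths) != len(values):
        raise ValueError("sentence lengths must cover every token")
    out, start = [], 0
    for n in sentence_lengths:
        out.append(list(values[start:start + n]))
        start += n
    return out


if __name__ == "__main__":
    # Toy example: vocabulary of 3 tokens, 4 positions, 2 sentences of 2 tokens each.
    logits = [[2.0, 0.5, 0.1], [0.0, 0.0, 0.0], [0.1, 3.0, 0.2], [1.0, 1.0, 2.0]]
    ids = [0, 2, 1, 2]
    wave = token_log_probs(logits, ids)
    print(split_by_sentence(wave, [2, 2]))
```

In a detector along the lines the abstract describes, lists like `wave` would be the input features to the convolution and self-attention layers. That network is out of scope for this sketch.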

Pengyu Wang, Linyang Li, Ke Ren, Botian Jiang, Dong Zhang, Xipeng Qiu • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| AI-generated text detection | Cross-genre (test) | OA 88 | 32 |
| AIGT detection | HC3 PWWS attack, AI to Human (in-domain) | Overall Accuracy 99.75 | 28 |
| AI-generated text detection | mixed-source AI -> Human GPT-2, GPT-Neo, GPT-J, LLaMa, GPT-3 | Overall Accuracy 96.5 | 26 |
| AI-generated text detection | HC3 (test) | F1 (Overall) 99.33 | 18 |
| Authorship segmentation | MAS (entire corpus) | SBDA @ 0.331.5 | 18 |
| AI-generated text detection | Cross-genre AIGT Overall (test) | OA 86.75 | 14 |
| AIGT detection | HC3 PWWS attack, Human to AI (in-domain) | OA 100 | 14 |
| AIGT detection | HC3 Deep-Word-Bug attack Human to AI (in-domain) | Overall Accuracy 100 | 14 |
| AIGT detection | HC3 Pruthi attack Human to AI (in-domain) | OA 1 | 14 |
| AIGT detection | cross-domain AIGT detection AI -> Human | Overall Accuracy 93 | 14 |
Showing 10 of 21 rows
