Raidar: geneRative AI Detection viA Rewriting

About

We find that large language models (LLMs) are more likely to modify human-written text than AI-generated text when tasked with rewriting. This tendency arises because LLMs often perceive AI-generated text as high-quality, leading to fewer modifications. We introduce a method to detect AI-generated content by prompting LLMs to rewrite text and calculating the editing distance of the output. We dubbed our geneRative AI Detection viA Rewriting method Raidar. Raidar significantly improves the F1 detection scores of existing AI content detection models -- both academic and commercial -- across various domains, including News, creative writing, student essays, code, Yelp reviews, and arXiv papers, with gains of up to 29 points. Operating solely on word symbols without high-dimensional features, our method is compatible with black box LLMs, and is inherently robust on new content. Our results illustrate the unique imprint of machine-generated text through the lens of the machines themselves.

Chengzhi Mao, Carl Vondrick, Hao Wang, Junfeng Yang• 2024

Related benchmarks

Task	Dataset	Result
AI-generated text detection	AcademicResearch	AUC82.1	36
Machine-generated text detection	MGTEVAL SemEval 2024 human text & Qwen3 machine-generated text 1.0 (test)	Accuracy98	26
AI-generated text detection	PersonalCommunication	AUC0.782	24
AI-generated text detection	Environmental	AUC89.1	24
AI-generated text detection	LiteratureCreativeWriting	AUC93.2	24
AI-generated text detection	TechnicalWriting	AUC92.7	24
AI-generated text detection	Entertainment	AUC0.927	24
AI-generated text detection	FoodCusine	AUC0.791	24
AI-generated text detection	OnlineContent	AUC78.6	24
AI-generated text detection	TravelTourism	AUC85.1	24

Showing 10 of 69 rows

Other info

Follow for update

@wizwand_team Discord