Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

READER: Reasoning-Enhanced AI-Generated Text Detection

About

Recent advances in large language models (LLMs) have made it increasingly difficult to distinguish human-written text from AI-generated content. Many existing detectors train supervised neural classifiers that achieve strong in-distribution performance but are often opaque and can degrade substantially under distribution shift. We present READER, a reasoning-enhanced AI text detector that outputs both a human/AI label and a structured rationale describing the evidence for its decision. A key component of our approach is READ, a curated supervision set of rationales and verdicts. We fine-tune an LLM on READ to build READER, which reasons before detecting at inference time. Despite having only 1.5B parameters, READER consistently outperforms existing detectors as well as prompted, high-capacity LLM baselines (GPT-5.2, Gemini-3-Pro, and DeepSeek-V3.2), which are 100 to 1000 times larger in scale.

Pingfan Su, Kai Ye, Shijin Gong, Erhan Xu, Jin Zhu, Giulia Livieri, Chengchun Shi• 2026

Related benchmarks

TaskDatasetResultRank
AI-generated text detectionREAD (test)
Accuracy95.3
55
Out-of-Distribution DetectionOOD Detection Source LLM: Gemini-2.5-Flash (test)
XSUM Score96
19
Out-of-Distribution DetectionOOD Detection Source LLM: Claude-3.5-Haiku (test)
XSUM0.977
19
Out-of-Distribution DetectionOOD Detection Source LLM GPT-4o (test)
XSUM Score98.7
19
Out-of-distribution AI-generated text detectionGrok Out-of-distribution (OOD) unseen domains (Legal, Email, Complaints) 4.1 (test)
Legal Accuracy99
16
Out-of-distribution AI-generated text detectionKimi Out-of-distribution (OOD) K2.5 (unseen domains test)
Legal Accuracy98.7
16
Out-of-distribution AI-generated text detectionMercury Out-of-distribution (OOD) 2 (test unseen domains)
Legal Accuracy95.7
16
Out-of-distribution AI-generated text detectionMistral-Medium Out-of-distribution (OOD) unseen domains 3 (test)
Accuracy (Legal)99
16
AI-generated text detectionCross-lingual and adversarial robustness benchmark Normal
Accuracy93.4
1
AI-generated text detectionCross-lingual and adversarial robustness benchmark Mixed attack
Accuracy92.1
1
Showing 10 of 12 rows

Other info

Follow for update