Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Technical Report on the Pangram AI-Generated Text Classifier

About

We present Pangram Text, a transformer-based neural network trained to distinguish text written by large language models from text written by humans. Pangram Text outperforms zero-shot methods such as DetectGPT as well as leading commercial AI detection tools with over 38 times lower error rates on a comprehensive benchmark comprised of 10 text domains (student writing, creative writing, scientific writing, books, encyclopedias, news, email, scientific papers, short-form Q&A) and 8 open- and closed-source large language models. We propose a training algorithm, hard negative mining with synthetic mirrors, that enables our classifier to achieve orders of magnitude lower false positive rates on high-data domains such as reviews. Finally, we show that Pangram Text is not biased against nonnative English speakers and generalizes to domains and models unseen during training.

Bradley Emi, Max Spero• 2024

Related benchmarks

TaskDatasetResultRank
AI-generated text detectionEssay--
8
AI Text DetectionAbstracts
AUC92.8
7
AI Text DetectionCreative Writing
AUC96.1
7
AI Text DetectionPaper Reviews
AUC0.973
7
AI Text DetectionProduct Reviews
AUC93.4
7
AI Text DetectionMultilingual texts CulturaX and Multitude V3
AUC (%)97.5
3
Showing 6 of 6 rows

Other info

Follow for update