Technical Report on the Pangram AI-Generated Text Classifier

About

We present Pangram Text, a transformer-based neural network trained to distinguish text written by large language models from text written by humans. Pangram Text outperforms zero-shot methods such as DetectGPT as well as leading commercial AI detection tools with over 38 times lower error rates on a comprehensive benchmark comprised of 10 text domains (student writing, creative writing, scientific writing, books, encyclopedias, news, email, scientific papers, short-form Q&A) and 8 open- and closed-source large language models. We propose a training algorithm, hard negative mining with synthetic mirrors, that enables our classifier to achieve orders of magnitude lower false positive rates on high-data domains such as reviews. Finally, we show that Pangram Text is not biased against nonnative English speakers and generalizes to domains and models unseen during training.

Bradley Emi, Max Spero• 2024

Related benchmarks

Task	Dataset	Result
AI-generated text detection	Essay	--	35
AI Text Detection	Abstracts	AUC92.8	7
AI Text Detection	Creative Writing	AUC96.1	7
AI Text Detection	Paper Reviews	AUC0.973	7
AI Text Detection	Product Reviews	AUC93.4	7
AI Text Detection	Multilingual texts CulturaX and Multitude V3	AUC (%)97.5	3

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord