GLTR: Statistical Detection and Visualization of Generated Text
About
The rapid improvement of language models has raised the specter of abuse of text generation systems. This progress motivates the development of simple methods for detecting generated text that can be used by and explained to non-experts. We develop GLTR, a tool to support humans in detecting whether a text was generated by a model. GLTR applies a suite of baseline statistical methods that can detect generation artifacts across common sampling schemes. In a human-subjects study, we show that the annotation scheme provided by GLTR improves the human detection-rate of fake text from 54% to 72% without any prior training. GLTR is open-source and publicly deployed, and has already been widely used to detect generated outputs
Sebastian Gehrmann, Hendrik Strobelt, Alexander M. Rush• 2019
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Machine-generated text detection | MGT benchmark Essay | -- | 129 | |
| Machine-generated text detection | MGT benchmark Reuters | -- | 45 | |
| AI-generated text detection | AcademicResearch | AUC95.6 | 36 | |
| Machine-generated text detection | Grover (test) | Accuracy62.26 | 36 | |
| AI-generated text detection | Cross-genre (test) | OA97.5 | 32 | |
| AIGT detection | HC3 PWWS attack, AI to Human (in-domain) | Overall Accuracy97 | 28 | |
| AI-generated text detection | mixed-source AI -> Human GPT-2, GPT-Neo, GPT-J, LLaMa, GPT-3 | Overall Accuracy76.5 | 26 | |
| LLM-generated text detection | EvoBench | LLaMA3 Score68.49 | 26 | |
| AI-generated text detection | LiteratureCreativeWriting | AUC98.4 | 24 | |
| AI-generated text detection | Business | AUC89.9 | 24 |
Showing 10 of 116 rows
...