Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GLTR: Statistical Detection and Visualization of Generated Text

About

The rapid improvement of language models has raised the specter of abuse of text generation systems. This progress motivates the development of simple methods for detecting generated text that can be used by and explained to non-experts. We develop GLTR, a tool to support humans in detecting whether a text was generated by a model. GLTR applies a suite of baseline statistical methods that can detect generation artifacts across common sampling schemes. In a human-subjects study, we show that the annotation scheme provided by GLTR improves the human detection-rate of fake text from 54% to 72% without any prior training. GLTR is open-source and publicly deployed, and has already been widely used to detect generated outputs

Sebastian Gehrmann, Hendrik Strobelt, Alexander M. Rush• 2019

Related benchmarks

TaskDatasetResultRank
Machine-generated text detectionMGT benchmark Essay--
129
Machine-generated text detectionMGT benchmark Reuters--
45
AI-generated text detectionAcademicResearch
AUC95.6
36
Machine-generated text detectionGrover (test)
Accuracy62.26
36
AI-generated text detectionCross-genre (test)
OA97.5
32
Machine-generated text detectionSQuAD
AUROC71
30
Machine-generated text detectionWritingPrompts
AUROC0.82
30
Machine-generated text detectionXsum
AUROC75
30
Author AttributionGHOSTWRITEBENCH OOD-Author
Macro F1 Score44
28
AIGT detectionHC3 PWWS attack, AI to Human (in-domain)
Overall Accuracy97
28
Showing 10 of 139 rows
...

Other info

Follow for update