Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

About

Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a score based on contrasting two closely related language models is highly accurate at separating human-generated and machine-generated text. Based on this mechanism, we propose a novel LLM detector that only requires simple calculations using a pair of pre-trained LLMs. The method, called Binoculars, achieves state-of-the-art accuracy without any training data. It is capable of spotting machine text from a range of modern LLMs without any model-specific modifications. We comprehensively evaluate Binoculars on a number of text sources and in varied situations. Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data.

Abhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova, Hamid Kazemi, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein• 2024

Related benchmarks

TaskDatasetResultRank
AI-generated text detectionAcademicResearch
AUC98.9
36
AI-generated text detectionMedicalText
AUC0.985
24
AI-generated text detectionArtCulture
AUC0.975
24
AI-generated text detectionEducationMaterial
AUC1
24
AI-generated text detectionEntertainment
AUC1
24
AI-generated text detectionEnvironmental
AUC99.7
24
AI-generated text detectionFinance
AUC0.993
24
AI-generated text detectionBusiness
AUC97.8
24
AI-generated text detectionLegalDocument
AUC0.998
24
AI-generated text detectionLiteratureCreativeWriting
AUC99.5
24
Showing 10 of 152 rows
...

Other info

Follow for update