Share your thoughts, 1 month free Claude Pro on usSee more

LLM-generated text detection on Xsum, WritingPrompts, and SQuAD generated by GPT-4.1-mini (test)

80.25AUROC

SurpMark

Updated 2mo ago

Evaluation Results

Method	Links
SurpMark 2025.10		80.25
SurpMark 2025.10		78.48
R-Detect 2025.10		71.64
Binoculars 2025.10		71.12
DetectNPR 2025.10		70.83
DetectGPT 2025.10		70.08
Fast-DetectGPT 2025.10		68.32
Lastde++ 2025.10		68.23
LogRank 2025.10		66.8
Likelihood 2025.10		66.77
DetectLRR 2025.10		63.29
FourierGPT 2025.10		63.05
Lastde 2025.10		57.28
DNA-GPT 2025.10		56.71
Entropy 2025.10		38.72