Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM-generated content detection on Reuter News
Loading...
3.91
Std Dev of F1 Score
DSIPA (SDP)
3.7908
4.5954
5.4
6.2046
Apr 29, 2026
Std Dev of F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Std Dev of F1 Score
DSIPA (SDP)
variant=SDP
2026.04
3.91
R-Detect
2026.04
4.15
DSIPA (SDC)
variant=SDC
2026.04
4.38
Binoculars
2026.04
4.42
Fast-DetectGPT
2026.04
4.76
LogRank
2026.04
4.85
RoBERTa-large
model size=large
2026.04
5.64
Ghostbuster
2026.04
5.93
RoBERTa-base
model size=base
2026.04
6.03
RAIDAR
2026.04
6.21
DetectGPT
2026.04
6.74
GPT-Zero
2026.04
6.89
Feedback
Search any
task
Search any
task